Zhiyong Wang

Intelligent Fish Detection System with Similarity-Aware Transformer

Sep 28, 2024
Via arXiv

DPI-TTS: Directional Patch Interaction for Fast-Converging and Style Temporal Modeling in Text-to-Speech

Sep 18, 2024
Via arXiv

Mixture of Experts Fusion for Fake Audio Detection Using Frozen wav2vec 2.0

Sep 18, 2024
Via arXiv

HiSC4D: Human-centered interaction and 4D Scene Capture in Large-scale Space Using Wearable IMUs and LiDAR

Sep 09, 2024
Via arXiv

Sight View Constraint for Robust Point Cloud Registration

Sep 08, 2024
Via arXiv

RI-MAE: Rotation-Invariant Masked AutoEncoders for Self-Supervised Point Cloud Representation Learning

Aug 31, 2024
Via arXiv

SITransformer: Shared Information-Guided Transformer for Extreme Multimodal Summarization

Aug 29, 2024
Via arXiv

Does Current Deepfake Audio Detection Model Effectively Detect ALM-based Deepfake Audio?

Aug 20, 2024
Via arXiv

EELE: Exploring Efficient and Extensible LoRA Integration in Emotional Text-to-Speech

Aug 20, 2024
Via arXiv

A Noval Feature via Color Quantisation for Fake Audio Detection

Aug 20, 2024
Via arXiv