Wei-Qiang Zhang

Metadata-Enhanced Speech Emotion Recognition: Augmented Residual Integration and Co-Attention in Two-Stage Fine-Tuning

Dec 30, 2024, via arXiv

Improving Acoustic Scene Classification in Low-Resource Conditions

Dec 30, 2024, via arXiv

Improving Anomalous Sound Detection via Low-Rank Adaptation Fine-Tuning of Pre-Trained Audio Models

Sep 11, 2024, via arXiv

CoopASD: Cooperative Machine Anomalous Sound Detection with Privacy Concerns

Aug 27, 2024, via arXiv

Improving Whisper's Recognition Performance for Under-Represented Language Kazakh Leveraging Unpaired Speech and Text

Aug 10, 2024, via arXiv

AnoPatch: Towards Better Consistency in Machine Anomalous Sound Detection

Jun 17, 2024, via arXiv

GigaSpeech 2: An Evolving, Large-Scale and Multi-domain ASR Corpus for Low-Resource Languages with Automated Crawling, Transcription and Refinement

Jun 17, 2024, via arXiv

Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection

Jun 14, 2024, via arXiv

SpeechColab Leaderboard: An Open-Source Platform for Automatic Speech Recognition Evaluation

Mar 13, 2024, via arXiv

Transferring speech-generic and depression-specific knowledge for Alzheimer's disease detection

Oct 06, 2023, via arXiv