Picture for Yanmin Qian

Yanmin Qian

From Sharpness to Better Generalization for Speech Deepfake Detection

Add code
Jun 13, 2025
Viaarxiv icon

Improving Speech Enhancement with Multi-Metric Supervision from Learned Quality Assessment

Add code
Jun 13, 2025
Viaarxiv icon

DenoiseRotator: Enhance Pruning Robustness for LLMs via Importance Concentration

Add code
May 29, 2025
Viaarxiv icon

Zero-Shot Streaming Text to Speech Synthesis with Transducer and Auto-Regressive Modeling

Add code
May 26, 2025
Viaarxiv icon

BR-ASR: Efficient and Scalable Bias Retrieval Framework for Contextual Biasing ASR in Speech LLM

Add code
May 25, 2025
Viaarxiv icon

Advanced Zero-Shot Text-to-Speech for Background Removal and Preservation with Controllable Masked Speech Prediction

Add code
Feb 11, 2025
Viaarxiv icon

Generalizable Audio Deepfake Detection via Latent Space Refinement and Augmentation

Add code
Jan 24, 2025
Viaarxiv icon

SLIDE: Integrating Speech Language Model with LLM for Spontaneous Spoken Dialogue Generation

Add code
Jan 01, 2025
Figure 1 for SLIDE: Integrating Speech Language Model with LLM for Spontaneous Spoken Dialogue Generation
Figure 2 for SLIDE: Integrating Speech Language Model with LLM for Spontaneous Spoken Dialogue Generation
Figure 3 for SLIDE: Integrating Speech Language Model with LLM for Spontaneous Spoken Dialogue Generation
Figure 4 for SLIDE: Integrating Speech Language Model with LLM for Spontaneous Spoken Dialogue Generation
Viaarxiv icon

Scale This, Not That: Investigating Key Dataset Attributes for Efficient Speech Enhancement Scaling

Add code
Dec 19, 2024
Figure 1 for Scale This, Not That: Investigating Key Dataset Attributes for Efficient Speech Enhancement Scaling
Figure 2 for Scale This, Not That: Investigating Key Dataset Attributes for Efficient Speech Enhancement Scaling
Figure 3 for Scale This, Not That: Investigating Key Dataset Attributes for Efficient Speech Enhancement Scaling
Figure 4 for Scale This, Not That: Investigating Key Dataset Attributes for Efficient Speech Enhancement Scaling
Viaarxiv icon

Memory-Efficient Training for Deep Speaker Embedding Learning in Speaker Verification

Add code
Dec 02, 2024
Viaarxiv icon