Picture for Yanmin Qian

Yanmin Qian

DenoiseRotator: Enhance Pruning Robustness for LLMs via Importance Concentration

Add code
May 29, 2025
Viaarxiv icon

Zero-Shot Streaming Text to Speech Synthesis with Transducer and Auto-Regressive Modeling

Add code
May 26, 2025
Viaarxiv icon

BR-ASR: Efficient and Scalable Bias Retrieval Framework for Contextual Biasing ASR in Speech LLM

Add code
May 25, 2025
Viaarxiv icon

Advanced Zero-Shot Text-to-Speech for Background Removal and Preservation with Controllable Masked Speech Prediction

Add code
Feb 11, 2025
Viaarxiv icon

Generalizable Audio Deepfake Detection via Latent Space Refinement and Augmentation

Add code
Jan 24, 2025
Viaarxiv icon

SLIDE: Integrating Speech Language Model with LLM for Spontaneous Spoken Dialogue Generation

Add code
Jan 01, 2025
Figure 1 for SLIDE: Integrating Speech Language Model with LLM for Spontaneous Spoken Dialogue Generation
Figure 2 for SLIDE: Integrating Speech Language Model with LLM for Spontaneous Spoken Dialogue Generation
Figure 3 for SLIDE: Integrating Speech Language Model with LLM for Spontaneous Spoken Dialogue Generation
Figure 4 for SLIDE: Integrating Speech Language Model with LLM for Spontaneous Spoken Dialogue Generation
Viaarxiv icon

Scale This, Not That: Investigating Key Dataset Attributes for Efficient Speech Enhancement Scaling

Add code
Dec 19, 2024
Figure 1 for Scale This, Not That: Investigating Key Dataset Attributes for Efficient Speech Enhancement Scaling
Figure 2 for Scale This, Not That: Investigating Key Dataset Attributes for Efficient Speech Enhancement Scaling
Figure 3 for Scale This, Not That: Investigating Key Dataset Attributes for Efficient Speech Enhancement Scaling
Figure 4 for Scale This, Not That: Investigating Key Dataset Attributes for Efficient Speech Enhancement Scaling
Viaarxiv icon

Memory-Efficient Training for Deep Speaker Embedding Learning in Speaker Verification

Add code
Dec 02, 2024
Viaarxiv icon

Prototype and Instance Contrastive Learning for Unsupervised Domain Adaptation in Speaker Verification

Add code
Oct 22, 2024
Figure 1 for Prototype and Instance Contrastive Learning for Unsupervised Domain Adaptation in Speaker Verification
Figure 2 for Prototype and Instance Contrastive Learning for Unsupervised Domain Adaptation in Speaker Verification
Figure 3 for Prototype and Instance Contrastive Learning for Unsupervised Domain Adaptation in Speaker Verification
Figure 4 for Prototype and Instance Contrastive Learning for Unsupervised Domain Adaptation in Speaker Verification
Viaarxiv icon

WeSep: A Scalable and Flexible Toolkit Towards Generalizable Target Speaker Extraction

Add code
Sep 24, 2024
Figure 1 for WeSep: A Scalable and Flexible Toolkit Towards Generalizable Target Speaker Extraction
Figure 2 for WeSep: A Scalable and Flexible Toolkit Towards Generalizable Target Speaker Extraction
Figure 3 for WeSep: A Scalable and Flexible Toolkit Towards Generalizable Target Speaker Extraction
Figure 4 for WeSep: A Scalable and Flexible Toolkit Towards Generalizable Target Speaker Extraction
Viaarxiv icon