Picture for Yanmin Qian

Yanmin Qian

Exploring Self-Supervised Audio Models for Generalized Anomalous Sound Detection

Add code
Aug 17, 2025
Viaarxiv icon

FISHER: A Foundation Model for Multi-Modal Industrial Signal Comprehensive Representation

Add code
Jul 22, 2025
Viaarxiv icon

From Sharpness to Better Generalization for Speech Deepfake Detection

Add code
Jun 13, 2025
Viaarxiv icon

Improving Speech Enhancement with Multi-Metric Supervision from Learned Quality Assessment

Add code
Jun 13, 2025
Viaarxiv icon

DenoiseRotator: Enhance Pruning Robustness for LLMs via Importance Concentration

Add code
May 29, 2025
Viaarxiv icon

Zero-Shot Streaming Text to Speech Synthesis with Transducer and Auto-Regressive Modeling

Add code
May 26, 2025
Viaarxiv icon

BR-ASR: Efficient and Scalable Bias Retrieval Framework for Contextual Biasing ASR in Speech LLM

Add code
May 25, 2025
Viaarxiv icon

Advanced Zero-Shot Text-to-Speech for Background Removal and Preservation with Controllable Masked Speech Prediction

Add code
Feb 11, 2025
Figure 1 for Advanced Zero-Shot Text-to-Speech for Background Removal and Preservation with Controllable Masked Speech Prediction
Figure 2 for Advanced Zero-Shot Text-to-Speech for Background Removal and Preservation with Controllable Masked Speech Prediction
Figure 3 for Advanced Zero-Shot Text-to-Speech for Background Removal and Preservation with Controllable Masked Speech Prediction
Figure 4 for Advanced Zero-Shot Text-to-Speech for Background Removal and Preservation with Controllable Masked Speech Prediction
Viaarxiv icon

Generalizable Audio Deepfake Detection via Latent Space Refinement and Augmentation

Add code
Jan 24, 2025
Viaarxiv icon

SLIDE: Integrating Speech Language Model with LLM for Spontaneous Spoken Dialogue Generation

Add code
Jan 01, 2025
Figure 1 for SLIDE: Integrating Speech Language Model with LLM for Spontaneous Spoken Dialogue Generation
Figure 2 for SLIDE: Integrating Speech Language Model with LLM for Spontaneous Spoken Dialogue Generation
Figure 3 for SLIDE: Integrating Speech Language Model with LLM for Spontaneous Spoken Dialogue Generation
Figure 4 for SLIDE: Integrating Speech Language Model with LLM for Spontaneous Spoken Dialogue Generation
Viaarxiv icon