Yanmin Qian

USE: A Unified Model for Universal Sound Separation and Extraction

Dec 24, 2025
Via arXiv

A Data-Centric Approach to Generalizable Speech Deepfake Detection

Dec 24, 2025
Via arXiv

What Does the Speaker Embedding Encode?

Dec 20, 2025
Via arXiv

Exploring Self-Supervised Audio Models for Generalized Anomalous Sound Detection

Aug 17, 2025
Via arXiv

FISHER: A Foundation Model for Multi-Modal Industrial Signal Comprehensive Representation

Jul 22, 2025
Via arXiv

From Sharpness to Better Generalization for Speech Deepfake Detection

Jun 13, 2025
Via arXiv

Improving Speech Enhancement with Multi-Metric Supervision from Learned Quality Assessment

Jun 13, 2025
Via arXiv

DenoiseRotator: Enhance Pruning Robustness for LLMs via Importance Concentration

May 29, 2025
Via arXiv

Zero-Shot Streaming Text to Speech Synthesis with Transducer and Auto-Regressive Modeling

May 26, 2025
Via arXiv

BR-ASR: Efficient and Scalable Bias Retrieval Framework for Contextual Biasing ASR in Speech LLM

May 25, 2025
Via arXiv