Picture for Lei Xie

Lei Xie

Ph.D. Program in Computer Science, The Graduate Center, The City University of New York, New York, New York, USA, Ph.D. Program in Biology and Biochemistry, The Graduate Center, The City University of New York, New York, New York, USA, Department of Computer Science, Hunter College, The City University of New York, New York, New York, USA, Helen and Robert Appel Alzheimers Disease Research Institute, Feil Family Brain and Mind Research Institute, Weill Cornell Medicine, Cornell University, New York, New York, USA

Towards Robust Overlapping Speech Detection: A Speaker-Aware Progressive Approach Using WavLM

Add code
May 29, 2025
Viaarxiv icon

AISHELL-5: The First Open-Source In-Car Multi-Channel Multi-Speaker Speech Dataset for Automatic Speech Diarization and Recognition

Add code
May 29, 2025
Viaarxiv icon

Weakly Supervised Data Refinement and Flexible Sequence Compression for Efficient Thai LLM-based ASR

Add code
May 28, 2025
Viaarxiv icon

Delayed-KD: Delayed Knowledge Distillation based CTC for Low-Latency Streaming ASR

Add code
May 28, 2025
Viaarxiv icon

Multi-Mode Process Control Using Multi-Task Inverse Reinforcement Learning

Add code
May 27, 2025
Viaarxiv icon

FlowSE: Efficient and High-Quality Speech Enhancement via Flow Matching

Add code
May 26, 2025
Viaarxiv icon

Cross-Sequence Semi-Supervised Learning for Multi-Parametric MRI-Based Visual Pathway Delineation

Add code
May 26, 2025
Viaarxiv icon

Selective Invocation for Multilingual ASR: A Cost-effective Approach Adapting to Speech Recognition Difficulty

Add code
May 22, 2025
Viaarxiv icon

MM-MovieDubber: Towards Multi-Modal Learning for Multi-Modal Movie Dubbing

Add code
May 22, 2025
Viaarxiv icon

EASY: Emotion-aware Speaker Anonymization via Factorized Distillation

Add code
May 21, 2025
Viaarxiv icon