Picture for Hung-yi Lee

Hung-yi Lee

Walking Through Uncertainty: An Empirical Study of Uncertainty Estimation for Audio-Aware Large Language Models

Add code
Apr 28, 2026
Viaarxiv icon

All That Glitters Is Not Audio: Rethinking Text Priors and Audio Reliance in Audio-Language Evaluation

Add code
Apr 27, 2026
Viaarxiv icon

MoVE: Translating Laughter and Tears via Mixture of Vocalization Experts in Speech-to-Speech Translation

Add code
Apr 19, 2026
Viaarxiv icon

ASPIRin: Action Space Projection for Interactivity-Optimized Reinforcement Learning in Full-Duplex Speech Language Models

Add code
Apr 11, 2026
Viaarxiv icon

Joint Fullband-Subband Modeling for High-Resolution SingFake Detection

Add code
Apr 06, 2026
Viaarxiv icon

Full-Duplex-Bench-v3: Benchmarking Tool Use for Full-Duplex Voice Agents Under Real-World Disfluency

Add code
Apr 06, 2026
Viaarxiv icon

TaigiSpeech: A Low-Resource Real-World Speech Intent Dataset and Preliminary Results with Scalable Data Mining In-the-Wild

Add code
Mar 23, 2026
Viaarxiv icon

TiCo: Time-Controllable Training for Spoken Dialogue Models

Add code
Mar 23, 2026
Viaarxiv icon

The Binding Effect: Analyzing How Multi-Dimensional Cues Form Gender Bias in Instruction TTS

Add code
Mar 21, 2026
Viaarxiv icon

How Auditory Knowledge in LLM Backbones Shapes Audio Language Models: A Holistic Evaluation

Add code
Mar 19, 2026
Viaarxiv icon