Picture for Hung-yi Lee

Hung-yi Lee

Multi-Distillation from Speech and Music Representation Models

Add code
Jun 08, 2025
Viaarxiv icon

Towards Generalized Source Tracing for Codec-Based Deepfake Speech

Add code
Jun 08, 2025
Viaarxiv icon

Reducing Object Hallucination in Large Audio-Language Models via Audio-Aware Decoding

Add code
Jun 08, 2025
Viaarxiv icon

Audio-Aware Large Language Models as Judges for Speaking Styles

Add code
Jun 06, 2025
Viaarxiv icon

CO-VADA: A Confidence-Oriented Voice Augmentation Debiasing Approach for Fair Speech Emotion Recognition

Add code
Jun 06, 2025
Viaarxiv icon

AudioLens: A Closer Look at Auditory Attribute Perception of Large Audio-Language Models

Add code
Jun 05, 2025
Viaarxiv icon

EMO-Debias: Benchmarking Gender Debiasing Techniques in Multi-Label Speech Emotion Recognition

Add code
Jun 05, 2025
Viaarxiv icon

Creativity in LLM-based Multi-Agent Systems: A Survey

Add code
May 27, 2025
Viaarxiv icon

From Alignment to Advancement: Bootstrapping Audio-Language Alignment with Synthetic Data

Add code
May 26, 2025
Viaarxiv icon

Speech-IFEval: Evaluating Instruction-Following and Quantifying Catastrophic Forgetting in Speech-Aware Language Models

Add code
May 25, 2025
Viaarxiv icon