Picture for Hung-yi Lee

Hung-yi Lee

An Exploration of Mamba for Speech Self-Supervised Models

Add code
Jun 14, 2025
Viaarxiv icon

A correlation-permutation approach for speech-music encoders model merging

Add code
Jun 13, 2025
Viaarxiv icon

Discrete Audio Tokens: More Than a Survey!

Add code
Jun 12, 2025
Viaarxiv icon

Multi-Distillation from Speech and Music Representation Models

Add code
Jun 08, 2025
Viaarxiv icon

Towards Generalized Source Tracing for Codec-Based Deepfake Speech

Add code
Jun 08, 2025
Viaarxiv icon

Reducing Object Hallucination in Large Audio-Language Models via Audio-Aware Decoding

Add code
Jun 08, 2025
Viaarxiv icon

CO-VADA: A Confidence-Oriented Voice Augmentation Debiasing Approach for Fair Speech Emotion Recognition

Add code
Jun 06, 2025
Viaarxiv icon

Audio-Aware Large Language Models as Judges for Speaking Styles

Add code
Jun 06, 2025
Viaarxiv icon

EMO-Debias: Benchmarking Gender Debiasing Techniques in Multi-Label Speech Emotion Recognition

Add code
Jun 05, 2025
Viaarxiv icon

AudioLens: A Closer Look at Auditory Attribute Perception of Large Audio-Language Models

Add code
Jun 05, 2025
Viaarxiv icon