Yi-Cheng Lin

DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment

Jul 03, 2025
Via arXiv

A correlation-permutation approach for speech-music encoders model merging

Jun 13, 2025
Via arXiv

Multi-Distillation from Speech and Music Representation Models

Jun 08, 2025
Via arXiv

CO-VADA: A Confidence-Oriented Voice Augmentation Debiasing Approach for Fair Speech Emotion Recognition

Jun 06, 2025
Via arXiv

EMO-Debias: Benchmarking Gender Debiasing Techniques in Multi-Label Speech Emotion Recognition

Jun 05, 2025
Via arXiv

Creativity in LLM-based Multi-Agent Systems: A Survey

May 27, 2025
Via arXiv

Meta-PerSER: Few-Shot Listener Personalized Speech Emotion Recognition via Meta-learning

May 22, 2025
Via arXiv

ToxicTone: A Mandarin Audio Dataset Annotated for Toxicity and Toxic Utterance Tonality

May 21, 2025
Via arXiv

Mitigating Subgroup Disparities in Multi-Label Speech Emotion Recognition: A Pseudo-Labeling and Unsupervised Learning Approach

May 21, 2025
Via arXiv

Distilling a speech and music encoder with task arithmetic

May 19, 2025
Via arXiv