Yi-Cheng Lin

May

On the Fallacy of Global Token Perplexity in Spoken Language Model Evaluation

Jan 09, 2026

CantoASR: Prosody-Aware ASR-LALM Collaboration for Low-Resource Cantonese

Nov 06, 2025

Bloodroot: When Watermarking Turns Poisonous For Stealthy Backdoor

Oct 09, 2025

Pseudo2Real: Task Arithmetic for Pseudo-Label Correction in Automatic Speech Recognition

Oct 09, 2025

Do You Hear What I Mean? Quantifying the Instruction-Perception Gap in Instruction-Guided Expressive Text-To-Speech Systems

Sep 18, 2025

How Does Instrumental Music Help SingFake Detection?

Sep 18, 2025

DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment

Jul 03, 2025

A correlation-permutation approach for speech-music encoders model merging

Jun 13, 2025

Multi-Distillation from Speech and Music Representation Models

Jun 08, 2025

CO-VADA: A Confidence-Oriented Voice Augmentation Debiasing Approach for Fair Speech Emotion Recognition

Jun 06, 2025