Picture for Shihong Tan

Shihong Tan

DTT-BSR+: A Generative-Regression Cascade for Music Source Restoration

Add code
Jun 23, 2026
Viaarxiv icon

RAIL: Rethinking Auditory Intelligence in Large Audio-Language Models with a CHC-Grounded Benchmark

Add code
Jun 09, 2026
Viaarxiv icon

DTT-BSR: GAN-based DTTNet with RoPE Transformer Enhancement for Music Source Restoration

Add code
Feb 23, 2026
Viaarxiv icon

CoCoEmo: Composable and Controllable Human-Like Emotional TTS via Activation Steering

Add code
Feb 03, 2026
Viaarxiv icon