Picture for Ting Dang

Ting Dang

PolyBench: A Benchmark for Compositional Reasoning in Polyphonic Audio

Add code
Mar 05, 2026
Viaarxiv icon

LQA: A Lightweight Quantized-Adaptive Framework for Vision-Language Models on the Edge

Add code
Feb 08, 2026
Viaarxiv icon

HoRD: Robust Humanoid Control via History-Conditioned Reinforcement Learning and Online Distillation

Add code
Feb 05, 2026
Viaarxiv icon

Rethinking Perplexity: Revealing the Impact of Input Length on Perplexity Evaluation in LLMs

Add code
Feb 04, 2026
Viaarxiv icon

CoCoEmo: Composable and Controllable Human-Like Emotional TTS via Activation Steering

Add code
Feb 03, 2026
Viaarxiv icon

Adapting Where It Matters: Depth-Aware Adaptation for Efficient Multilingual Speech Recognition in Low-Resource Languages

Add code
Feb 01, 2026
Viaarxiv icon

Decoding Ambiguous Emotions with Test-Time Scaling in Audio-Language Models

Add code
Feb 01, 2026
Viaarxiv icon

Rethinking Large Language Models For Irregular Time Series Classification In Critical Care

Add code
Jan 26, 2026
Viaarxiv icon

Test-Time Adaptation for Speech Emotion Recognition

Add code
Jan 21, 2026
Viaarxiv icon

Scaling Ambiguity: Augmenting Human Annotation in Speech Emotion Recognition with Audio-Language Models

Add code
Jan 21, 2026
Viaarxiv icon