speech


Test-Time Adaptation for Speech Emotion Recognition

Jan 21, 2026
Via arXiv

Beyond Prompting: Efficient and Robust Contextual Biasing for Speech LLMs via Logit-Space Integration (LOGIC)

Jan 21, 2026
Via arXiv

Contrastive Knowledge Distillation for Embedding Refinement in Personalized Speech Enhancement

Jan 21, 2026
Via arXiv

Neural Tracking of Sustained Attention, Attention Switching, and Natural Conversation in Audiovisual Environments using Mobile EEG

Jan 21, 2026
Via arXiv

Fast-ULCNet: A fast and ultra low complexity network for single-channel speech enhancement

Jan 21, 2026
Via arXiv

Performance and Complexity Trade-off Optimization of Speech Models During Training

Jan 21, 2026
Via arXiv

FunCineForge: A Unified Dataset Toolkit and Model for Zero-Shot Movie Dubbing in Diverse Cinematic Scenes

Jan 21, 2026
Via arXiv

Encoding Emotion Through Self-Supervised Eye Movement Reconstruction

Jan 21, 2026
Via arXiv

VCNAC: A Variable-Channel Neural Audio Codec for Mono, Stereo, and Surround Sound

Jan 21, 2026
Via arXiv

Inverse-Hessian Regularization for Continual Learning in ASR

Jan 21, 2026
Via arXiv