speech


Acoustic-to-Articulatory Inversion of Clean Speech Using an MRI-Trained Model

Add code
Mar 12, 2026
Viaarxiv icon

Reconstruction of the Vocal Tract from Speech via Phonetic Representations Using MRI Data

Add code
Mar 12, 2026
Viaarxiv icon

Affect Decoding in Phonated and Silent Speech Production from Surface EMG

Add code
Mar 12, 2026
Viaarxiv icon

Resurfacing Paralinguistic Awareness in Large Audio Language Models

Add code
Mar 12, 2026
Viaarxiv icon

SEMamba++: A General Speech Restoration Framework Leveraging Global, Local, and Periodic Spectral Patterns

Add code
Mar 12, 2026
Viaarxiv icon

Dr. SHAP-AV: Decoding Relative Modality Contributions via Shapley Attribution in Audio-Visual Speech Recognition

Add code
Mar 12, 2026
Viaarxiv icon

Streaming Translation and Transcription Through Speech-to-Text Causal Alignment

Add code
Mar 12, 2026
Viaarxiv icon

One Supervisor, Many Modalities: Adaptive Tool Orchestration for Autonomous Queries

Add code
Mar 12, 2026
Viaarxiv icon

Multimodal Emotion Recognition via Bi-directional Cross-Attention and Temporal Modeling

Add code
Mar 12, 2026
Viaarxiv icon

In the LLM era, Word Sense Induction remains unsolved

Add code
Mar 12, 2026
Viaarxiv icon