speech


Acoustic-to-Articulatory Inversion of Clean Speech Using an MRI-Trained Model

Mar 12, 2026
Via arXiv

Causal Prosody Mediation for Text-to-Speech: Counterfactual Training of Duration, Pitch, and Energy in FastSpeech2

Mar 12, 2026
Via arXiv

Reconstruction of the Vocal Tract from Speech via Phonetic Representations Using MRI Data

Mar 12, 2026
Via arXiv

Silent Speech Interfaces in the Era of Large Language Models: A Comprehensive Taxonomy and Systematic Review

Mar 12, 2026
Via arXiv

Affect Decoding in Phonated and Silent Speech Production from Surface EMG

Mar 12, 2026
Via arXiv

RAF: Relativistic Adversarial Feedback For Universal Speech Synthesis

Mar 12, 2026
Via arXiv

Resurfacing Paralinguistic Awareness in Large Audio Language Models

Mar 12, 2026
Via arXiv

SEMamba++: A General Speech Restoration Framework Leveraging Global, Local, and Periodic Spectral Patterns

Mar 12, 2026
Via arXiv

Dr. SHAP-AV: Decoding Relative Modality Contributions via Shapley Attribution in Audio-Visual Speech Recognition

Mar 12, 2026
Via arXiv

AnimeScore: A Preference-Based Dataset and Framework for Evaluating Anime-Like Speech Style

Mar 12, 2026
Via arXiv