Text To Speech Synthesis


How Open is Open TTS? A Practical Evaluation of Open Source TTS Tools for Romanian

Mar 25, 2026

Borderless Long Speech Synthesis

Mar 20, 2026

SNAP: Speaker Nulling for Artifact Projection in Speech Deepfake Detection

Mar 21, 2026

NV-Bench: Benchmark of Nonverbal Vocalization Synthesis for Expressive Text-to-Speech Generation

Mar 16, 2026

MOSS-TTSD: Text to Spoken Dialogue Generation

Mar 20, 2026

Empathetic Motion Generation for Humanoid Educational Robots via Reasoning-Guided Vision-Language-Motion Diffusion Architecture

Mar 19, 2026

Gesture2Speech: How Far Can Hand Movements Shape Expressive Speech?

Mar 20, 2026

Speech-Worthy Alignment for Japanese SpeechLLMs via Direct Preference Optimization

Mar 13, 2026

CAST-TTS: A Simple Cross-Attention Framework for Unified Timbre Control in TTS

Mar 17, 2026

MamTra: A Hybrid Mamba-Transformer Backbone for Speech Synthesis

Mar 12, 2026