Speech Synthesis


Speech synthesis is the process of generating artificial speech from text using computer algorithms.

T5Gemma-TTS Technical Report

Apr 02, 2026
Via arXiv

TRACE: Training-Free Partial Audio Deepfake Detection via Embedding Trajectory Analysis of Speech Foundation Models

Apr 01, 2026
Via arXiv

Cinematic Audio Source Separation Using Visual Cues

Mar 27, 2026
Via arXiv

How Open is Open TTS? A Practical Evaluation of Open Source TTS Tools for Romanian

Mar 25, 2026
Via arXiv

InterDyad: Interactive Dyadic Speech-to-Video Generation by Querying Intermediate Visual Guidance

Mar 24, 2026
Via arXiv

Once-for-All Channel Mixers (HYPERTINYPW): Generative Compression for TinyML

Mar 26, 2026
Via arXiv

Borderless Long Speech Synthesis

Mar 20, 2026
Via arXiv

Audio Avatar Fingerprinting: An Approach for Authorized Use of Voice Cloning in the Era of Synthetic Audio

Mar 20, 2026
Via arXiv

Gesture2Speech: How Far Can Hand Movements Shape Expressive Speech?

Mar 20, 2026
Via arXiv

On the Emotion Understanding of Synthesized Speech

Mar 17, 2026
Via arXiv