Éva Székely

Michaela

The Voice Behind the Words: Quantifying Intersectional Bias in SpeechLLMs

Mar 15, 2026

What Counts as Real? Speech Restoration and Voice Quality Conversion Pose New Challenges to Deepfake Detection

Mar 14, 2026

Speak Your Mind: The Speech Continuation Task as a Probe of Voice-Based Model Bias

Sep 26, 2025

Will AI shape the way we speak? The emerging sociolinguistic influence of synthetic voices

Apr 14, 2025

Should you use a probabilistic duration model in TTS? Probably! Especially for spontaneous speech

Jun 08, 2024

Evaluating Text-to-Speech Synthesis from a Large Discrete Token-based Speech Language Model

May 16, 2024

Unified speech and gesture synthesis using flow matching

Oct 08, 2023

Matcha-TTS: A fast TTS architecture with conditional flow matching

Sep 06, 2023

On the Use of Self-Supervised Speech Representations in Spontaneous Speech Synthesis

Jul 11, 2023

Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis

Jun 15, 2023