Álvaro Martín-Cortinas

Investigating self-supervised features for expressive, multilingual voice conversion

May 13, 2025

BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data

Feb 15, 2024

Enhancing the Stability of LLM-based Speech Generation Systems through Self-Supervised Representations

Feb 05, 2024