Alert button
Picture for Mateusz Łajszczak

Mateusz Łajszczak

Alert button

BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data

Add code
Bookmark button
Alert button
Feb 15, 2024
Mateusz Łajszczak, Guillermo Cámbara, Yang Li, Fatih Beyhan, Arent van Korlaar, Fan Yang, Arnaud Joly, Álvaro Martín-Cortinas, Ammar Abbas, Adam Michalski, Alexis Moinet, Sri Karlapati, Ewa Muszyńska, Haohan Guo, Bartosz Putrycz, Soledad López Gambino, Kayeon Yoo, Elena Sokolova, Thomas Drugman

Viaarxiv icon

Simple and Effective Multi-sentence TTS with Expressive and Coherent Prosody

Add code
Bookmark button
Alert button
Jun 29, 2022
Peter Makarov, Ammar Abbas, Mateusz Łajszczak, Arnaud Joly, Sri Karlapati, Alexis Moinet, Thomas Drugman, Penny Karanasou

Figure 1 for Simple and Effective Multi-sentence TTS with Expressive and Coherent Prosody
Figure 2 for Simple and Effective Multi-sentence TTS with Expressive and Coherent Prosody
Figure 3 for Simple and Effective Multi-sentence TTS with Expressive and Coherent Prosody
Figure 4 for Simple and Effective Multi-sentence TTS with Expressive and Coherent Prosody
Viaarxiv icon

Discrete acoustic space for an efficient sampling in neural text-to-speech

Add code
Bookmark button
Alert button
Oct 24, 2021
Marek Strelec, Jonas Rohnke, Antonio Bonafonte, Mateusz Łajszczak, Trevor Wood

Figure 1 for Discrete acoustic space for an efficient sampling in neural text-to-speech
Figure 2 for Discrete acoustic space for an efficient sampling in neural text-to-speech
Figure 3 for Discrete acoustic space for an efficient sampling in neural text-to-speech
Figure 4 for Discrete acoustic space for an efficient sampling in neural text-to-speech
Viaarxiv icon

In Other News: A Bi-style Text-to-speech Model for Synthesizing Newscaster Voice with Limited Data

Add code
Bookmark button
Alert button
Apr 04, 2019
Nishant Prateek, Mateusz Łajszczak, Roberto Barra-Chicote, Thomas Drugman, Jaime Lorenzo-Trueba, Thomas Merritt, Srikanth Ronanki, Trevor Wood

Figure 1 for In Other News: A Bi-style Text-to-speech Model for Synthesizing Newscaster Voice with Limited Data
Figure 2 for In Other News: A Bi-style Text-to-speech Model for Synthesizing Newscaster Voice with Limited Data
Figure 3 for In Other News: A Bi-style Text-to-speech Model for Synthesizing Newscaster Voice with Limited Data
Figure 4 for In Other News: A Bi-style Text-to-speech Model for Synthesizing Newscaster Voice with Limited Data
Viaarxiv icon