Jaime Lorenzo-Trueba

Weakly-supervised word-level pronunciation error detection in non-native English speech

Jun 07, 2021
Daniel Korzekwa, Jaime Lorenzo-Trueba, Thomas Drugman, Shira Calamaro, Bozena Kostek

Proteno: Text Normalization with Limited Data for Fast Deployment in Text to Speech Systems

Apr 15, 2021
Shubhi Tyagi, Antonio Bonafonte, Jaime Lorenzo-Trueba, Javier Latorre

Mispronunciation Detection in Non-native (L2) English with Uncertainty Modeling

Feb 08, 2021
Daniel Korzekwa, Jaime Lorenzo-Trueba, Szymon Zaporowski, Shira Calamaro, Thomas Drugman, Bozena Kostek

EmoCat: Language-agnostic Emotional Voice Conversion

Jan 14, 2021
Bastian Schnell, Goeric Huybrechts, Bartek Perz, Thomas Drugman, Jaime Lorenzo-Trueba

Detection of Lexical Stress Errors in Non-native (L2) English with Data Augmentation and Attention

Dec 29, 2020
Daniel Korzekwa, Roberto Barra-Chicote, Szymon Zaporowski, Grzegorz Beringer, Jaime Lorenzo-Trueba, Alicja Serafinowicz, Jasha Droppo, Thomas Drugman, Bozena Kostek

Voice Conversion for Whispered Speech Synthesis

Jan 17, 2020
Marius Cotescu, Thomas Drugman, Goeric Huybrechts, Jaime Lorenzo-Trueba, Alexis Moinet

Dynamic Prosody Generation for Speech Synthesis using Linguistics-Driven Acoustic Embedding Selection

Dec 02, 2019
Shubhi Tyagi, Marco Nicolis, Jonas Rohnke, Thomas Drugman, Jaime Lorenzo-Trueba

Using VAEs and Normalizing Flows for One-shot Text-To-Speech Synthesis of Expressive Speech

Nov 28, 2019
Vatsal Aggarwal, Marius Cotescu, Nishant Prateek, Jaime Lorenzo-Trueba, Roberto Barra-Chicote
