Alert button
Picture for Thomas Drugman

Thomas Drugman

Alert button

Distribution augmentation for low-resource expressive text-to-speech

Add code
Bookmark button
Alert button
Feb 13, 2022
Mateusz Lajszczak, Animesh Prasad, Arent van Korlaar, Bajibabu Bollepalli, Antonio Bonafonte, Arnaud Joly, Marco Nicolis, Alexis Moinet, Thomas Drugman, Trevor Wood, Elena Sokolova

Figure 1 for Distribution augmentation for low-resource expressive text-to-speech
Figure 2 for Distribution augmentation for low-resource expressive text-to-speech
Figure 3 for Distribution augmentation for low-resource expressive text-to-speech
Figure 4 for Distribution augmentation for low-resource expressive text-to-speech
Viaarxiv icon

Multi-Scale Spectrogram Modelling for Neural Text-to-Speech

Add code
Bookmark button
Alert button
Jun 29, 2021
Ammar Abbas, Bajibabu Bollepalli, Alexis Moinet, Arnaud Joly, Penny Karanasou, Peter Makarov, Simon Slangens, Sri Karlapati, Thomas Drugman

Figure 1 for Multi-Scale Spectrogram Modelling for Neural Text-to-Speech
Figure 2 for Multi-Scale Spectrogram Modelling for Neural Text-to-Speech
Figure 3 for Multi-Scale Spectrogram Modelling for Neural Text-to-Speech
Figure 4 for Multi-Scale Spectrogram Modelling for Neural Text-to-Speech
Viaarxiv icon

Voicy: Zero-Shot Non-Parallel Voice Conversion in Noisy Reverberant Environments

Add code
Bookmark button
Alert button
Jun 16, 2021
Alejandro Mottini, Jaime Lorenzo-Trueba, Sri Vishnu Kumar Karlapati, Thomas Drugman

Figure 1 for Voicy: Zero-Shot Non-Parallel Voice Conversion in Noisy Reverberant Environments
Figure 2 for Voicy: Zero-Shot Non-Parallel Voice Conversion in Noisy Reverberant Environments
Figure 3 for Voicy: Zero-Shot Non-Parallel Voice Conversion in Noisy Reverberant Environments
Figure 4 for Voicy: Zero-Shot Non-Parallel Voice Conversion in Noisy Reverberant Environments
Viaarxiv icon

A learned conditional prior for the VAE acoustic space of a TTS system

Add code
Bookmark button
Alert button
Jun 14, 2021
Penny Karanasou, Sri Karlapati, Alexis Moinet, Arnaud Joly, Ammar Abbas, Simon Slangen, Jaime Lorenzo Trueba, Thomas Drugman

Figure 1 for A learned conditional prior for the VAE acoustic space of a TTS system
Figure 2 for A learned conditional prior for the VAE acoustic space of a TTS system
Figure 3 for A learned conditional prior for the VAE acoustic space of a TTS system
Figure 4 for A learned conditional prior for the VAE acoustic space of a TTS system
Viaarxiv icon

Weakly-supervised word-level pronunciation error detection in non-native English speech

Add code
Bookmark button
Alert button
Jun 07, 2021
Daniel Korzekwa, Jaime Lorenzo-Trueba, Thomas Drugman, Shira Calamaro, Bozena Kostek

Figure 1 for Weakly-supervised word-level pronunciation error detection in non-native English speech
Figure 2 for Weakly-supervised word-level pronunciation error detection in non-native English speech
Figure 3 for Weakly-supervised word-level pronunciation error detection in non-native English speech
Figure 4 for Weakly-supervised word-level pronunciation error detection in non-native English speech
Viaarxiv icon

Mispronunciation Detection in Non-native (L2) English with Uncertainty Modeling

Add code
Bookmark button
Alert button
Feb 08, 2021
Daniel Korzekwa, Jaime Lorenzo-Trueba, Szymon Zaporowski, Shira Calamaro, Thomas Drugman, Bozena Kostek

Figure 1 for Mispronunciation Detection in Non-native (L2) English with Uncertainty Modeling
Figure 2 for Mispronunciation Detection in Non-native (L2) English with Uncertainty Modeling
Figure 3 for Mispronunciation Detection in Non-native (L2) English with Uncertainty Modeling
Figure 4 for Mispronunciation Detection in Non-native (L2) English with Uncertainty Modeling
Viaarxiv icon

EmoCat: Language-agnostic Emotional Voice Conversion

Add code
Bookmark button
Alert button
Jan 14, 2021
Bastian Schnell, Goeric Huybrechts, Bartek Perz, Thomas Drugman, Jaime Lorenzo-Trueba

Figure 1 for EmoCat: Language-agnostic Emotional Voice Conversion
Figure 2 for EmoCat: Language-agnostic Emotional Voice Conversion
Viaarxiv icon

Detection of Lexical Stress Errors in Non-native (L2) English with Data Augmentation and Attention

Add code
Bookmark button
Alert button
Dec 29, 2020
Daniel Korzekwa, Roberto Barra-Chicote, Szymon Zaporowski, Grzegorz Beringer, Jaime Lorenzo-Trueba, Alicja Serafinowicz, Jasha Droppo, Thomas Drugman, Bozena Kostek

Figure 1 for Detection of Lexical Stress Errors in Non-native (L2) English with Data Augmentation and Attention
Figure 2 for Detection of Lexical Stress Errors in Non-native (L2) English with Data Augmentation and Attention
Figure 3 for Detection of Lexical Stress Errors in Non-native (L2) English with Data Augmentation and Attention
Figure 4 for Detection of Lexical Stress Errors in Non-native (L2) English with Data Augmentation and Attention
Viaarxiv icon

Prosodic Representation Learning and Contextual Sampling for Neural Text-to-Speech

Add code
Bookmark button
Alert button
Nov 04, 2020
Sri Karlapati, Ammar Abbas, Zack Hodari, Alexis Moinet, Arnaud Joly, Penny Karanasou, Thomas Drugman

Figure 1 for Prosodic Representation Learning and Contextual Sampling for Neural Text-to-Speech
Figure 2 for Prosodic Representation Learning and Contextual Sampling for Neural Text-to-Speech
Figure 3 for Prosodic Representation Learning and Contextual Sampling for Neural Text-to-Speech
Viaarxiv icon