Thomas Drugman

Voicy: Zero-Shot Non-Parallel Voice Conversion in Noisy Reverberant Environments

Jun 16, 2021
Via arXiv

A learned conditional prior for the VAE acoustic space of a TTS system

Jun 14, 2021
Via arXiv

Weakly-supervised word-level pronunciation error detection in non-native English speech

Jun 07, 2021
Via arXiv

Mispronunciation Detection in Non-native (L2) English with Uncertainty Modeling

Feb 08, 2021
Via arXiv

EmoCat: Language-agnostic Emotional Voice Conversion

Jan 14, 2021
Via arXiv

Detection of Lexical Stress Errors in Non-native English with Data Augmentation and Attention

Dec 29, 2020
Via arXiv

Prosodic Representation Learning and Contextual Sampling for Neural Text-to-Speech

Nov 04, 2020
Via arXiv

Parametric Representation for Singing Voice Synthesis: a Comparative Evaluation

Jun 07, 2020
Via arXiv

Maximum Phase Modeling for Sparse Linear Prediction of Speech

Jun 07, 2020
Via arXiv

Analysis and Synthesis of Hypo and Hyperarticulated Speech

Jun 07, 2020
Via arXiv