Daniel Korzekwa

Computer-assisted Pronunciation Training -- Speech synthesis is almost all you need

Jul 02, 2022
Via arXiv

Text-free non-parallel many-to-many voice conversion using normalising flows

Mar 15, 2022
Via arXiv

Enhancing audio quality for expressive Neural Text-to-Speech

Aug 13, 2021
Via arXiv

Non-Autoregressive TTS with Explicit Duration Modelling for Low-Resource Highly Expressive Speech

Jun 25, 2021
Via arXiv

Improving the expressiveness of neural vocoding with non-affine Normalizing Flows

Jun 16, 2021
Via arXiv

Weakly-supervised word-level pronunciation error detection in non-native English speech

Jun 07, 2021
Via arXiv

Universal Neural Vocoding with Parallel WaveNet

Feb 15, 2021
Via arXiv

Mispronunciation Detection in Non-native (L2) English with Uncertainty Modeling

Feb 08, 2021
Via arXiv

Detection of Lexical Stress Errors in Non-native English with Data Augmentation and Attention

Dec 29, 2020
Via arXiv

Interpretable Deep Learning Model for the Detection and Reconstruction of Dysarthric Speech

Jul 10, 2019
Via arXiv