Picture for RJ Skerry-Ryan

RJ Skerry-Ryan

Semi-Supervised Generative Modeling for Controllable Speech Synthesis

Add code
Oct 03, 2019
Figure 1 for Semi-Supervised Generative Modeling for Controllable Speech Synthesis
Figure 2 for Semi-Supervised Generative Modeling for Controllable Speech Synthesis
Figure 3 for Semi-Supervised Generative Modeling for Controllable Speech Synthesis
Figure 4 for Semi-Supervised Generative Modeling for Controllable Speech Synthesis
Viaarxiv icon

Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning

Add code
Jul 24, 2019
Figure 1 for Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning
Figure 2 for Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning
Figure 3 for Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning
Figure 4 for Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning
Viaarxiv icon

Effective Use of Variational Embedding Capacity in Expressive End-to-End Speech Synthesis

Add code
Jul 09, 2019
Figure 1 for Effective Use of Variational Embedding Capacity in Expressive End-to-End Speech Synthesis
Figure 2 for Effective Use of Variational Embedding Capacity in Expressive End-to-End Speech Synthesis
Figure 3 for Effective Use of Variational Embedding Capacity in Expressive End-to-End Speech Synthesis
Figure 4 for Effective Use of Variational Embedding Capacity in Expressive End-to-End Speech Synthesis
Viaarxiv icon

Semi-Supervised Training for Improving Data Efficiency in End-to-End Speech Synthesis

Add code
Aug 30, 2018
Figure 1 for Semi-Supervised Training for Improving Data Efficiency in End-to-End Speech Synthesis
Figure 2 for Semi-Supervised Training for Improving Data Efficiency in End-to-End Speech Synthesis
Figure 3 for Semi-Supervised Training for Improving Data Efficiency in End-to-End Speech Synthesis
Figure 4 for Semi-Supervised Training for Improving Data Efficiency in End-to-End Speech Synthesis
Viaarxiv icon

Predicting Expressive Speaking Style From Text In End-To-End Speech Synthesis

Add code
Aug 04, 2018
Figure 1 for Predicting Expressive Speaking Style From Text In End-To-End Speech Synthesis
Figure 2 for Predicting Expressive Speaking Style From Text In End-To-End Speech Synthesis
Figure 3 for Predicting Expressive Speaking Style From Text In End-To-End Speech Synthesis
Figure 4 for Predicting Expressive Speaking Style From Text In End-To-End Speech Synthesis
Viaarxiv icon

Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron

Add code
Mar 24, 2018
Figure 1 for Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron
Figure 2 for Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron
Figure 3 for Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron
Figure 4 for Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron
Viaarxiv icon

Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis

Add code
Mar 23, 2018
Figure 1 for Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
Figure 2 for Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
Figure 3 for Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
Figure 4 for Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
Viaarxiv icon

Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions

Add code
Feb 16, 2018
Figure 1 for Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions
Figure 2 for Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions
Figure 3 for Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions
Figure 4 for Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions
Viaarxiv icon

Uncovering Latent Style Factors for Expressive Speech Synthesis

Add code
Nov 01, 2017
Figure 1 for Uncovering Latent Style Factors for Expressive Speech Synthesis
Figure 2 for Uncovering Latent Style Factors for Expressive Speech Synthesis
Figure 3 for Uncovering Latent Style Factors for Expressive Speech Synthesis
Viaarxiv icon

Tacotron: Towards End-to-End Speech Synthesis

Add code
Apr 06, 2017
Figure 1 for Tacotron: Towards End-to-End Speech Synthesis
Figure 2 for Tacotron: Towards End-to-End Speech Synthesis
Figure 3 for Tacotron: Towards End-to-End Speech Synthesis
Figure 4 for Tacotron: Towards End-to-End Speech Synthesis
Viaarxiv icon