Picture for Ron J. Weiss

Ron J. Weiss

Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning

Add code
Jul 24, 2019
Figure 1 for Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning
Figure 2 for Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning
Figure 3 for Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning
Figure 4 for Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning
Viaarxiv icon

Direct speech-to-speech translation with a sequence-to-sequence model

Add code
Apr 12, 2019
Figure 1 for Direct speech-to-speech translation with a sequence-to-sequence model
Figure 2 for Direct speech-to-speech translation with a sequence-to-sequence model
Figure 3 for Direct speech-to-speech translation with a sequence-to-sequence model
Figure 4 for Direct speech-to-speech translation with a sequence-to-sequence model
Viaarxiv icon

Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling

Add code
Feb 21, 2019
Figure 1 for Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling
Figure 2 for Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling
Figure 3 for Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling
Viaarxiv icon

A spelling correction model for end-to-end speech recognition

Add code
Feb 19, 2019
Figure 1 for A spelling correction model for end-to-end speech recognition
Figure 2 for A spelling correction model for end-to-end speech recognition
Figure 3 for A spelling correction model for end-to-end speech recognition
Figure 4 for A spelling correction model for end-to-end speech recognition
Viaarxiv icon

Unsupervised speech representation learning using WaveNet autoencoders

Add code
Jan 25, 2019
Figure 1 for Unsupervised speech representation learning using WaveNet autoencoders
Figure 2 for Unsupervised speech representation learning using WaveNet autoencoders
Figure 3 for Unsupervised speech representation learning using WaveNet autoencoders
Figure 4 for Unsupervised speech representation learning using WaveNet autoencoders
Viaarxiv icon

Leveraging Weakly Supervised Data to Improve End-to-End Speech-to-Text Translation

Add code
Nov 05, 2018
Figure 1 for Leveraging Weakly Supervised Data to Improve End-to-End Speech-to-Text Translation
Figure 2 for Leveraging Weakly Supervised Data to Improve End-to-End Speech-to-Text Translation
Figure 3 for Leveraging Weakly Supervised Data to Improve End-to-End Speech-to-Text Translation
Figure 4 for Leveraging Weakly Supervised Data to Improve End-to-End Speech-to-Text Translation
Viaarxiv icon

Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis

Add code
Nov 05, 2018
Figure 1 for Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis
Figure 2 for Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis
Figure 3 for Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis
Figure 4 for Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis
Viaarxiv icon

VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking

Add code
Oct 27, 2018
Figure 1 for VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking
Figure 2 for VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking
Figure 3 for VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking
Figure 4 for VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking
Viaarxiv icon

Hierarchical Generative Modeling for Controllable Speech Synthesis

Add code
Oct 16, 2018
Figure 1 for Hierarchical Generative Modeling for Controllable Speech Synthesis
Figure 2 for Hierarchical Generative Modeling for Controllable Speech Synthesis
Figure 3 for Hierarchical Generative Modeling for Controllable Speech Synthesis
Figure 4 for Hierarchical Generative Modeling for Controllable Speech Synthesis
Viaarxiv icon

Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron

Add code
Mar 24, 2018
Figure 1 for Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron
Figure 2 for Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron
Figure 3 for Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron
Figure 4 for Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron
Viaarxiv icon