Ron J. Weiss

G-Augment: Searching For The Meta-Structure Of Data Augmentation Policies For ASR

Oct 19, 2022

WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis

Jun 19, 2021

Sparse, Efficient, and Semantic Mixture Invariant Training: Taming In-the-Wild Unsupervised Sound Separation

Jun 01, 2021

Wave-Tacotron: Spectrogram-free end-to-end text-to-speech synthesis

Nov 06, 2020

Multitask Training with Text Data for End-to-End Speech Recognition

Oct 27, 2020

WaveGrad: Estimating Gradients for Waveform Generation

Sep 02, 2020

Unsupervised Sound Separation Using Mixtures of Mixtures

Jun 23, 2020

Fully-hierarchical fine-grained prosody modeling for interpretable speech synthesis

Feb 06, 2020

Generating diverse and natural text-to-speech samples using a quantized fine-grained VAE and auto-regressive prosody prior

Feb 06, 2020

Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning

Jul 24, 2019