Alert button
Picture for Rob Clark

Rob Clark

Alert button

Training Text-To-Speech Systems From Synthetic Data: A Practical Approach For Accent Transfer Tasks

Add code
Bookmark button
Alert button
Aug 28, 2022
Lev Finkelstein, Heiga Zen, Norman Casagrande, Chun-an Chan, Ye Jia, Tom Kenter, Alexey Petelin, Jonathan Shen, Vincent Wan, Yu Zhang, Yonghui Wu, Rob Clark

Figure 1 for Training Text-To-Speech Systems From Synthetic Data: A Practical Approach For Accent Transfer Tasks
Figure 2 for Training Text-To-Speech Systems From Synthetic Data: A Practical Approach For Accent Transfer Tasks
Figure 3 for Training Text-To-Speech Systems From Synthetic Data: A Practical Approach For Accent Transfer Tasks
Figure 4 for Training Text-To-Speech Systems From Synthetic Data: A Practical Approach For Accent Transfer Tasks
Viaarxiv icon

Evaluating Long-form Text-to-Speech: Comparing the Ratings of Sentences and Paragraphs

Add code
Bookmark button
Alert button
Sep 09, 2019
Rob Clark, Hanna Silen, Tom Kenter, Ralph Leith

Figure 1 for Evaluating Long-form Text-to-Speech: Comparing the Ratings of Sentences and Paragraphs
Figure 2 for Evaluating Long-form Text-to-Speech: Comparing the Ratings of Sentences and Paragraphs
Figure 3 for Evaluating Long-form Text-to-Speech: Comparing the Ratings of Sentences and Paragraphs
Figure 4 for Evaluating Long-form Text-to-Speech: Comparing the Ratings of Sentences and Paragraphs
Viaarxiv icon

CHiVE: Varying Prosody in Speech Synthesis with a Linguistically Driven Dynamic Hierarchical Conditional Variational Network

Add code
Bookmark button
Alert button
Jun 04, 2019
Vincent Wan, Chun-an Chan, Tom Kenter, Jakub Vit, Rob Clark

Figure 1 for CHiVE: Varying Prosody in Speech Synthesis with a Linguistically Driven Dynamic Hierarchical Conditional Variational Network
Figure 2 for CHiVE: Varying Prosody in Speech Synthesis with a Linguistically Driven Dynamic Hierarchical Conditional Variational Network
Figure 3 for CHiVE: Varying Prosody in Speech Synthesis with a Linguistically Driven Dynamic Hierarchical Conditional Variational Network
Figure 4 for CHiVE: Varying Prosody in Speech Synthesis with a Linguistically Driven Dynamic Hierarchical Conditional Variational Network
Viaarxiv icon

Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron

Add code
Bookmark button
Alert button
Mar 24, 2018
RJ Skerry-Ryan, Eric Battenberg, Ying Xiao, Yuxuan Wang, Daisy Stanton, Joel Shor, Ron J. Weiss, Rob Clark, Rif A. Saurous

Figure 1 for Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron
Figure 2 for Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron
Figure 3 for Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron
Figure 4 for Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron
Viaarxiv icon

Uncovering Latent Style Factors for Expressive Speech Synthesis

Add code
Bookmark button
Alert button
Nov 01, 2017
Yuxuan Wang, RJ Skerry-Ryan, Ying Xiao, Daisy Stanton, Joel Shor, Eric Battenberg, Rob Clark, Rif A. Saurous

Figure 1 for Uncovering Latent Style Factors for Expressive Speech Synthesis
Figure 2 for Uncovering Latent Style Factors for Expressive Speech Synthesis
Figure 3 for Uncovering Latent Style Factors for Expressive Speech Synthesis
Viaarxiv icon

Tacotron: Towards End-to-End Speech Synthesis

Add code
Bookmark button
Alert button
Apr 06, 2017
Yuxuan Wang, RJ Skerry-Ryan, Daisy Stanton, Yonghui Wu, Ron J. Weiss, Navdeep Jaitly, Zongheng Yang, Ying Xiao, Zhifeng Chen, Samy Bengio, Quoc Le, Yannis Agiomyrgiannakis, Rob Clark, Rif A. Saurous

Figure 1 for Tacotron: Towards End-to-End Speech Synthesis
Figure 2 for Tacotron: Towards End-to-End Speech Synthesis
Figure 3 for Tacotron: Towards End-to-End Speech Synthesis
Figure 4 for Tacotron: Towards End-to-End Speech Synthesis
Viaarxiv icon