Alert button
Picture for Ron J. Weiss

Ron J. Weiss

Alert button

On Using Backpropagation for Speech Texture Generation and Voice Conversion

Add code
Bookmark button
Alert button
Mar 08, 2018
Jan Chorowski, Ron J. Weiss, Rif A. Saurous, Samy Bengio

Figure 1 for On Using Backpropagation for Speech Texture Generation and Voice Conversion
Figure 2 for On Using Backpropagation for Speech Texture Generation and Voice Conversion
Figure 3 for On Using Backpropagation for Speech Texture Generation and Voice Conversion
Figure 4 for On Using Backpropagation for Speech Texture Generation and Voice Conversion
Viaarxiv icon

State-of-the-art Speech Recognition With Sequence-to-Sequence Models

Add code
Bookmark button
Alert button
Feb 23, 2018
Chung-Cheng Chiu, Tara N. Sainath, Yonghui Wu, Rohit Prabhavalkar, Patrick Nguyen, Zhifeng Chen, Anjuli Kannan, Ron J. Weiss, Kanishka Rao, Ekaterina Gonina, Navdeep Jaitly, Bo Li, Jan Chorowski, Michiel Bacchiani

Figure 1 for State-of-the-art Speech Recognition With Sequence-to-Sequence Models
Figure 2 for State-of-the-art Speech Recognition With Sequence-to-Sequence Models
Figure 3 for State-of-the-art Speech Recognition With Sequence-to-Sequence Models
Figure 4 for State-of-the-art Speech Recognition With Sequence-to-Sequence Models
Viaarxiv icon

Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions

Add code
Bookmark button
Alert button
Feb 16, 2018
Jonathan Shen, Ruoming Pang, Ron J. Weiss, Mike Schuster, Navdeep Jaitly, Zongheng Yang, Zhifeng Chen, Yu Zhang, Yuxuan Wang, RJ Skerry-Ryan, Rif A. Saurous, Yannis Agiomyrgiannakis, Yonghui Wu

Figure 1 for Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions
Figure 2 for Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions
Figure 3 for Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions
Figure 4 for Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions
Viaarxiv icon

Multilingual Speech Recognition With A Single End-To-End Model

Add code
Bookmark button
Alert button
Feb 15, 2018
Shubham Toshniwal, Tara N. Sainath, Ron J. Weiss, Bo Li, Pedro Moreno, Eugene Weinstein, Kanishka Rao

Figure 1 for Multilingual Speech Recognition With A Single End-To-End Model
Figure 2 for Multilingual Speech Recognition With A Single End-To-End Model
Figure 3 for Multilingual Speech Recognition With A Single End-To-End Model
Figure 4 for Multilingual Speech Recognition With A Single End-To-End Model
Viaarxiv icon

Online and Linear-Time Attention by Enforcing Monotonic Alignments

Add code
Bookmark button
Alert button
Jun 29, 2017
Colin Raffel, Minh-Thang Luong, Peter J. Liu, Ron J. Weiss, Douglas Eck

Figure 1 for Online and Linear-Time Attention by Enforcing Monotonic Alignments
Figure 2 for Online and Linear-Time Attention by Enforcing Monotonic Alignments
Figure 3 for Online and Linear-Time Attention by Enforcing Monotonic Alignments
Figure 4 for Online and Linear-Time Attention by Enforcing Monotonic Alignments
Viaarxiv icon

Sequence-to-Sequence Models Can Directly Translate Foreign Speech

Add code
Bookmark button
Alert button
Jun 12, 2017
Ron J. Weiss, Jan Chorowski, Navdeep Jaitly, Yonghui Wu, Zhifeng Chen

Figure 1 for Sequence-to-Sequence Models Can Directly Translate Foreign Speech
Figure 2 for Sequence-to-Sequence Models Can Directly Translate Foreign Speech
Figure 3 for Sequence-to-Sequence Models Can Directly Translate Foreign Speech
Figure 4 for Sequence-to-Sequence Models Can Directly Translate Foreign Speech
Viaarxiv icon

Tacotron: Towards End-to-End Speech Synthesis

Add code
Bookmark button
Alert button
Apr 06, 2017
Yuxuan Wang, RJ Skerry-Ryan, Daisy Stanton, Yonghui Wu, Ron J. Weiss, Navdeep Jaitly, Zongheng Yang, Ying Xiao, Zhifeng Chen, Samy Bengio, Quoc Le, Yannis Agiomyrgiannakis, Rob Clark, Rif A. Saurous

Figure 1 for Tacotron: Towards End-to-End Speech Synthesis
Figure 2 for Tacotron: Towards End-to-End Speech Synthesis
Figure 3 for Tacotron: Towards End-to-End Speech Synthesis
Figure 4 for Tacotron: Towards End-to-End Speech Synthesis
Viaarxiv icon

CNN Architectures for Large-Scale Audio Classification

Add code
Bookmark button
Alert button
Jan 10, 2017
Shawn Hershey, Sourish Chaudhuri, Daniel P. W. Ellis, Jort F. Gemmeke, Aren Jansen, R. Channing Moore, Manoj Plakal, Devin Platt, Rif A. Saurous, Bryan Seybold, Malcolm Slaney, Ron J. Weiss, Kevin Wilson

Figure 1 for CNN Architectures for Large-Scale Audio Classification
Figure 2 for CNN Architectures for Large-Scale Audio Classification
Figure 3 for CNN Architectures for Large-Scale Audio Classification
Figure 4 for CNN Architectures for Large-Scale Audio Classification
Viaarxiv icon