RNN-T Models Fail to Generalize to Out-of-Domain Audio: Causes and Solutions



Chung-Cheng Chiu , Arun Narayanan , Wei Han , Rohit Prabhavalkar , Yu Zhang , Navdeep Jaitly , Ruoming Pang , Tara N. Sainath , Patrick Nguyen , Liangliang Cao , Yonghui Wu

* Submitted to Interspeech 2020 

A comparison of end-to-end models for long-form speech recognition



Chung-Cheng Chiu , Wei Han , Yu Zhang , Ruoming Pang , Sergey Kishchenko , Patrick Nguyen , Arun Narayanan , Hank Liao , Shuyuan Zhang , Anjuli Kannan , Rohit Prabhavalkar , Zhifeng Chen , Tara Sainath , Yonghui Wu

* ASRU camera-ready version 

StarNet: Targeted Computation for Object Detection in Point Clouds



Jiquan Ngiam , Benjamin Caine , Wei Han , Brandon Yang , Yuning Chai , Pei Sun , Yin Zhou , Xi Yi , Ouais Alsharif , Patrick Nguyen , Zhifeng Chen , Jonathon Shlens , Vijay Vasudevan


Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling



Jonathan Shen , Patrick Nguyen , Yonghui Wu , Zhifeng Chen , Mia X. Chen , Ye Jia , Anjuli Kannan , Tara Sainath , Yuan Cao , Chung-Cheng Chiu , Yanzhang He , Jan Chorowski , Smit Hinsu , Stella Laurenzo , James Qin , Orhan Firat , Wolfgang Macherey , Suyog Gupta , Ankur Bapna , Shuyuan Zhang , Ruoming Pang , Ron J. Weiss , Rohit Prabhavalkar , Qiao Liang , Benoit Jacob , Bowen Liang , HyoukJoong Lee , Ciprian Chelba , Sébastien Jean , Bo Li , Melvin Johnson , Rohan Anil , Rajat Tibrewal , Xiaobing Liu , Akiko Eriguchi , Navdeep Jaitly , Naveen Ari , Colin Cherry , Parisa Haghani , Otavio Good , Youlong Cheng , Raziel Alvarez , Isaac Caswell , Wei-Ning Hsu , Zongheng Yang , Kuan-Chieh Wang , Ekaterina Gonina , Katrin Tomanek , Ben Vanik , Zelin Wu , Llion Jones , Mike Schuster , Yanping Huang , Dehao Chen , Kazuki Irie , George Foster , John Richardson , Klaus Macherey , Antoine Bruguier , Heiga Zen , Colin Raffel , Shankar Kumar , Kanishka Rao , David Rybach , Matthew Murray , Vijayaditya Peddinti , Maxim Krikun , Michiel A. U. Bacchiani , Thomas B. Jablin , Rob Suderman , Ian Williams , Benjamin Lee , Deepti Bhatia , Justin Carlson , Semih Yavuz , Yu Zhang , Ian McGraw , Max Galkin , Qi Ge , Golan Pundak , Chad Whipkey , Todd Wang , Uri Alon , Dmitry Lepikhin , Ye Tian , Sara Sabour , William Chan , Shubham Toshniwal , Baohua Liao , Michael Nirschl , Pat Rondon


Model Unit Exploration for Sequence-to-Sequence Speech Recognition



Kazuki Irie , Rohit Prabhavalkar , Anjuli Kannan , Antoine Bruguier , David Rybach , Patrick Nguyen

* 5 pages, 1 figure 

Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis



Ye Jia , Yu Zhang , Ron J. Weiss , Quan Wang , Jonathan Shen , Fei Ren , Zhifeng Chen , Patrick Nguyen , Ruoming Pang , Ignacio Lopez Moreno , Yonghui Wu

* NIPS 2018 

Hierarchical Generative Modeling for Controllable Speech Synthesis



Wei-Ning Hsu , Yu Zhang , Ron J. Weiss , Heiga Zen , Yonghui Wu , Yuxuan Wang , Yuan Cao , Ye Jia , Zhifeng Chen , Jonathan Shen , Patrick Nguyen , Ruoming Pang


Speech recognition for medical conversations



Chung-Cheng Chiu , Anshuman Tripathi , Katherine Chou , Chris Co , Navdeep Jaitly , Diana Jaunzeikare , Anjuli Kannan , Patrick Nguyen , Hasim Sak , Ananth Sankar , Justin Tansuwan , Nathan Wan , Yonghui Wu , Xuedong Zhang

* Interspeech 2018 camera ready 

State-of-the-art Speech Recognition With Sequence-to-Sequence Models



Chung-Cheng Chiu , Tara N. Sainath , Yonghui Wu , Rohit Prabhavalkar , Patrick Nguyen , Zhifeng Chen , Anjuli Kannan , Ron J. Weiss , Kanishka Rao , Ekaterina Gonina , Navdeep Jaitly , Bo Li , Jan Chorowski , Michiel Bacchiani

* ICASSP camera-ready version 
