Tara N. Sainath

Google Inc. USA

Joint Unsupervised and Supervised Training for Multilingual ASR


Nov 15, 2021
Junwen Bai, Bo Li, Yu Zhang, Ankur Bapna, Nikhil Siddhartha, Khe Chai Sim, Tara N. Sainath



BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition


Oct 01, 2021
Yu Zhang, Daniel S. Park, Wei Han, James Qin, Anmol Gulati, Joel Shor, Aren Jansen, Yuanzhong Xu, Yanping Huang, Shibo Wang, Zongwei Zhou, Bo Li, Min Ma, William Chan, Jiahui Yu, Yongqiang Wang, Liangliang Cao, Khe Chai Sim, Bhuvana Ramabhadran, Tara N. Sainath, Françoise Beaufays, Zhifeng Chen, Quoc V. Le, Chung-Cheng Chiu, Ruoming Pang, Yonghui Wu

* 14 pages, 7 figures, 13 tables; v2: minor corrections, reference baselines and bibliography updated 


Tied & Reduced RNN-T Decoder


Sep 15, 2021
Rami Botros, Tara N. Sainath, Robert David, Emmanuel Guzman, Wei Li, Yanzhang He

* Proc. Interspeech 2021, 4563-4567 


Scaling End-to-End Models for Large-Scale Multilingual ASR


Apr 30, 2021
Bo Li, Ruoming Pang, Tara N. Sainath, Anmol Gulati, Yu Zhang, James Qin, Parisa Haghani, W. Ronny Huang, Min Ma

* Submitted to INTERSPEECH 2021 


Lookup-Table Recurrent Language Models for Long Tail Speech Recognition


Apr 09, 2021
W. Ronny Huang, Tara N. Sainath, Cal Peyser, Shankar Kumar, David Rybach, Trevor Strohman

* Submitted to Interspeech 2021 


Learning Word-Level Confidence For Subword End-to-End ASR


Mar 11, 2021
David Qiu, Qiujia Li, Yanzhang He, Yu Zhang, Bo Li, Liangliang Cao, Rohit Prabhavalkar, Deepti Bhatia, Wei Li, Ke Hu, Tara N. Sainath, Ian McGraw

* To appear in ICASSP 2021 


Transformer Based Deliberation for Two-Pass Speech Recognition


Jan 27, 2021
Ke Hu, Ruoming Pang, Tara N. Sainath, Trevor Strohman



Less Is More: Improved RNN-T Decoding Using Limited Label Context and Path Merging


Dec 12, 2020
Rohit Prabhavalkar, Yanzhang He, David Rybach, Sean Campbell, Arun Narayanan, Trevor Strohman, Tara N. Sainath



Cascaded encoders for unifying streaming and non-streaming ASR


Oct 27, 2020
Arun Narayanan, Tara N. Sainath, Ruoming Pang, Jiahui Yu, Chung-Cheng Chiu, Rohit Prabhavalkar, Ehsan Variani, Trevor Strohman



Multitask Training with Text Data for End-to-End Speech Recognition


Oct 27, 2020
Peidong Wang, Tara N. Sainath, Ron J. Weiss



FastEmit: Low-latency Streaming ASR with Sequence-level Emission Regularization


Oct 21, 2020
Jiahui Yu, Chung-Cheng Chiu, Bo Li, Shuo-yiin Chang, Tara N. Sainath, Yanzhang He, Arun Narayanan, Wei Han, Anmol Gulati, Yonghui Wu, Ruoming Pang

* tech report 


Universal ASR: Unify and Improve Streaming ASR with Full-context Modeling


Oct 12, 2020
Jiahui Yu, Wei Han, Anmol Gulati, Chung-Cheng Chiu, Bo Li, Tara N. Sainath, Yonghui Wu, Ruoming Pang

* tech report 


Improving Tail Performance of a Deliberation E2E ASR Model Using a Large Text Corpus


Aug 25, 2020
Cal Peyser, Sepand Mavandadi, Tara N. Sainath, James Apfel, Ruoming Pang, Shankar Kumar



Improving Proper Noun Recognition in End-to-End ASR By Customization of the MWER Loss Criterion


May 19, 2020
Cal Peyser, Tara N. Sainath, Golan Pundak



RNN-T Models Fail to Generalize to Out-of-Domain Audio: Causes and Solutions


May 17, 2020
Chung-Cheng Chiu, Arun Narayanan, Wei Han, Rohit Prabhavalkar, Yu Zhang, Navdeep Jaitly, Ruoming Pang, Tara N. Sainath, Patrick Nguyen, Liangliang Cao, Yonghui Wu

* Submitted to Interspeech 2020 


A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency


Mar 28, 2020
Tara N. Sainath, Yanzhang He, Bo Li, Arun Narayanan, Ruoming Pang, Antoine Bruguier, Shuo-yiin Chang, Wei Li, Raziel Alvarez, Zhifeng Chen, Chung-Cheng Chiu, David Garcia, Alex Gruenstein, Ke Hu, Minho Jin, Anjuli Kannan, Qiao Liang, Ian McGraw, Cal Peyser, Rohit Prabhavalkar, Golan Pundak, David Rybach, Yuan Shangguan, Yash Sheth, Trevor Strohman, Mirko Visontai, Yonghui Wu, Yu Zhang, Ding Zhao

* In Proceedings of IEEE ICASSP 2020 


Deliberation Model Based Two-Pass End-to-End Speech Recognition


Mar 17, 2020
Ke Hu, Tara N. Sainath, Ruoming Pang, Rohit Prabhavalkar



Recognizing long-form speech using streaming end-to-end models


Oct 24, 2019
Arun Narayanan, Rohit Prabhavalkar, Chung-Cheng Chiu, David Rybach, Tara N. Sainath, Trevor Strohman



Large-Scale Multilingual Speech Recognition with a Streaming End-to-End Model


Sep 11, 2019
Anjuli Kannan, Arindrima Datta, Tara N. Sainath, Eugene Weinstein, Bhuvana Ramabhadran, Yonghui Wu, Ankur Bapna, Zhifeng Chen, Seungji Lee

* Accepted in Interspeech 2019 


Two-Pass End-to-End Speech Recognition


Aug 29, 2019
Tara N. Sainath, Ruoming Pang, David Rybach, Yanzhang He, Rohit Prabhavalkar, Wei Li, Mirkó Visontai, Qiao Liang, Trevor Strohman, Yonghui Wu, Ian McGraw, Chung-Cheng Chiu



Phoneme-Based Contextualization for Cross-Lingual Speech Recognition in End-to-End Models


Jul 22, 2019
Ke Hu, Antoine Bruguier, Tara N. Sainath, Rohit Prabhavalkar, Golan Pundak



Improving Performance of End-to-End ASR on Numeric Sequences


Jul 01, 2019
Cal Peyser, Hao Zhang, Tara N. Sainath, Zelin Wu



A spelling correction model for end-to-end speech recognition


Feb 19, 2019
Jinxi Guo, Tara N. Sainath, Ron J. Weiss

* Accepted to ICASSP 2019 


Streaming End-to-end Speech Recognition For Mobile Devices


Nov 15, 2018
Yanzhang He, Tara N. Sainath, Rohit Prabhavalkar, Ian McGraw, Raziel Alvarez, Ding Zhao, David Rybach, Anjuli Kannan, Yonghui Wu, Ruoming Pang, Qiao Liang, Deepti Bhatia, Yuan Shangguan, Bo Li, Golan Pundak, Khe Chai Sim, Tom Bagby, Shuo-yiin Chang, Kanishka Rao, Alexander Gruenstein



Contextual Speech Recognition with Difficult Negative Training Examples


Oct 29, 2018
Uri Alon, Golan Pundak, Tara N. Sainath



Deep context: end-to-end contextual speech recognition


Aug 07, 2018
Golan Pundak, Tara N. Sainath, Rohit Prabhavalkar, Anjuli Kannan, Ding Zhao



State-of-the-art Speech Recognition With Sequence-to-Sequence Models


Feb 23, 2018
Chung-Cheng Chiu, Tara N. Sainath, Yonghui Wu, Rohit Prabhavalkar, Patrick Nguyen, Zhifeng Chen, Anjuli Kannan, Ron J. Weiss, Kanishka Rao, Ekaterina Gonina, Navdeep Jaitly, Bo Li, Jan Chorowski, Michiel Bacchiani

* ICASSP camera-ready version 


Multilingual Speech Recognition With A Single End-To-End Model


Feb 15, 2018
Shubham Toshniwal, Tara N. Sainath, Ron J. Weiss, Bo Li, Pedro Moreno, Eugene Weinstein, Kanishka Rao

* Accepted in ICASSP 2018 
