Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

Improving Deliberation by Text-Only and Semi-Supervised Training



Ke Hu , Tara N. Sainath , Yanzhang He , Rohit Prabhavalkar , Trevor Strohman , Sepand Mavandadi , Weiran Wang

* Accepted by Interspeech 2022 

   Access Paper or Ask Questions

E2E Segmenter: Joint Segmenting and Decoding for Long-Form ASR



W. Ronny Huang , Shuo-yiin Chang , David Rybach , Rohit Prabhavalkar , Tara N. Sainath , Cyril Allauzen , Cal Peyser , Zhiyun Lu


   Access Paper or Ask Questions

A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes



Shaojin Ding , Weiran Wang , Ding Zhao , Tara N. Sainath , Yanzhang He , Robert David , Rami Botros , Xin Wang , Rina Panigrahy , Qiao Liang , Dongseong Hwang , Ian McGraw , Rohit Prabhavalkar , Trevor Strohman

* Submitted to INTERSPEECH 2022 

   Access Paper or Ask Questions

Improving Rare Word Recognition with LM-aware MWER Training



Weiran Wang , Tongzhou Chen , Tara N. Sainath , Ehsan Variani , Rohit Prabhavalkar , Ronny Huang , Bhuvana Ramabhadran , Neeraj Gaur , Sepand Mavandadi , Cal Peyser , Trevor Strohman , Yanzhang He , David Rybach

* In submission to INTERSPEECH 2022 

   Access Paper or Ask Questions

Neural-FST Class Language Model for End-to-End Speech Recognition



Antoine Bruguier , Duc Le , Rohit Prabhavalkar , Dangna Li , Zhe Liu , Bo Wang , Eun Chang , Fuchun Peng , Ozlem Kalinli , Michael L. Seltzer

* Accepted for publication at ICASSP 2022 

   Access Paper or Ask Questions

Input Length Matters: An Empirical Study Of RNN-T And MWER Training For Long-form Telephony Speech Recognition



Zhiyun Lu , Yanwei Pan , Thibault Doutre , Liangliang Cao , Rohit Prabhavalkar , Chao Zhang , Trevor Strohman


   Access Paper or Ask Questions

A Neural Acoustic Echo Canceller Optimized Using An Automatic Speech Recognizer And Large Scale Synthetic Data



Nathan Howard , Alex Park , Turaj Zakizadeh Shabestary , Alexander Gruenstein , Rohit Prabhavalkar

* To appear in ICASSP 2021 

   Access Paper or Ask Questions

Dissecting User-Perceived Latency of On-Device E2E Speech Recognition



Yuan Shangguan , Rohit Prabhavalkar , Hang Su , Jay Mahadeokar , Yangyang Shi , Jiatong Zhou , Chunyang Wu , Duc Le , Ozlem Kalinli , Christian Fuegen , Michael L. Seltzer

* Submitted to Interspeech 2021 

   Access Paper or Ask Questions

Dynamic Encoder Transducer: A Flexible Solution For Trading Off Accuracy For Latency



Yangyang Shi , Varun Nagaraja , Chunyang Wu , Jay Mahadeokar , Duc Le , Rohit Prabhavalkar , Alex Xiao , Ching-Feng Yeh , Julian Chan , Christian Fuegen , Ozlem Kalinli , Michael L. Seltzer

* 5 pages, 2 figures, submitted Interspeech 2021 

   Access Paper or Ask Questions

1
2
3
4
>>