Accented Speech Recognition: Benchmarking, Pre-training, and Diverse Data



Alëna Aksënova, Zhehuai Chen, Chung-Cheng Chiu, Daan van Esch, Pavel Golik, Wei Han, Levi King, Bhuvana Ramabhadran, Andrew Rosenberg, Suzan Schwartz, Gary Wang

* 5 pages, 3 tables 


Improving Rare Word Recognition with LM-aware MWER Training



Weiran Wang, Tongzhou Chen, Tara N. Sainath, Ehsan Variani, Rohit Prabhavalkar, Ronny Huang, Bhuvana Ramabhadran, Neeraj Gaur, Sepand Mavandadi, Cal Peyser, Trevor Strohman, Yanzhang He, David Rybach

* In submission to INTERSPEECH 2022 


MAESTRO: Matched Speech Text Representations through Modality Matching



Zhehuai Chen, Yu Zhang, Andrew Rosenberg, Bhuvana Ramabhadran, Pedro Moreno, Ankur Bapna, Heiga Zen

* Submitted to Interspeech 2022 


Ask2Mask: Guided Data Selection for Masked Speech Modeling



Murali Karthick Baskar, Andrew Rosenberg, Bhuvana Ramabhadran, Yu Zhang, Pedro Moreno



BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition



Yu Zhang, Daniel S. Park, Wei Han, James Qin, Anmol Gulati, Joel Shor, Aren Jansen, Yuanzhong Xu, Yanping Huang, Shibo Wang, Zongwei Zhou, Bo Li, Min Ma, William Chan, Jiahui Yu, Yongqiang Wang, Liangliang Cao, Khe Chai Sim, Bhuvana Ramabhadran, Tara N. Sainath, Françoise Beaufays, Zhifeng Chen, Quoc V. Le, Chung-Cheng Chiu, Ruoming Pang, Yonghui Wu

* 14 pages, 7 figures, 13 tables; v2: minor corrections, reference baselines and bibliography updated 


Injecting Text in Self-Supervised Speech Pretraining



Zhehuai Chen, Yu Zhang, Andrew Rosenberg, Bhuvana Ramabhadran, Gary Wang, Pedro Moreno

* Submitted to ASRU 2021


LSTM Acoustic Models Learn to Align and Pronounce with Graphemes



Arindrima Datta, Guanlong Zhao, Bhuvana Ramabhadran, Eugene Weinstein

* 5 pages, 4 figures. This work was done between summer 2018 and spring 2019 


Language-agnostic Multilingual Modeling



Arindrima Datta, Bhuvana Ramabhadran, Jesse Emond, Anjuli Kannan, Brian Roark



Generating diverse and natural text-to-speech samples using a quantized fine-grained VAE and auto-regressive prosody prior



Guangzhi Sun, Yu Zhang, Ron J. Weiss, Yuan Cao, Heiga Zen, Andrew Rosenberg, Bhuvana Ramabhadran, Yonghui Wu

* To appear in ICASSP 2020 


Speech Recognition with Augmented Synthesized Speech



Andrew Rosenberg, Yu Zhang, Bhuvana Ramabhadran, Ye Jia, Pedro Moreno, Yonghui Wu, Zelin Wu

* Accepted for publication at ASRU 2020 

