Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Bhuvana Ramabhadran

BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition


Oct 01, 2021
Yu Zhang, Daniel S. Park, Wei Han, James Qin, Anmol Gulati, Joel Shor, Aren Jansen, Yuanzhong Xu, Yanping Huang, Shibo Wang, Zongwei Zhou, Bo Li, Min Ma, William Chan, Jiahui Yu, Yongqiang Wang, Liangliang Cao, Khe Chai Sim, Bhuvana Ramabhadran, Tara N. Sainath, Françoise Beaufays, Zhifeng Chen, Quoc V. Le, Chung-Cheng Chiu, Ruoming Pang, Yonghui Wu

* 14 pages, 7 figures, 13 tables; v2: minor corrections, reference baselines and bibliography updated 

  Access Paper or Ask Questions

Injecting Text in Self-Supervised Speech Pretraining


Aug 27, 2021
Zhehuai Chen, Yu Zhang, Andrew Rosenberg, Bhuvana Ramabhadran, Gary Wang, Pedro Moreno

* submit to ASRU 2021 

  Access Paper or Ask Questions

LSTM Acoustic Models Learn to Align and Pronounce with Graphemes


Aug 13, 2020
Arindrima Datta, Guanlong Zhao, Bhuvana Ramabhadran, Eugene Weinstein

* 5 pages, 4 figures. This work was done between summer 2018 and spring 2019 

  Access Paper or Ask Questions

Language-agnostic Multilingual Modeling


Apr 20, 2020
Arindrima Datta, Bhuvana Ramabhadran, Jesse Emond, Anjuli Kannan, Brian Roark


  Access Paper or Ask Questions

Generating diverse and natural text-to-speech samples using a quantized fine-grained VAE and auto-regressive prosody prior


Feb 06, 2020
Guangzhi Sun, Yu Zhang, Ron J. Weiss, Yuan Cao, Heiga Zen, Andrew Rosenberg, Bhuvana Ramabhadran, Yonghui Wu

* To appear in ICASSP 2020 

  Access Paper or Ask Questions

Speech Recognition with Augmented Synthesized Speech


Sep 25, 2019
Andrew Rosenberg, Yu Zhang, Bhuvana Ramabhadran, Ye Jia, Pedro Moreno, Yonghui Wu, Zelin Wu

* Accepted for publication at ASRU 2020 

  Access Paper or Ask Questions

Large-Scale Multilingual Speech Recognition with a Streaming End-to-End Model


Sep 11, 2019
Anjuli Kannan, Arindrima Datta, Tara N. Sainath, Eugene Weinstein, Bhuvana Ramabhadran, Yonghui Wu, Ankur Bapna, Zhifeng Chen, Seungji Lee

* Accepted in Interspeech 2019 

  Access Paper or Ask Questions

Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning


Jul 24, 2019
Yu Zhang, Ron J. Weiss, Heiga Zen, Yonghui Wu, Zhifeng Chen, RJ Skerry-Ryan, Ye Jia, Andrew Rosenberg, Bhuvana Ramabhadran

* 5 pages, submitted to Interspeech 2019 

  Access Paper or Ask Questions

Joint Modeling of Accents and Acoustics for Multi-Accent Speech Recognition


Feb 07, 2018
Xuesong Yang, Kartik Audhkhasi, Andrew Rosenberg, Samuel Thomas, Bhuvana Ramabhadran, Mark Hasegawa-Johnson

* Accepted in The 43rd IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP2018) 

  Access Paper or Ask Questions

Building competitive direct acoustics-to-word models for English conversational speech recognition


Dec 08, 2017
Kartik Audhkhasi, Brian Kingsbury, Bhuvana Ramabhadran, George Saon, Michael Picheny

* Submitted to IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018 

  Access Paper or Ask Questions

Language Modeling with Highway LSTM


Sep 19, 2017
Gakuto Kurata, Bhuvana Ramabhadran, George Saon, Abhinav Sethy

* to appear in 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2017) 

  Access Paper or Ask Questions

Direct Acoustics-to-Word Models for English Conversational Speech Recognition


Mar 22, 2017
Kartik Audhkhasi, Bhuvana Ramabhadran, George Saon, Michael Picheny, David Nahamoo

* Submitted to Interspeech-2017 

  Access Paper or Ask Questions

English Conversational Telephone Speech Recognition by Humans and Machines


Mar 06, 2017
George Saon, Gakuto Kurata, Tom Sercu, Kartik Audhkhasi, Samuel Thomas, Dimitrios Dimitriadis, Xiaodong Cui, Bhuvana Ramabhadran, Michael Picheny, Lynn-Li Lim, Bergul Roomi, Phil Hall


  Access Paper or Ask Questions

End-to-End ASR-free Keyword Search from Speech


Jan 13, 2017
Kartik Audhkhasi, Andrew Rosenberg, Abhinav Sethy, Bhuvana Ramabhadran, Brian Kingsbury

* Published in the IEEE 2017 International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2017), scheduled for 5-9 March 2017 in New Orleans, Louisiana, USA 

  Access Paper or Ask Questions

Invariant Representations for Noisy Speech Recognition


Nov 27, 2016
Dmitriy Serdyuk, Kartik Audhkhasi, Philémon Brakel, Bhuvana Ramabhadran, Samuel Thomas, Yoshua Bengio

* 5 pages, 1 figure, 1 table, NIPS workshop on end-to-end speech recognition 

  Access Paper or Ask Questions

Training variance and performance evaluation of neural networks in speech


Jun 14, 2016
Ewout van den Berg, Bhuvana Ramabhadran, Michael Picheny


  Access Paper or Ask Questions

Diverse Embedding Neural Network Language Models


Apr 15, 2015
Kartik Audhkhasi, Abhinav Sethy, Bhuvana Ramabhadran

* Under review as workshop contribution at ICLR 2015 

  Access Paper or Ask Questions

Generalized Ambiguity Decomposition for Understanding Ensemble Diversity


Dec 28, 2013
Kartik Audhkhasi, Abhinav Sethy, Bhuvana Ramabhadran, Shrikanth S. Narayanan

* 32 pages, 10 figures 

  Access Paper or Ask Questions

Accelerating Hessian-free optimization for deep neural networks by implicit preconditioning and sampling


Dec 10, 2013
Tara N. Sainath, Lior Horesh, Brian Kingsbury, Aleksandr Y. Aravkin, Bhuvana Ramabhadran

* this paper is not supposed to be posted publically before the conference in December due to company policy. another co-author was not informed of this and posted without the permission of the first author. pls remove 

  Access Paper or Ask Questions

Improvements to deep convolutional neural networks for LVCSR


Dec 10, 2013
Tara N. Sainath, Brian Kingsbury, Abdel-rahman Mohamed, George E. Dahl, George Saon, Hagen Soltau, Tomas Beran, Aleksandr Y. Aravkin, Bhuvana Ramabhadran

* 6 pages, 1 figure 

  Access Paper or Ask Questions