Get our free extension to see links to code for papers anywhere online!

 Add to Chrome

 Add to Firefox

Language-agnostic Multilingual Modeling

Apr 20, 2020
Arindrima Datta, Bhuvana Ramabhadran, Jesse Emond, Anjuli Kannan, Brian Roark


  Access Model/Code and Paper
Generating diverse and natural text-to-speech samples using a quantized fine-grained VAE and auto-regressive prosody prior

Feb 06, 2020
Guangzhi Sun, Yu Zhang, Ron J. Weiss, Yuan Cao, Heiga Zen, Andrew Rosenberg, Bhuvana Ramabhadran, Yonghui Wu

* To appear in ICASSP 2020 

  Access Model/Code and Paper
Speech Recognition with Augmented Synthesized Speech

Sep 25, 2019
Andrew Rosenberg, Yu Zhang, Bhuvana Ramabhadran, Ye Jia, Pedro Moreno, Yonghui Wu, Zelin Wu

* Accepted for publication at ASRU 2020 

  Access Model/Code and Paper
Large-Scale Multilingual Speech Recognition with a Streaming End-to-End Model

Sep 11, 2019
Anjuli Kannan, Arindrima Datta, Tara N. Sainath, Eugene Weinstein, Bhuvana Ramabhadran, Yonghui Wu, Ankur Bapna, Zhifeng Chen, Seungji Lee

* Accepted in Interspeech 2019 

  Access Model/Code and Paper
Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning

Jul 24, 2019
Yu Zhang, Ron J. Weiss, Heiga Zen, Yonghui Wu, Zhifeng Chen, RJ Skerry-Ryan, Ye Jia, Andrew Rosenberg, Bhuvana Ramabhadran

* 5 pages, submitted to Interspeech 2019 

  Access Model/Code and Paper
Joint Modeling of Accents and Acoustics for Multi-Accent Speech Recognition

Feb 07, 2018
Xuesong Yang, Kartik Audhkhasi, Andrew Rosenberg, Samuel Thomas, Bhuvana Ramabhadran, Mark Hasegawa-Johnson

* Accepted in The 43rd IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP2018) 

  Access Model/Code and Paper
Building competitive direct acoustics-to-word models for English conversational speech recognition

Dec 08, 2017
Kartik Audhkhasi, Brian Kingsbury, Bhuvana Ramabhadran, George Saon, Michael Picheny

* Submitted to IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018 

  Access Model/Code and Paper
Language Modeling with Highway LSTM

Sep 19, 2017
Gakuto Kurata, Bhuvana Ramabhadran, George Saon, Abhinav Sethy

* to appear in 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2017) 

  Access Model/Code and Paper
Direct Acoustics-to-Word Models for English Conversational Speech Recognition

Mar 22, 2017
Kartik Audhkhasi, Bhuvana Ramabhadran, George Saon, Michael Picheny, David Nahamoo

* Submitted to Interspeech-2017 

  Access Model/Code and Paper
English Conversational Telephone Speech Recognition by Humans and Machines

Mar 06, 2017
George Saon, Gakuto Kurata, Tom Sercu, Kartik Audhkhasi, Samuel Thomas, Dimitrios Dimitriadis, Xiaodong Cui, Bhuvana Ramabhadran, Michael Picheny, Lynn-Li Lim, Bergul Roomi, Phil Hall


  Access Model/Code and Paper
End-to-End ASR-free Keyword Search from Speech

Jan 13, 2017
Kartik Audhkhasi, Andrew Rosenberg, Abhinav Sethy, Bhuvana Ramabhadran, Brian Kingsbury

* Published in the IEEE 2017 International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2017), scheduled for 5-9 March 2017 in New Orleans, Louisiana, USA 

  Access Model/Code and Paper
Invariant Representations for Noisy Speech Recognition

Nov 27, 2016
Dmitriy Serdyuk, Kartik Audhkhasi, Philémon Brakel, Bhuvana Ramabhadran, Samuel Thomas, Yoshua Bengio

* 5 pages, 1 figure, 1 table, NIPS workshop on end-to-end speech recognition 

  Access Model/Code and Paper
Training variance and performance evaluation of neural networks in speech

Jun 14, 2016
Ewout van den Berg, Bhuvana Ramabhadran, Michael Picheny


  Access Model/Code and Paper
Diverse Embedding Neural Network Language Models

Apr 15, 2015
Kartik Audhkhasi, Abhinav Sethy, Bhuvana Ramabhadran

* Under review as workshop contribution at ICLR 2015 

  Access Model/Code and Paper
Generalized Ambiguity Decomposition for Understanding Ensemble Diversity

Dec 28, 2013
Kartik Audhkhasi, Abhinav Sethy, Bhuvana Ramabhadran, Shrikanth S. Narayanan

* 32 pages, 10 figures 

  Access Model/Code and Paper
Accelerating Hessian-free optimization for deep neural networks by implicit preconditioning and sampling

Dec 10, 2013
Tara N. Sainath, Lior Horesh, Brian Kingsbury, Aleksandr Y. Aravkin, Bhuvana Ramabhadran

* this paper is not supposed to be posted publically before the conference in December due to company policy. another co-author was not informed of this and posted without the permission of the first author. pls remove 

  Access Model/Code and Paper
Improvements to deep convolutional neural networks for LVCSR

Dec 10, 2013
Tara N. Sainath, Brian Kingsbury, Abdel-rahman Mohamed, George E. Dahl, George Saon, Hagen Soltau, Tomas Beran, Aleksandr Y. Aravkin, Bhuvana Ramabhadran

* 6 pages, 1 figure 

  Access Model/Code and Paper