George Saon

4-bit Quantization of LSTM-based Speech Recognition Models


Aug 27, 2021
Andrea Fasoli, Chia-Yu Chen, Mauricio Serrano, Xiao Sun, Naigang Wang, Swagath Venkataramani, George Saon, Xiaodong Cui, Brian Kingsbury, Wei Zhang, Zoltán Tüske, Kailash Gopalakrishnan

* 5 pages, 3 figures, Andrea Fasoli and Chia-Yu Chen equally contributed to this work. Paper accepted to Interspeech 2021 

Reducing Exposure Bias in Training Recurrent Neural Network Transducers


Aug 24, 2021
Xiaodong Cui, Brian Kingsbury, George Saon, David Haws, Zoltan Tuske

* accepted to Interspeech 2021 

Integrating Dialog History into End-to-End Spoken Language Understanding Systems


Aug 18, 2021
Jatin Ganhotra, Samuel Thomas, Hong-Kwang J. Kuo, Sachindra Joshi, George Saon, Zoltán Tüske, Brian Kingsbury

* Interspeech 2021 

On the limit of English conversational speech recognition


May 03, 2021
Zoltán Tüske, George Saon, Brian Kingsbury


RNN Transducer Models For Spoken Language Understanding


Apr 08, 2021
Samuel Thomas, Hong-Kwang J. Kuo, George Saon, Zoltán Tüske, Brian Kingsbury, Gakuto Kurata, Zvi Kons, Ron Hoory

* To appear in the proceedings of ICASSP 2021 

Advancing RNN Transducer Technology for Speech Recognition


Mar 17, 2021
George Saon, Zoltan Tueske, Daniel Bolanos, Brian Kingsbury

* Accepted at ICASSP 2021 

Distributed Training of Deep Neural Network Acoustic Models for Automatic Speech Recognition


Feb 24, 2020
Xiaodong Cui, Wei Zhang, Ulrich Finkler, George Saon, Michael Picheny, David Kung

* Accepted to IEEE Signal Processing Magazine 

Improving Efficiency in Large-Scale Decentralized Distributed Training


Feb 04, 2020
Wei Zhang, Xiaodong Cui, Abdullah Kayi, Mingrui Liu, Ulrich Finkler, Brian Kingsbury, George Saon, Youssef Mroueh, Alper Buyuktosunoglu, Payel Das, David Kung, Michael Picheny

* 45th International Conference on Acoustics, Speech, and Signal Processing (ICASSP'2020) Oral 

Single headed attention based sequence-to-sequence model for state-of-the-art results on Switchboard-300


Jan 20, 2020
Zoltán Tüske, George Saon, Kartik Audhkhasi, Brian Kingsbury

* 5 pages, 2 figures 

Challenging the Boundaries of Speech Recognition: The MALACH Corpus


Aug 09, 2019
Michael Picheny, Zoltán Tüske, Brian Kingsbury, Kartik Audhkhasi, Xiaodong Cui, George Saon

* Accepted for publication at INTERSPEECH 2019 

A Highly Efficient Distributed Deep Learning System For Automatic Speech Recognition


Jul 10, 2019
Wei Zhang, Xiaodong Cui, Ulrich Finkler, George Saon, Abdullah Kayi, Alper Buyuktosunoglu, Brian Kingsbury, David Kung, Michael Picheny

* INTERSPEECH 2019 

English Broadcast News Speech Recognition by Humans and Machines


Apr 30, 2019
Samuel Thomas, Masayuki Suzuki, Yinghui Huang, Gakuto Kurata, Zoltan Tuske, George Saon, Brian Kingsbury, Michael Picheny, Tom Dibert, Alice Kaiser-Schatzlein, Bern Samko

* © 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

Distributed Deep Learning Strategies For Automatic Speech Recognition


Apr 10, 2019
Wei Zhang, Xiaodong Cui, Ulrich Finkler, Brian Kingsbury, George Saon, David Kung, Michael Picheny

* Published in ICASSP'19 

Building competitive direct acoustics-to-word models for English conversational speech recognition


Dec 08, 2017
Kartik Audhkhasi, Brian Kingsbury, Bhuvana Ramabhadran, George Saon, Michael Picheny

* Submitted to IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018 

Embedding-Based Speaker Adaptive Training of Deep Neural Networks


Oct 17, 2017
Xiaodong Cui, Vaibhava Goel, George Saon


Language Modeling with Highway LSTM


Sep 19, 2017
Gakuto Kurata, Bhuvana Ramabhadran, George Saon, Abhinav Sethy

* to appear in 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2017) 

Direct Acoustics-to-Word Models for English Conversational Speech Recognition


Mar 22, 2017
Kartik Audhkhasi, Bhuvana Ramabhadran, George Saon, Michael Picheny, David Nahamoo

* Submitted to Interspeech-2017 

English Conversational Telephone Speech Recognition by Humans and Machines


Mar 06, 2017
George Saon, Gakuto Kurata, Tom Sercu, Kartik Audhkhasi, Samuel Thomas, Dimitrios Dimitriadis, Xiaodong Cui, Bhuvana Ramabhadran, Michael Picheny, Lynn-Li Lim, Bergul Roomi, Phil Hall


The IBM 2016 English Conversational Telephone Speech Recognition System


Jun 22, 2016
George Saon, Tom Sercu, Steven Rennie, Hong-Kwang J. Kuo

* Submitted to Interspeech 2016 

The IBM 2015 English Conversational Telephone Speech Recognition System


May 21, 2015
George Saon, Hong-Kwang J. Kuo, Steven Rennie, Michael Picheny

* Submitted to Interspeech 2015 

Improvements to deep convolutional neural networks for LVCSR


Dec 10, 2013
Tara N. Sainath, Brian Kingsbury, Abdel-rahman Mohamed, George E. Dahl, George Saon, Hagen Soltau, Tomas Beran, Aleksandr Y. Aravkin, Bhuvana Ramabhadran

* 6 pages, 1 figure 
