Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Kartik Audhkhasi

Leveraging Unpaired Text Data for Training End-to-End Speech-to-Intent Systems


Oct 08, 2020
Yinghui Huang, Hong-Kwang Kuo, Samuel Thomas, Zvi Kons, Kartik Audhkhasi, Brian Kingsbury, Ron Hoory, Michael Picheny

* 5 pages, published in ICASSP 2020 

  Access Paper or Ask Questions

End-to-End Spoken Language Understanding Without Full Transcripts


Sep 30, 2020
Hong-Kwang J. Kuo, Zoltán Tüske, Samuel Thomas, Yinghui Huang, Kartik Audhkhasi, Brian Kingsbury, Gakuto Kurata, Zvi Kons, Ron Hoory, Luis Lastras

* 5 pages, to be published in Interspeech 2020 

  Access Paper or Ask Questions

AVLnet: Learning Audio-Visual Language Representations from Instructional Videos


Jun 16, 2020
Andrew Rouditchenko, Angie Boggust, David Harwath, Dhiraj Joshi, Samuel Thomas, Kartik Audhkhasi, Rogerio Feris, Brian Kingsbury, Michael Picheny, Antonio Torralba, James Glass


  Access Paper or Ask Questions

Single headed attention based sequence-to-sequence model for state-of-the-art results on Switchboard-300


Jan 20, 2020
Zoltán Tüske, George Saon, Kartik Audhkhasi, Brian Kingsbury

* 5 pages, 2 figures 

  Access Paper or Ask Questions

Challenging the Boundaries of Speech Recognition: The MALACH Corpus


Aug 09, 2019
Michael Picheny, Zóltan Tüske, Brian Kingsbury, Kartik Audhkhasi, Xiaodong Cui, George Saon

* Accepted for publication at INTERSPEECH 2019 

  Access Paper or Ask Questions

Guiding CTC Posterior Spike Timings for Improved Posterior Fusion and Knowledge Distillation


Apr 17, 2019
Gakuto Kurata, Kartik Audhkhasi

* Submitted to Interspeech 2019 

  Access Paper or Ask Questions

Acoustically Grounded Word Embeddings for Improved Acoustics-to-Word Speech Recognition


Mar 29, 2019
Shane Settle, Kartik Audhkhasi, Karen Livescu, Michael Picheny

* To appear at ICASSP 2019 

  Access Paper or Ask Questions

Joint Modeling of Accents and Acoustics for Multi-Accent Speech Recognition


Feb 07, 2018
Xuesong Yang, Kartik Audhkhasi, Andrew Rosenberg, Samuel Thomas, Bhuvana Ramabhadran, Mark Hasegawa-Johnson

* Accepted in The 43rd IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP2018) 

  Access Paper or Ask Questions

Building competitive direct acoustics-to-word models for English conversational speech recognition


Dec 08, 2017
Kartik Audhkhasi, Brian Kingsbury, Bhuvana Ramabhadran, George Saon, Michael Picheny

* Submitted to IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018 

  Access Paper or Ask Questions

Direct Acoustics-to-Word Models for English Conversational Speech Recognition


Mar 22, 2017
Kartik Audhkhasi, Bhuvana Ramabhadran, George Saon, Michael Picheny, David Nahamoo

* Submitted to Interspeech-2017 

  Access Paper or Ask Questions

English Conversational Telephone Speech Recognition by Humans and Machines


Mar 06, 2017
George Saon, Gakuto Kurata, Tom Sercu, Kartik Audhkhasi, Samuel Thomas, Dimitrios Dimitriadis, Xiaodong Cui, Bhuvana Ramabhadran, Michael Picheny, Lynn-Li Lim, Bergul Roomi, Phil Hall


  Access Paper or Ask Questions

End-to-End ASR-free Keyword Search from Speech


Jan 13, 2017
Kartik Audhkhasi, Andrew Rosenberg, Abhinav Sethy, Bhuvana Ramabhadran, Brian Kingsbury

* Published in the IEEE 2017 International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2017), scheduled for 5-9 March 2017 in New Orleans, Louisiana, USA 

  Access Paper or Ask Questions

Invariant Representations for Noisy Speech Recognition


Nov 27, 2016
Dmitriy Serdyuk, Kartik Audhkhasi, Philémon Brakel, Bhuvana Ramabhadran, Samuel Thomas, Yoshua Bengio

* 5 pages, 1 figure, 1 table, NIPS workshop on end-to-end speech recognition 

  Access Paper or Ask Questions

Diverse Embedding Neural Network Language Models


Apr 15, 2015
Kartik Audhkhasi, Abhinav Sethy, Bhuvana Ramabhadran

* Under review as workshop contribution at ICLR 2015 

  Access Paper or Ask Questions

Generalized Ambiguity Decomposition for Understanding Ensemble Diversity


Dec 28, 2013
Kartik Audhkhasi, Abhinav Sethy, Bhuvana Ramabhadran, Shrikanth S. Narayanan

* 32 pages, 10 figures 

  Access Paper or Ask Questions