Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Samuel Thomas

Multimodal Clustering Networks for Self-supervised Learning from Unlabeled Videos


May 05, 2021
Brian Chen, Andrew Rouditchenko, Kevin Duarte, Hilde Kuehne, Samuel Thomas, Angie Boggust, Rameswar Panda, Brian Kingsbury, Rogerio Feris, David Harwath, James Glass, Michael Picheny, Shih-Fu Chang


  Access Paper or Ask Questions

RNN Transducer Models For Spoken Language Understanding


Apr 08, 2021
Samuel Thomas, Hong-Kwang J. Kuo, George Saon, Zoltán Tüske, Brian Kingsbury, Gakuto Kurata, Zvi Kons, Ron Hoory

* To appear in the proceedings of ICASSP 2021 

  Access Paper or Ask Questions

Speak or Chat with Me: End-to-End Spoken Language Understanding System with Flexible Inputs


Apr 07, 2021
Sujeong Cha, Wangrui Hou, Hyun Jung, My Phung, Michael Picheny, Hong-Kwang Kuo, Samuel Thomas, Edmilson Morais

* Submitted to Interspeech 2021 

  Access Paper or Ask Questions

End-to-end spoken language understanding using transformer networks and self-supervised pre-trained features


Nov 16, 2020
Edmilson Morais, Hong-Kwang J. Kuo, Samuel Thomas, Zoltan Tuske, Brian Kingsbury

* 5 pages, 3 tables and 1 figure 

  Access Paper or Ask Questions

Leveraging Unpaired Text Data for Training End-to-End Speech-to-Intent Systems


Oct 08, 2020
Yinghui Huang, Hong-Kwang Kuo, Samuel Thomas, Zvi Kons, Kartik Audhkhasi, Brian Kingsbury, Ron Hoory, Michael Picheny

* 5 pages, published in ICASSP 2020 

  Access Paper or Ask Questions

End-to-End Spoken Language Understanding Without Full Transcripts


Sep 30, 2020
Hong-Kwang J. Kuo, Zoltán Tüske, Samuel Thomas, Yinghui Huang, Kartik Audhkhasi, Brian Kingsbury, Gakuto Kurata, Zvi Kons, Ron Hoory, Luis Lastras

* 5 pages, to be published in Interspeech 2020 

  Access Paper or Ask Questions

AVLnet: Learning Audio-Visual Language Representations from Instructional Videos


Jun 16, 2020
Andrew Rouditchenko, Angie Boggust, David Harwath, Dhiraj Joshi, Samuel Thomas, Kartik Audhkhasi, Rogerio Feris, Brian Kingsbury, Michael Picheny, Antonio Torralba, James Glass


  Access Paper or Ask Questions

English Broadcast News Speech Recognition by Humans and Machines


Apr 30, 2019
Samuel Thomas, Masayuki Suzuki, Yinghui Huang, Gakuto Kurata, Zoltan Tuske, George Saon, Brian Kingsbury, Michael Picheny, Tom Dibert, Alice Kaiser-Schatzlein, Bern Samko

* \copyright 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works 

  Access Paper or Ask Questions

Understanding Unequal Gender Classification Accuracy from Face Images


Nov 30, 2018
Vidya Muthukumar, Tejaswini Pedapati, Nalini Ratha, Prasanna Sattigeri, Chai-Wah Wu, Brian Kingsbury, Abhishek Kumar, Samuel Thomas, Aleksandra Mojsilovic, Kush R. Varshney


  Access Paper or Ask Questions

SimplerVoice: A Key Message & Visual Description Generator System for Illiteracy


Nov 03, 2018
Minh N. B. Nguyen, Samuel Thomas, Anne E. Gattiker, Sujatha Kashyap, Kush R. Varshney


  Access Paper or Ask Questions

A Recorded Debating Dataset


Mar 27, 2018
Shachar Mirkin, Michal Jacovi, Tamar Lavee, Hong-Kwang Kuo, Samuel Thomas, Leslie Sager, Lili Kotlerman, Elad Venezian, Noam Slonim


  Access Paper or Ask Questions

Joint Modeling of Accents and Acoustics for Multi-Accent Speech Recognition


Feb 07, 2018
Xuesong Yang, Kartik Audhkhasi, Andrew Rosenberg, Samuel Thomas, Bhuvana Ramabhadran, Mark Hasegawa-Johnson

* Accepted in The 43rd IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP2018) 

  Access Paper or Ask Questions

English Conversational Telephone Speech Recognition by Humans and Machines


Mar 06, 2017
George Saon, Gakuto Kurata, Tom Sercu, Kartik Audhkhasi, Samuel Thomas, Dimitrios Dimitriadis, Xiaodong Cui, Bhuvana Ramabhadran, Michael Picheny, Lynn-Li Lim, Bergul Roomi, Phil Hall


  Access Paper or Ask Questions

Invariant Representations for Noisy Speech Recognition


Nov 27, 2016
Dmitriy Serdyuk, Kartik Audhkhasi, Philémon Brakel, Bhuvana Ramabhadran, Samuel Thomas, Yoshua Bengio

* 5 pages, 1 figure, 1 table, NIPS workshop on end-to-end speech recognition 

  Access Paper or Ask Questions