Michael L. Seltzer

Flexi-Transducer: Optimizing Latency, Accuracy and Compute for Multi-Domain On-Device Scenarios


Apr 06, 2021
Jay Mahadeokar, Yangyang Shi, Yuan Shangguan, Chunyang Wu, Alex Xiao, Hang Su, Duc Le, Ozlem Kalinli, Christian Fuegen, Michael L. Seltzer

* Submitted to Interspeech 2021 (under review) 


Dissecting User-Perceived Latency of On-Device E2E Speech Recognition


Apr 06, 2021
Yuan Shangguan, Rohit Prabhavalkar, Hang Su, Jay Mahadeokar, Yangyang Shi, Jiatong Zhou, Chunyang Wu, Duc Le, Ozlem Kalinli, Christian Fuegen, Michael L. Seltzer

* Submitted to Interspeech 2021 


Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow Fusion


Apr 05, 2021
Duc Le, Mahaveer Jain, Gil Keren, Suyoun Kim, Yangyang Shi, Jay Mahadeokar, Julian Chan, Yuan Shangguan, Christian Fuegen, Ozlem Kalinli, Yatharth Saraf, Michael L. Seltzer

* Submitted to INTERSPEECH 2021 


Dynamic Encoder Transducer: A Flexible Solution For Trading Off Accuracy For Latency


Apr 05, 2021
Yangyang Shi, Varun Nagaraja, Chunyang Wu, Jay Mahadeokar, Duc Le, Rohit Prabhavalkar, Alex Xiao, Ching-Feng Yeh, Julian Chan, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer

* 5 pages, 2 figures, submitted to Interspeech 2021


Semantic Distance: A New Metric for ASR Performance Analysis Towards Spoken Language Understanding


Apr 05, 2021
Suyoun Kim, Abhinav Arora, Duc Le, Ching-Feng Yeh, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer

* submitted to Interspeech 2021 


Memory-efficient Speech Recognition on Smart Devices


Feb 23, 2021
Ganesh Venkatesh, Alagappan Valliappan, Jay Mahadeokar, Yuan Shangguan, Christian Fuegen, Michael L. Seltzer, Vikas Chandra

* ICASSP 2021 


Deep Shallow Fusion for RNN-T Personalization


Nov 16, 2020
Duc Le, Gil Keren, Julian Chan, Jay Mahadeokar, Christian Fuegen, Michael L. Seltzer

* To appear at SLT 2021 


Alignment Restricted Streaming Recurrent Neural Network Transducer


Nov 05, 2020
Jay Mahadeokar, Yuan Shangguan, Duc Le, Gil Keren, Hang Su, Thong Le, Ching-Feng Yeh, Christian Fuegen, Michael L. Seltzer

* Accepted for presentation at IEEE Spoken Language Technology Workshop (SLT) 2021 


Streaming Attention-Based Models with Augmented Memory for End-to-End Speech Recognition


Nov 03, 2020
Ching-Feng Yeh, Yongqiang Wang, Yangyang Shi, Chunyang Wu, Frank Zhang, Julian Chan, Michael L. Seltzer

* IEEE Spoken Language Technology Workshop 2021 


Improved Neural Language Model Fusion for Streaming Recurrent Neural Network Transducer


Oct 26, 2020
Suyoun Kim, Yuan Shangguan, Jay Mahadeokar, Antoine Bruguier, Christian Fuegen, Michael L. Seltzer, Duc Le

* submitted to ICASSP 2021 


Weak-Attention Suppression For Transformer Based Speech Recognition


May 18, 2020
Yangyang Shi, Yongqiang Wang, Chunyang Wu, Christian Fuegen, Frank Zhang, Duc Le, Ching-Feng Yeh, Michael L. Seltzer

* Submitted to Interspeech 2020


AIPNet: Generative Adversarial Pre-training of Accent-invariant Networks for End-to-end Speech Recognition


Nov 27, 2019
Yi-Chen Chen, Zhaojun Yang, Ching-Feng Yeh, Mahaveer Jain, Michael L. Seltzer



RNN-T For Latency Controlled ASR With Improved Beam Search


Nov 05, 2019
Mahaveer Jain, Kjell Schubert, Jay Mahadeokar, Ching-Feng Yeh, Kaustubh Kalgaonkar, Anuroop Sriram, Christian Fuegen, Michael L. Seltzer



Transformer-Transducer: End-to-End Speech Recognition with Self-Attention


Oct 28, 2019
Ching-Feng Yeh, Jay Mahadeokar, Kaustubh Kalgaonkar, Yongqiang Wang, Duc Le, Mahaveer Jain, Kjell Schubert, Christian Fuegen, Michael L. Seltzer



G2G: TTS-Driven Pronunciation Learning for Graphemic Hybrid ASR


Oct 22, 2019
Duc Le, Thilo Koehler, Christian Fuegen, Michael L. Seltzer



Transformer-based Acoustic Modeling for Hybrid Speech Recognition


Oct 22, 2019
Yongqiang Wang, Abdelrahman Mohamed, Duc Le, Chunxi Liu, Alex Xiao, Jay Mahadeokar, Hongzhao Huang, Andros Tjandra, Xiaohui Zhang, Frank Zhang, Christian Fuegen, Geoffrey Zweig, Michael L. Seltzer



From Senones to Chenones: Tied Context-Dependent Graphemes for Hybrid Speech Recognition


Oct 11, 2019
Duc Le, Xiaohui Zhang, Weiyi Zheng, Christian Fügen, Geoffrey Zweig, Michael L. Seltzer

* To appear at ASRU 2019 


End-to-end contextual speech recognition using class language models and a token passing decoder


Dec 05, 2018
Zhehuai Chen, Mahaveer Jain, Yongqiang Wang, Michael L. Seltzer, Christian Fuegen

* Submitted to ICASSP 2019


Improved training for online end-to-end speech recognition systems


Aug 30, 2018
Suyoun Kim, Michael L. Seltzer, Jinyu Li, Rui Zhao

* Interspeech 2018 


Towards Language-Universal End-to-End Speech Recognition


Nov 06, 2017
Suyoun Kim, Michael L. Seltzer

* submitted to ICASSP 2018 


Large-Scale Domain Adaptation via Teacher-Student Learning


Aug 17, 2017
Jinyu Li, Michael L. Seltzer, Xi Wang, Rui Zhao, Yifan Gong



Feature Learning in Deep Neural Networks - Studies on Speech Recognition Tasks


Mar 08, 2013
Dong Yu, Michael L. Seltzer, Jinyu Li, Jui-Ting Huang, Frank Seide

* ICLR 2013, 9 pages, 4 figures 
