Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Duc Le

Flexi-Transducer: Optimizing Latency, Accuracy and Compute forMulti-Domain On-Device Scenarios


Apr 06, 2021
Jay Mahadeokar, Yangyang Shi, Yuan Shangguan, Chunyang Wu, Alex Xiao, Hang Su, Duc Le, Ozlem Kalinli, Christian Fuegen, Michael L. Seltzer

* Submitted to Interspeech 2021 (under review) 

  Access Paper or Ask Questions

Dissecting User-Perceived Latency of On-Device E2E Speech Recognition


Apr 06, 2021
Yuan Shangguan, Rohit Prabhavalkar, Hang Su, Jay Mahadeokar, Yangyang Shi, Jiatong Zhou, Chunyang Wu, Duc Le, Ozlem Kalinli, Christian Fuegen, Michael L. Seltzer

* Submitted to Interspeech 2021 

  Access Paper or Ask Questions

Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow Fusion


Apr 05, 2021
Duc Le, Mahaveer Jain, Gil Keren, Suyoun Kim, Yangyang Shi, Jay Mahadeokar, Julian Chan, Yuan Shangguan, Christian Fuegen, Ozlem Kalinli, Yatharth Saraf, Michael L. Seltzer

* Submitted to INTERSPEECH 2021 

  Access Paper or Ask Questions

Dynamic Encoder Transducer: A Flexible Solution For Trading Off Accuracy For Latency


Apr 05, 2021
Yangyang Shi, Varun Nagaraja, Chunyang Wu, Jay Mahadeokar, Duc Le, Rohit Prabhavalkar, Alex Xiao, Ching-Feng Yeh, Julian Chan, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer

* 5 pages, 2 figures, submitted Interspeech 2021 

  Access Paper or Ask Questions

Semantic Distance: A New Metric for ASR Performance Analysis Towards Spoken Language Understanding


Apr 05, 2021
Suyoun Kim, Abhinav Arora, Duc Le, Ching-Feng Yeh, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer

* submitted to Interspeech 2021 

  Access Paper or Ask Questions

Deep Shallow Fusion for RNN-T Personalization


Nov 16, 2020
Duc Le, Gil Keren, Julian Chan, Jay Mahadeokar, Christian Fuegen, Michael L. Seltzer

* To appear at SLT 2021 

  Access Paper or Ask Questions

Improving RNN Transducer Based ASR with Auxiliary Tasks


Nov 09, 2020
Chunxi Liu, Frank Zhang, Duc Le, Suyoun Kim, Yatharth Saraf, Geoffrey Zweig

* Accepted for publication at IEEE Spoken Language Technology Workshop (SLT), 2021 

  Access Paper or Ask Questions

Alignment Restricted Streaming Recurrent Neural Network Transducer


Nov 05, 2020
Jay Mahadeokar, Yuan Shangguan, Duc Le, Gil Keren, Hang Su, Thong Le, Ching-Feng Yeh, Christian Fuegen, Michael L. Seltzer

* Accepted for presentation at IEEE Spoken Language Technology Workshop (SLT) 2021 

  Access Paper or Ask Questions

Emformer: Efficient Memory Transformer Based Acoustic Model For Low Latency Streaming Speech Recognition


Oct 29, 2020
Yangyang Shi, Yongqiang Wang, Chunyang Wu, Ching-Feng Yeh, Julian Chan, Frank Zhang, Duc Le, Mike Seltzer

* 5 pages, 2 figures, submitted to ICASSP 2021 

  Access Paper or Ask Questions

Improved Neural Language Model Fusion for Streaming Recurrent Neural Network Transducer


Oct 26, 2020
Suyoun Kim, Yuan Shangguan, Jay Mahadeokar, Antoine Bruguier, Christian Fuegen, Michael L. Seltzer, Duc Le

* submitted to ICASSP 2021 

  Access Paper or Ask Questions

Classification of Huntington Disease using Acoustic and Lexical Features


Aug 07, 2020
Matthew Perez, Wenyu Jin, Duc Le, Noelle Carlozzi, Praveen Dayalu, Angela Roberts, Emily Mower Provost

* 4 pages 

  Access Paper or Ask Questions

Weak-Attention Suppression For Transformer Based Speech Recognition


May 18, 2020
Yangyang Shi, Yongqiang Wang, Chunyang Wu, Christian Fuegen, Frank Zhang, Duc Le, Ching-Feng Yeh, Michael L. Seltzer

* submitted to interspeech 2020 

  Access Paper or Ask Questions

Transformer-Transducer: End-to-End Speech Recognition with Self-Attention


Oct 28, 2019
Ching-Feng Yeh, Jay Mahadeokar, Kaustubh Kalgaonkar, Yongqiang Wang, Duc Le, Mahaveer Jain, Kjell Schubert, Christian Fuegen, Michael L. Seltzer


  Access Paper or Ask Questions

G2G: TTS-Driven Pronunciation Learning for Graphemic Hybrid ASR


Oct 22, 2019
Duc Le, Thilo Koehler, Christian Fuegen, Michael L. Seltzer


  Access Paper or Ask Questions

Transformer-based Acoustic Modeling for Hybrid Speech Recognition


Oct 22, 2019
Yongqiang Wang, Abdelrahman Mohamed, Duc Le, Chunxi Liu, Alex Xiao, Jay Mahadeokar, Hongzhao Huang, Andros Tjandra, Xiaohui Zhang, Frank Zhang, Christian Fuegen, Geoffrey Zweig, Michael L. Seltzer


  Access Paper or Ask Questions

From Senones to Chenones: Tied Context-Dependent Graphemes for Hybrid Speech Recognition


Oct 11, 2019
Duc Le, Xiaohui Zhang, Weiyi Zheng, Christian Fügen, Geoffrey Zweig, Michael L. Seltzer

* To appear at ASRU 2019 

  Access Paper or Ask Questions