Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Yangyang Shi

Streaming Transformer Transducer Based Speech Recognition Using Non-Causal Convolution


Oct 07, 2021
Yangyang Shi, Chunyang Wu, Dilin Wang, Alex Xiao, Jay Mahadeokar, Xiaohui Zhang, Chunxi Liu, Ke Li, Yuan Shangguan, Varun Nagaraja, Ozlem Kalinli, Mike Seltzer

* 5 pages, 3 figures, submit to ICASSP 2022 

  Access Paper or Ask Questions

Transferring Voice Knowledge for Acoustic Event Detection: An Empirical Study


Oct 07, 2021
Dawei Liang, Yangyang Shi, Yun Wang, Nayan Singhal, Alex Xiao, Jonathan Shaw, Edison Thomaz, Ozlem Kalinli, Mike Seltzer

* Submitted to ICASSP 2022 

  Access Paper or Ask Questions

Collaborative Training of Acoustic Encoders for Speech Recognition


Jul 13, 2021
Varun Nagaraja, Yangyang Shi, Ganesh Venkatesh, Ozlem Kalinli, Michael L. Seltzer, Vikas Chandra

* INTERSPEECH 2021 

  Access Paper or Ask Questions

On lattice-free boosted MMI training of HMM and CTC-based full-context ASR models


Jul 09, 2021
Xiaohui Zhang, Vimal Manohar, David Zhang, Frank Zhang, Yangyang Shi, Nayan Singhal, Julian Chan, Fuchun Peng, Yatharth Saraf, Mike Seltzer

* submitted to ASRU 2021 

  Access Paper or Ask Questions

Flexi-Transducer: Optimizing Latency, Accuracy and Compute forMulti-Domain On-Device Scenarios


Apr 06, 2021
Jay Mahadeokar, Yangyang Shi, Yuan Shangguan, Chunyang Wu, Alex Xiao, Hang Su, Duc Le, Ozlem Kalinli, Christian Fuegen, Michael L. Seltzer

* Submitted to Interspeech 2021 (under review) 

  Access Paper or Ask Questions

Dissecting User-Perceived Latency of On-Device E2E Speech Recognition


Apr 06, 2021
Yuan Shangguan, Rohit Prabhavalkar, Hang Su, Jay Mahadeokar, Yangyang Shi, Jiatong Zhou, Chunyang Wu, Duc Le, Ozlem Kalinli, Christian Fuegen, Michael L. Seltzer

* Submitted to Interspeech 2021 

  Access Paper or Ask Questions

Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow Fusion


Apr 05, 2021
Duc Le, Mahaveer Jain, Gil Keren, Suyoun Kim, Yangyang Shi, Jay Mahadeokar, Julian Chan, Yuan Shangguan, Christian Fuegen, Ozlem Kalinli, Yatharth Saraf, Michael L. Seltzer

* Submitted to INTERSPEECH 2021 

  Access Paper or Ask Questions

Dynamic Encoder Transducer: A Flexible Solution For Trading Off Accuracy For Latency


Apr 05, 2021
Yangyang Shi, Varun Nagaraja, Chunyang Wu, Jay Mahadeokar, Duc Le, Rohit Prabhavalkar, Alex Xiao, Ching-Feng Yeh, Julian Chan, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer

* 5 pages, 2 figures, submitted Interspeech 2021 

  Access Paper or Ask Questions

Streaming Attention-Based Models with Augmented Memory for End-to-End Speech Recognition


Nov 03, 2020
Ching-Feng Yeh, Yongqiang Wang, Yangyang Shi, Chunyang Wu, Frank Zhang, Julian Chan, Michael L. Seltzer

* IEEE Spoken Language Technology Workshop 2021 

  Access Paper or Ask Questions

Transformer in action: a comparative study of transformer-based acoustic models for large scale speech recognition applications


Oct 29, 2020
Yongqiang Wang, Yangyang Shi, Frank Zhang, Chunyang Wu, Julian Chan, Ching-Feng Yeh, Alex Xiao

* submitted to ICASSP2021 

  Access Paper or Ask Questions

Emformer: Efficient Memory Transformer Based Acoustic Model For Low Latency Streaming Speech Recognition


Oct 29, 2020
Yangyang Shi, Yongqiang Wang, Chunyang Wu, Ching-Feng Yeh, Julian Chan, Frank Zhang, Duc Le, Mike Seltzer

* 5 pages, 2 figures, submitted to ICASSP 2021 

  Access Paper or Ask Questions

Weak-Attention Suppression For Transformer Based Speech Recognition


May 18, 2020
Yangyang Shi, Yongqiang Wang, Chunyang Wu, Christian Fuegen, Frank Zhang, Duc Le, Ching-Feng Yeh, Michael L. Seltzer

* submitted to interspeech 2020 

  Access Paper or Ask Questions

Streaming Transformer-based Acoustic Models Using Self-attention with Augmented Memory


May 16, 2020
Chunyang Wu, Yongqiang Wang, Yangyang Shi, Ching-Feng Yeh, Frank Zhang

* submitted to Interspeech 2020 

  Access Paper or Ask Questions

Knowledge Distillation For Recurrent Neural Network Language Modeling With Trust Regularization


Apr 08, 2019
Yangyang Shi, Mei-Yuh Hwang, Xin Lei, Haoyu Sheng

* ICASSP 2019 

  Access Paper or Ask Questions

End-To-End Speech Recognition Using A High Rank LSTM-CTC Based Model


Mar 12, 2019
Yangyang Shi, Mei-Yuh Hwang, Xin Lei

* ICASSP 2019 

  Access Paper or Ask Questions