Yifan Gong

On Addressing Practical Challenges for RNN-Transducer


May 04, 2021
Rui Zhao, Jian Xue, Jinyu Li, Wenning Wei, Lei He, Yifan Gong

* 5 pages 


Streaming Multi-talker Speech Recognition with Joint Speaker Identification


Apr 05, 2021
Liang Lu, Naoyuki Kanda, Jinyu Li, Yifan Gong

* 5 pages, 2 figures, submitted to Interspeech 2021 


Internal Language Model Training for Domain-Adaptive End-to-End Speech Recognition


Feb 02, 2021
Zhong Meng, Naoyuki Kanda, Yashesh Gaur, Sarangarajan Parthasarathy, Eric Sun, Liang Lu, Xie Chen, Jinyu Li, Yifan Gong

* 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Toronto, Canada 
* 5 pages, ICASSP 2021 


Streaming end-to-end multi-talker speech recognition


Nov 26, 2020
Liang Lu, Naoyuki Kanda, Jinyu Li, Yifan Gong

* 5 pages, 4 figures 


Internal Language Model Estimation for Domain-Adaptive End-to-End Speech Recognition


Nov 03, 2020
Zhong Meng, Sarangarajan Parthasarathy, Eric Sun, Yashesh Gaur, Naoyuki Kanda, Liang Lu, Xie Chen, Rui Zhao, Jinyu Li, Yifan Gong

* 2021 IEEE Spoken Language Technology Workshop (SLT) 
* 8 pages, 2 figures, SLT 2021 


On Minimum Word Error Rate Training of the Hybrid Autoregressive Transducer


Oct 23, 2020
Liang Lu, Zhong Meng, Naoyuki Kanda, Jinyu Li, Yifan Gong

* 5 pages, submitted to ICASSP 2021 


Speaker Separation Using Speaker Inventories and Estimated Speech


Oct 20, 2020
Peidong Wang, Zhuo Chen, DeLiang Wang, Jinyu Li, Yifan Gong



Developing RNN-T Models Surpassing High-Performance Hybrid Models with Customization Capability


Jul 30, 2020
Jinyu Li, Rui Zhao, Zhong Meng, Yanqing Liu, Wenning Wei, Sarangarajan Parthasarathy, Vadim Mazalov, Zhenghao Wang, Lei He, Sheng Zhao, Yifan Gong

* Accepted by Interspeech 2020 


Exploring Transformers for Large-Scale Speech Recognition


May 19, 2020
Liang Lu, Changliang Liu, Jinyu Li, Yifan Gong

* 5 pages, 1 figure 


Minimum Latency Training Strategies for Streaming Sequence-to-Sequence ASR


May 15, 2020
Hirofumi Inaguma, Yashesh Gaur, Liang Lu, Jinyu Li, Yifan Gong

* Accepted at IEEE ICASSP 2020 


Exploring Pre-training with Alignments for RNN Transducer based End-to-End Speech Recognition


May 01, 2020
Hu Hu, Rui Zhao, Jinyu Li, Liang Lu, Yifan Gong

* Accepted by ICASSP 2020 


L-Vector: Neural Label Embedding for Domain Adaptation


Apr 25, 2020
Zhong Meng, Hu Hu, Jinyu Li, Changliang Liu, Yan Huang, Yifan Gong, Chin-Hui Lee

* 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain
* 5 pages, 2 figures, ICASSP 2020


High-Accuracy and Low-Latency Speech Recognition with Two-Head Contextual Layer Trajectory LSTM Model


Mar 17, 2020
Jinyu Li, Rui Zhao, Eric Sun, Jeremy H. M. Wong, Amit Das, Zhong Meng, Yifan Gong

* Accepted by ICASSP 2020 


A Privacy-Preserving DNN Pruning and Mobile Acceleration Framework


Mar 13, 2020
Zheng Zhan, Yifan Gong, Zhengang Li, Pu Zhao, Xiaolong Ma, Wei Niu, Xiaolin Xu, Bin Ren, Yanzhi Wang, Xue Lin



BLK-REW: A Unified Block-based DNN Pruning Framework using Reweighted Regularization Method


Feb 22, 2020
Xiaolong Ma, Zhengang Li, Yifan Gong, Tianyun Zhang, Wei Niu, Zheng Zhan, Pu Zhao, Jian Tang, Xue Lin, Bin Ren, Yanzhi Wang



RTMobile: Beyond Real-Time Mobile Acceleration of RNNs for Speech Recognition


Feb 19, 2020
Peiyan Dong, Siyue Wang, Wei Niu, Chengming Zhang, Sheng Lin, Zhengang Li, Yifan Gong, Bin Ren, Xue Lin, Yanzhi Wang, Dingwen Tao



SS-Auto: A Single-Shot, Automatic Structured Weight Pruning Framework of DNNs with Ultra-High Efficiency


Jan 23, 2020
Zhengang Li, Yifan Gong, Xiaolong Ma, Sijia Liu, Mengshu Sun, Zheng Zhan, Zhenglun Kong, Geng Yuan, Yanzhi Wang



Domain Adaptation via Teacher-Student Learning for End-to-End Speech Recognition


Jan 06, 2020
Zhong Meng, Jinyu Li, Yashesh Gaur, Yifan Gong

* 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Sentosa, Singapore 
* 8 pages, 2 figures, ASRU 2019 


Character-Aware Attention-Based End-to-End Speech Recognition


Jan 06, 2020
Zhong Meng, Yashesh Gaur, Jinyu Li, Yifan Gong

* 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Sentosa, Singapore 
* 7 pages, 3 figures, ASRU 2019 


Advances in Online Audio-Visual Meeting Transcription


Dec 10, 2019
Takuya Yoshioka, Igor Abramovski, Cem Aksoylar, Zhuo Chen, Moshe David, Dimitrios Dimitriadis, Yifan Gong, Ilya Gurvich, Xuedong Huang, Yan Huang, Aviv Hurvitz, Li Jiang, Sharon Koubi, Eyal Krupka, Ido Leichter, Changliang Liu, Partha Parthasarathy, Alon Vinnikov, Lingfeng Wu, Xiong Xiao, Wayne Xiong, Huaming Wang, Zhenghao Wang, Jun Zhang, Yong Zhao, Tianyan Zhou

* To appear in Proc. IEEE ASRU Workshop 2019 


Speaker Adaptation for Attention-Based End-to-End Speech Recognition


Nov 09, 2019
Zhong Meng, Yashesh Gaur, Jinyu Li, Yifan Gong

* Interspeech 2019, Graz, Austria 
* 5 pages, 3 figures, Interspeech 2019 


Improving RNN Transducer Modeling for End-to-End Speech Recognition


Sep 26, 2019
Jinyu Li, Rui Zhao, Hu Hu, Yifan Gong

* Accepted by IEEE ASRU workshop, 2019 


Self-Teaching Networks


Sep 09, 2019
Liang Lu, Eric Sun, Yifan Gong

* 5 pages, Interspeech 2019 


PyKaldi2: Yet another speech toolkit based on Kaldi and PyTorch


Jul 30, 2019
Liang Lu, Xiong Xiao, Zhuo Chen, Yifan Gong

* 5 pages, 2 figures 


Pykaldi2: Yet another speech toolkit based on Kaldi and Pytorch


Jul 12, 2019
Liang Lu, Xiong Xiao, Zhuo Chen, Yifan Gong

* 5 pages, 2 figures 


Encrypted Speech Recognition using Deep Polynomial Networks


May 11, 2019
Shi-Xiong Zhang, Yifan Gong, Dong Yu

* ICASSP 2019, https://www.researchgate.net/publication/333005422_Encrypted_Speech_Recognition_using_deep_polynomial_networks


Adversarial Speaker Adaptation


Apr 29, 2019
Zhong Meng, Jinyu Li, Yifan Gong

* 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, United Kingdom 
* 5 pages, 2 figures, ICASSP 2019 


Adversarial Speaker Verification


Apr 29, 2019
Zhong Meng, Yong Zhao, Jinyu Li, Yifan Gong

* 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, United Kingdom 
* 5 pages, 1 figure, ICASSP 2019 
