Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Zhong Meng

Separating Long-Form Speech with Group-Wise Permutation Invariant Training


Nov 17, 2021
Wangyou Zhang, Zhuo Chen, Naoyuki Kanda, Shujie Liu, Jinyu Li, Sefik Emre Eskimez, Takuya Yoshioka, Xiong Xiao, Zhong Meng, Yanmin Qian, Furu Wei

* 5 pages, 3 figures, 3 tables, submitted to IEEE ICASSP 2022 

  Access Paper or Ask Questions

Continuous Speech Separation with Recurrent Selective Attention Network


Oct 28, 2021
Yixuan Zhang, Zhuo Chen, Jian Wu, Takuya Yoshioka, Peidong Wang, Zhong Meng, Jinyu Li

* Submitted to ICASSP 2022 

  Access Paper or Ask Questions

Factorized Neural Transducer for Efficient Language Model Adaptation


Oct 18, 2021
Xie Chen, Zhong Meng, Sarangarajan Parthasarathy, Jinyu Li


  Access Paper or Ask Questions

Internal Language Model Adaptation with Text-Only Data for End-to-End Speech Recognition


Oct 14, 2021
Zhong Meng, Yashesh Gaur, Naoyuki Kanda, Jinyu Li, Xie Chen, Yu Wu, Yifan Gong

* 5 pages, submitted to ICASSP 2022 

  Access Paper or Ask Questions

Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers using End-to-End Speaker-Attributed ASR


Oct 07, 2021
Naoyuki Kanda, Xiong Xiao, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka

* Submitted to ICASSP 2022 

  Access Paper or Ask Questions

A Comparative Study of Modular and Joint Approaches for Speaker-Attributed ASR on Monaural Long-Form Audio


Jul 06, 2021
Naoyuki Kanda, Xiong Xiao, Jian Wu, Tianyan Zhou, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka

* Submitted to ASRU 2021 

  Access Paper or Ask Questions

Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition


Jun 04, 2021
Zhong Meng, Yu Wu, Naoyuki Kanda, Liang Lu, Xie Chen, Guoli Ye, Eric Sun, Jinyu Li, Yifan Gong

* Interspeech 2021, Brno, Czech Republic 
* 5 pages, Interspeech 2021 

  Access Paper or Ask Questions

Large-Scale Pre-Training of End-to-End Multi-Talker ASR for Meeting Transcription with Single Distant Microphone


Apr 12, 2021
Naoyuki Kanda, Guoli Ye, Yu Wu, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka

* Submitted to INTERSPEECH 2021 

  Access Paper or Ask Questions

End-to-End Speaker-Attributed ASR with Transformer


Apr 05, 2021
Naoyuki Kanda, Guoli Ye, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka

* Submitted to INTERSPEECH 2021 

  Access Paper or Ask Questions

Continuous Speech Separation with Ad Hoc Microphone Arrays


Mar 03, 2021
Dongmei Wang, Takuya Yoshioka, Zhuo Chen, Xiaofei Wang, Tianyan Zhou, Zhong Meng


  Access Paper or Ask Questions

Internal Language Model Training for Domain-Adaptive End-to-End Speech Recognition


Feb 02, 2021
Zhong Meng, Naoyuki Kanda, Yashesh Gaur, Sarangarajan Parthasarathy, Eric Sun, Liang Lu, Xie Chen, Jinyu Li, Yifan Gong

* 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Toronto, Canada 
* 5 pages, ICASSP 2021 

  Access Paper or Ask Questions

Hypothesis Stitcher for End-to-End Speaker-attributed ASR on Long-form Multi-talker Recordings


Jan 06, 2021
Xuankai Chang, Naoyuki Kanda, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Takuya Yoshioka

* Submitted to ICASSP 2021 

  Access Paper or Ask Questions

Minimum Bayes Risk Training for End-to-End Speaker-Attributed ASR


Nov 03, 2020
Naoyuki Kanda, Zhong Meng, Liang Lu, Yashesh Gaur, Xiaofei Wang, Zhuo Chen, Takuya Yoshioka

* Submitted to ICASSP 2021. arXiv admin note: text overlap with arXiv:2006.10930, arXiv:2008.04546 

  Access Paper or Ask Questions

Internal Language Model Estimation for Domain-Adaptive End-to-End Speech Recognition


Nov 03, 2020
Zhong Meng, Sarangarajan Parthasarathy, Eric Sun, Yashesh Gaur, Naoyuki Kanda, Liang Lu, Xie Chen, Rui Zhao, Jinyu Li, Yifan Gong

* 2021 IEEE Spoken Language Technology Workshop (SLT) 
* 8 pages, 2 figures, SLT 2021 

  Access Paper or Ask Questions

On Minimum Word Error Rate Training of the Hybrid Autoregressive Transducer


Oct 23, 2020
Liang Lu, Zhong Meng, Naoyuki Kanda, Jinyu Li, Yifan Gong

* 5 pages, submitted to ICASSP 2021 

  Access Paper or Ask Questions

Investigation of End-To-End Speaker-Attributed ASR for Continuous Multi-Talker Recordings


Aug 11, 2020
Naoyuki Kanda, Xuankai Chang, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka


  Access Paper or Ask Questions

Developing RNN-T Models Surpassing High-Performance Hybrid Models with Customization Capability


Jul 30, 2020
Jinyu Li, Rui Zhao, Zhong Meng, Yanqing Liu, Wenning Wei, Sarangarajan Parthasarathy, Vadim Mazalov, Zhenghao Wang, Lei He, Sheng Zhao, Yifan Gong

* Accepted by Interspeech 2020 

  Access Paper or Ask Questions

Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of Any Number of Speakers


Jun 19, 2020
Naoyuki Kanda, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Tianyan Zhou, Takuya Yoshioka

* Submitted to INTERSPEECH 2020 

  Access Paper or Ask Questions

Active Voice Authentication


Apr 25, 2020
Zhong Meng, M Umair Bin Altaf, Biing-Hwang, Juang

* Digital Signal Processing, Volume 101, June 2020, 102672, ISSN 1051-2004 
* 39 pages, 4 figures 

  Access Paper or Ask Questions

L-Vector: Neural Label Embedding for Domain Adaptation


Apr 25, 2020
Zhong Meng, Hu Hu, Jinyu Li, Changliang Liu, Yan Huang, Yifan Gong, Chin-Hui Lee

* 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain 
* 5 pages, 2 figure, ICASSP 2020 

  Access Paper or Ask Questions

Serialized Output Training for End-to-End Overlapped Speech Recognition


Mar 28, 2020
Naoyuki Kanda, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Takuya Yoshioka

* Submitted to INTERSPEECH 2020 

  Access Paper or Ask Questions

High-Accuracy and Low-Latency Speech Recognition with Two-Head Contextual Layer Trajectory LSTM Model


Mar 17, 2020
Jinyu Li, Rui Zhao, Eric Sun, Jeremy H. M. Wong, Amit Das, Zhong Meng, Yifan Gong

* Accepted by ICASSP 2020 

  Access Paper or Ask Questions

Continuous speech separation: dataset and analysis


Jan 30, 2020
Zhuo Chen, Takuya Yoshioka, Liang Lu, Tianyan Zhou, Zhong Meng, Yi Luo, Jian Wu, Jinyu Li


  Access Paper or Ask Questions

Domain Adaptation via Teacher-Student Learning for End-to-End Speech Recognition


Jan 06, 2020
Zhong Meng, Jinyu Li, Yashesh Gaur, Yifan Gong

* 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Sentosa, Singapore 
* 8 pages, 2 figures, ASRU 2019 

  Access Paper or Ask Questions

Character-Aware Attention-Based End-to-End Speech Recognition


Jan 06, 2020
Zhong Meng, Yashesh Gaur, Jinyu Li, Yifan Gong

* 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Sentosa, Singapore 
* 7 pages, 3 figures, ASRU 2019 

  Access Paper or Ask Questions