Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

Paraformer: Fast and Accurate Parallel Transformer for Non-autoregressive End-to-End Speech Recognition


Jun 20, 2022
Zhifu Gao, Shiliang Zhang, Ian McLoughlin, Zhijie Yan

* 5 pages, 3 figures, accepted by INTERSPEECH 2022 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Speaker Embedding-aware Neural Diarization: an Efficient Framework for Overlapping Speech Diarization in Meeting Scenarios


Mar 31, 2022
Zhihao Du, Shiliang Zhang, Siqi Zheng, Zhijie Yan

* Submitted to INTERSPEECH 2022, 5 parges, 2 figure 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

ProsoSpeech: Enhancing Prosody With Quantized Vector Pre-training in Text-to-Speech


Feb 16, 2022
Yi Ren, Ming Lei, Zhiying Huang, Shiliang Zhang, Qian Chen, Zhijie Yan, Zhou Zhao

* Accepted by ICASSP 2022 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Summary On The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge


Feb 08, 2022
Fan Yu, Shiliang Zhang, Pengcheng Guo, Yihui Fu, Zhihao Du, Siqi Zheng, Weilong Huang, Lei Xie, Zheng-Hua Tan, DeLiang Wang, Yanmin Qian, Kong Aik Lee, Zhijie Yan, Bin Ma, Xin Xu, Hui Bu

* 5 pages, 4 tables 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

M2MeT: The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Challenge


Oct 14, 2021
Fan Yu, Shiliang Zhang, Yihui Fu, Lei Xie, Siqi Zheng, Zhihao Du, Weilong Huang, Pengcheng Guo, Zhijie Yan, Bin Ma, Xin Xu, Hui Bu

* 5 pages 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

BeamTransformer: Microphone Array-based Overlapping Speech Detection


Sep 09, 2021
Siqi Zheng, Shiliang Zhang, Weilong Huang, Qian Chen, Hongbin Suo, Ming Lei, Jinwei Feng, Zhijie Yan


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

A Real-time Speaker Diarization System Based on Spatial Spectrum


Jul 20, 2021
Siqi Zheng, Weilong Huang, Xianliang Wang, Hongbin Suo, Jinwei Feng, Zhijie Yan

* Published in ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP); 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Automatic Spelling Correction with Transformer for CTC-based End-to-End Speech Recognition


Mar 27, 2019
Shiliang Zhang, Ming Lei, Zhijie Yan

* 6pages, 5 figures 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Deep-FSMN for Large Vocabulary Continuous Speech Recognition


Mar 04, 2018
Shiliang Zhang, Ming Lei, Zhijie Yan, Lirong Dai


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Deep Feed-forward Sequential Memory Networks for Speech Synthesis


Feb 26, 2018
Mengxiao Bi, Heng Lu, Shiliang Zhang, Ming Lei, Zhijie Yan

* 5 pages, ICASSP 2018 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email