Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Zhuo Chen

Separating Long-Form Speech with Group-Wise Permutation Invariant Training


Nov 17, 2021
Wangyou Zhang, Zhuo Chen, Naoyuki Kanda, Shujie Liu, Jinyu Li, Sefik Emre Eskimez, Takuya Yoshioka, Xiong Xiao, Zhong Meng, Yanmin Qian, Furu Wei

* 5 pages, 3 figures, 3 tables, submitted to IEEE ICASSP 2022 

  Access Paper or Ask Questions

WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing


Oct 29, 2021
Sanyuan Chen, Chengyi Wang, Zhengyang Chen, Yu Wu, Shujie Liu, Zhuo Chen, Jinyu Li, Naoyuki Kanda, Takuya Yoshioka, Xiong Xiao, Jian Wu, Long Zhou, Shuo Ren, Yanmin Qian, Yao Qian, Jian Wu, Michael Zeng, Furu Wei


  Access Paper or Ask Questions

Continuous Speech Separation with Recurrent Selective Attention Network


Oct 28, 2021
Yixuan Zhang, Zhuo Chen, Jian Wu, Takuya Yoshioka, Peidong Wang, Zhong Meng, Jinyu Li

* Submitted to ICASSP 2022 

  Access Paper or Ask Questions

VarArray: Array-Geometry-Agnostic Continuous Speech Separation


Oct 26, 2021
Takuya Yoshioka, Xiaofei Wang, Dongmei Wang, Min Tang, Zirun Zhu, Zhuo Chen, Naoyuki Kanda

* 5 pages, 1 figure, 3 tables, submitted to ICASSP 2022; updated reference information of [33] 

  Access Paper or Ask Questions

One model to enhance them all: array geometry agnostic multi-channel personalized speech enhancement


Oct 20, 2021
Hassan Taherian, Sefik Emre Eskimez, Takuya Yoshioka, Huaming Wang, Zhuo Chen, Xuedong Huang

* Submitted to ICASSP 2022 

  Access Paper or Ask Questions

Personalized Speech Enhancement: New Models and Comprehensive Evaluation


Oct 18, 2021
Sefik Emre Eskimez, Takuya Yoshioka, Huaming Wang, Xiaofei Wang, Zhuo Chen, Xuedong Huang


  Access Paper or Ask Questions

All-neural beamformer for continuous speech separation


Oct 13, 2021
Zhuohuang Zhang, Takuya Yoshioka, Naoyuki Kanda, Zhuo Chen, Xiaofei Wang, Dongmei Wang, Sefik Emre Eskimez

* 5 pages, 3 figures, 2 tables 

  Access Paper or Ask Questions

UniSpeech-SAT: Universal Speech Representation Learning with Speaker Aware Pre-Training


Oct 12, 2021
Sanyuan Chen, Yu Wu, Chengyi Wang, Zhengyang Chen, Zhuo Chen, Shujie Liu, Jian Wu, Yao Qian, Furu Wei, Jinyu Li, Xiangzhan Yu

* ICASSP 2022 Submission 

  Access Paper or Ask Questions

Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers using End-to-End Speaker-Attributed ASR


Oct 07, 2021
Naoyuki Kanda, Xiong Xiao, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka

* Submitted to ICASSP 2022 

  Access Paper or Ask Questions

Continuous Streaming Multi-Talker ASR with Dual-path Transducers


Sep 17, 2021
Desh Raj, Liang Lu, Zhuo Chen, Yashesh Gaur, Jinyu Li

* Submitted to IEEE ICASSP 2022 

  Access Paper or Ask Questions

Target-speaker Voice Activity Detection with Improved I-Vector Estimation for Unknown Number of Speaker


Aug 07, 2021
Maokui He, Desh Raj, Zili Huang, Jun Du, Zhuo Chen, Shinji Watanabe


  Access Paper or Ask Questions

Spacetime Neural Network for High Dimensional Quantum Dynamics


Aug 04, 2021
Jiangran Wang, Zhuo Chen, Di Luo, Zhizhen Zhao, Vera Mikyoung Hur, Bryan K. Clark


  Access Paper or Ask Questions

Zero-shot Visual Question Answering using Knowledge Graph


Jul 14, 2021
Zhuo Chen, Jiaoyan Chen, Yuxia Geng, Jeff Z. Pan, Zonggang Yuan, Huajun Chen

* accepted at the International Semantic Web Conference '21 (ISWC 2021) 

  Access Paper or Ask Questions

Collaboration of Experts: Achieving 80% Top-1 Accuracy on ImageNet with 100M FLOPs


Jul 08, 2021
Yikang Zhang, Zhuo Chen, Zhao Zhong


  Access Paper or Ask Questions

A Comparative Study of Modular and Joint Approaches for Speaker-Attributed ASR on Monaural Long-Form Audio


Jul 06, 2021
Naoyuki Kanda, Xiong Xiao, Jian Wu, Tianyan Zhou, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka

* Submitted to ASRU 2021 

  Access Paper or Ask Questions

Investigation of Practical Aspects of Single Channel Speech Separation for ASR


Jul 05, 2021
Jian Wu, Zhuo Chen, Sanyuan Chen, Yu Wu, Takuya Yoshioka, Naoyuki Kanda, Shujie Liu, Jinyu Li

* Accepted by Interspeech 2021 

  Access Paper or Ask Questions

K-ZSL: Resources for Knowledge-driven Zero-shot Learning


Jun 29, 2021
Yuxia Geng, Jiaoyan Chen, Zhuo Chen, Jeff Z. Pan, Zonggang Yuan, Huajun Chen

* Under Review 

  Access Paper or Ask Questions

Modeling and Reasoning in Event Calculus using Goal-Directed Constraint Answer Set Programming


Jun 28, 2021
Joaquín Arias, Manuel Carro, Zhuo Chen, Gopal Gupta

* Under consideration in Theory and Practice of Logic Programming (TPLP) 

  Access Paper or Ask Questions

Human Listening and Live Captioning: Multi-Task Training for Speech Enhancement


Jun 05, 2021
Sefik Emre Eskimez, Xiaofei Wang, Min Tang, Hemin Yang, Zirun Zhu, Zhuo Chen, Huaming Wang, Takuya Yoshioka

* Accepted to INTERSPEECH2021 

  Access Paper or Ask Questions

Large-Scale Pre-Training of End-to-End Multi-Talker ASR for Meeting Transcription with Single Distant Microphone


Apr 12, 2021
Naoyuki Kanda, Guoli Ye, Yu Wu, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka

* Submitted to INTERSPEECH 2021 

  Access Paper or Ask Questions

AISHELL-4: An Open Source Dataset for Speech Enhancement, Separation, Recognition and Speaker Diarization in Conference Scenario


Apr 08, 2021
Yihui Fu, Luyao Cheng, Shubo Lv, Yukai Jv, Yuxiang Kong, Zhuo Chen, Yanxin Hu, Lei Xie, Jian Wu, Hui Bu, Xin Xu, Jun Du, Jingdong Chen

* Submitted to Interspeech 2021 

  Access Paper or Ask Questions

End-to-End Speaker-Attributed ASR with Transformer


Apr 05, 2021
Naoyuki Kanda, Guoli Ye, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka

* Submitted to INTERSPEECH 2021 

  Access Paper or Ask Questions

Continuous Speech Separation with Ad Hoc Microphone Arrays


Mar 03, 2021
Dongmei Wang, Takuya Yoshioka, Zhuo Chen, Xiaofei Wang, Tianyan Zhou, Zhong Meng


  Access Paper or Ask Questions

Knowledge-aware Zero-Shot Learning: Survey and Perspective


Feb 26, 2021
Jiaoyan Chen, Yuxia Geng, Zhuo Chen, Ian Horrocks, Jeff Z. Pan, Huajun Chen


  Access Paper or Ask Questions

Dual-Path Modeling for Long Recording Speech Separation in Meetings


Feb 23, 2021
Chenda Li, Zhuo Chen, Yi Luo, Cong Han, Tianyan Zhou, Keisuke Kinoshita, Marc Delcroix, Shinji Watanabe, Yanmin Qian

* Accepted by ICASSP 2021 

  Access Paper or Ask Questions