Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Shi-Xiong Zhang

Multi-Channel Multi-Speaker ASR Using 3D Spatial Feature


Nov 22, 2021
Yiwen Shao, Shi-Xiong Zhang, Dong Yu


  Access Paper or Ask Questions

Joint AEC AND Beamforming with Double-Talk Detection using RNN-Transformer


Nov 09, 2021
Vinay Kothapally, Yong Xu, Meng Yu, Shi-Xiong Zhang, Dong Yu

* Submitted to ICASSP 2022 

  Access Paper or Ask Questions

FAST-RIR: Fast neural diffuse room impulse response generator


Oct 07, 2021
Anton Ratnarajah, Shi-Xiong Zhang, Meng Yu, Zhenyu Tang, Dinesh Manocha, Dong Yu

* More results and source code is available at https://anton-jeran.github.io/FRIR/ 

  Access Paper or Ask Questions

MIMO Self-attentive RNN Beamformer for Multi-speaker Speech Separation


Apr 26, 2021
Xiyun Li, Yong Xu, Meng Yu, Shi-Xiong Zhang, Jiaming Xu, Bo Xu, Dong Yu


  Access Paper or Ask Questions

Complex Neural Spatial Filter: Enhancing Multi-channel Target Speech Separation in Complex Domain


Apr 26, 2021
Rongzhi Gu, Shi-Xiong Zhang, Yuexian Zou, Dong Yu

* 5 pages, 3 figures 

  Access Paper or Ask Questions

TeCANet: Temporal-Contextual Attention Network for Environment-Aware Speech Dereverberation


Mar 31, 2021
Helin Wang, Bo Wu, Lianwu Chen, Meng Yu, Jianwei Yu, Yong Xu, Shi-Xiong Zhang, Chao Weng, Dan Su, Dong Yu

* Submitted to Interspeech 2021 

  Access Paper or Ask Questions

Generalized RNN beamformer for target speech separation


Jan 04, 2021
Yong Xu, Zhuohuang Zhang, Meng Yu, Shi-Xiong Zhang, Lianwu Chen, Dong Yu

* 4 pages 2 figures, demo: https://yongxuustc.github.io/grnnbf/ 

  Access Paper or Ask Questions

Multi-channel Multi-frame ADL-MVDR for Target Speech Separation


Dec 24, 2020
Zhuohuang Zhang, Yong Xu, Meng Yu, Shi-Xiong Zhang, Lianwu Chen, Donald S. Williamson, Dong Yu

* 12 pages, 6 figures. Demos available at https://zzhang68.github.io/mcmf-adl-mvdr/ 

  Access Paper or Ask Questions

Directional ASR: A New Paradigm for E2E Multi-Speaker Speech Recognition with Source Localization


Oct 30, 2020
Aswin Shanmugam Subramanian, Chao Weng, Shinji Watanabe, Meng Yu, Yong Xu, Shi-Xiong Zhang, Dong Yu

* submitted to ICASSP 2021 

  Access Paper or Ask Questions

An Overview of Deep-Learning-Based Audio-Visual Speech Enhancement and Separation


Aug 21, 2020
Daniel Michelsanti, Zheng-Hua Tan, Shi-Xiong Zhang, Yong Xu, Meng Yu, Dong Yu, Jesper Jensen


  Access Paper or Ask Questions

Self-supervised learning for audio-visual speaker diarization


Feb 13, 2020
Yifan Ding, Yong Xu, Shi-Xiong Zhang, Yahuan Cong, Liqiang Wang


  Access Paper or Ask Questions

A Unified Framework for Speech Separation


Dec 17, 2019
Fahimeh Bahmaninezhad, Shi-Xiong Zhang, Yong Xu, Meng Yu, John H. L. Hansen, Dong Yu


  Access Paper or Ask Questions

End-to-End Multi-Channel Speech Separation


May 28, 2019
Rongzhi Gu, Jian Wu, Shi-Xiong Zhang, Lianwu Chen, Yong Xu, Meng Yu, Dan Su, Yuexian Zou, Dong Yu

* submitted to interspeech 2019 

  Access Paper or Ask Questions

A comprehensive study of speech separation: spectrogram vs waveform separation


May 17, 2019
Fahimeh Bahmaninezhad, Jian Wu, Rongzhi Gu, Shi-Xiong Zhang, Yong Xu, Meng Yu, Dong Yu

* Submitted to INTERSPEECH 2019 

  Access Paper or Ask Questions

Encrypted Speech Recognition using Deep Polynomial Networks


May 11, 2019
Shi-Xiong Zhang, Yifan Gong, Dong Yu

* ICASSP 2019, [email protected] https://www.researchgate.net/publication/333005422_Encrypted_Speech_Recognition_using_deep_polynomial_networks 

  Access Paper or Ask Questions

End-to-End Attention based Text-Dependent Speaker Verification


Jan 03, 2017
Shi-Xiong Zhang, Zhuo Chen, Yong Zhao, Jinyu Li, Yifan Gong

* @article{zhang2016End2End, title={End-to-End Attention based Text-Dependent Speaker Verification}, author={Shi-Xiong Zhang, Zhuo Chen$^{\dag}$, Yong Zhao, Jinyu Li and Yifan Gong}, journal={IEEE Workshop on Spoken Language Technology}, pages={171--178}, year={2016}, publisher={IEEE} } 

  Access Paper or Ask Questions