Shi-Xiong Zhang

NeuralEcho: A Self-Attentive Recurrent Neural Network For Unified Acoustic Echo Suppression And Speech Enhancement

May 20, 2022
Meng Yu, Yong Xu, Chunlei Zhang, Shi-Xiong Zhang, Dong Yu

Via arXiv

EEND-SS: Joint End-to-End Neural Speaker Diarization and Speech Separation for Flexible Number of Speakers

Mar 31, 2022
Yushi Ueda, Soumi Maiti, Shinji Watanabe, Chunlei Zhang, Meng Yu, Shi-Xiong Zhang, Yong Xu

Via arXiv

Consistent Training and Decoding For End-to-end Speech Recognition Using Lattice-free MMI

Dec 30, 2021
Jinchuan Tian, Jianwei Yu, Chao Weng, Shi-Xiong Zhang, Dan Su, Dong Yu, Yuexian Zou

Via arXiv

Joint Modeling of Code-Switched and Monolingual ASR via Conditional Factorization

Nov 29, 2021
Brian Yan, Chunlei Zhang, Meng Yu, Shi-Xiong Zhang, Siddharth Dalmia, Dan Berrebbi, Chao Weng, Shinji Watanabe, Dong Yu

Via arXiv

Multi-Channel Multi-Speaker ASR Using 3D Spatial Feature

Nov 22, 2021
Yiwen Shao, Shi-Xiong Zhang, Dong Yu

Via arXiv

Joint AEC AND Beamforming with Double-Talk Detection using RNN-Transformer

Nov 09, 2021
Vinay Kothapally, Yong Xu, Meng Yu, Shi-Xiong Zhang, Dong Yu

Via arXiv

FAST-RIR: Fast neural diffuse room impulse response generator

Oct 07, 2021
Anton Ratnarajah, Shi-Xiong Zhang, Meng Yu, Zhenyu Tang, Dinesh Manocha, Dong Yu

Via arXiv

MIMO Self-attentive RNN Beamformer for Multi-speaker Speech Separation

Apr 26, 2021
Xiyun Li, Yong Xu, Meng Yu, Shi-Xiong Zhang, Jiaming Xu, Bo Xu, Dong Yu

Via arXiv

Complex Neural Spatial Filter: Enhancing Multi-channel Target Speech Separation in Complex Domain

Apr 26, 2021
Rongzhi Gu, Shi-Xiong Zhang, Yuexian Zou, Dong Yu

Via arXiv