Alert button
Picture for Shi-Xiong Zhang

Shi-Xiong Zhang

Alert button

RIR-SF: Room Impulse Response Based Spatial Feature for Multi-channel Multi-talker ASR

Add code
Bookmark button
Alert button
Oct 31, 2023
Yiwen Shao, Shi-Xiong Zhang, Dong Yu

Figure 1 for RIR-SF: Room Impulse Response Based Spatial Feature for Multi-channel Multi-talker ASR
Figure 2 for RIR-SF: Room Impulse Response Based Spatial Feature for Multi-channel Multi-talker ASR
Figure 3 for RIR-SF: Room Impulse Response Based Spatial Feature for Multi-channel Multi-talker ASR
Figure 4 for RIR-SF: Room Impulse Response Based Spatial Feature for Multi-channel Multi-talker ASR
Viaarxiv icon

UniX-Encoder: A Universal $X$-Channel Speech Encoder for Ad-Hoc Microphone Array Speech Processing

Add code
Bookmark button
Alert button
Oct 25, 2023
Zili Huang, Yiwen Shao, Shi-Xiong Zhang, Dong Yu

Viaarxiv icon

M3-AUDIODEC: Multi-channel multi-speaker multi-spatial audio codec

Add code
Bookmark button
Alert button
Sep 23, 2023
Anton Ratnarajah, Shi-Xiong Zhang, Dong Yu

Figure 1 for M3-AUDIODEC: Multi-channel multi-speaker multi-spatial audio codec
Figure 2 for M3-AUDIODEC: Multi-channel multi-speaker multi-spatial audio codec
Figure 3 for M3-AUDIODEC: Multi-channel multi-speaker multi-spatial audio codec
Figure 4 for M3-AUDIODEC: Multi-channel multi-speaker multi-spatial audio codec
Viaarxiv icon

MMCosine: Multi-Modal Cosine Loss Towards Balanced Audio-Visual Fine-Grained Learning

Add code
Bookmark button
Alert button
Mar 11, 2023
Ruize Xu, Ruoxuan Feng, Shi-Xiong Zhang, Di Hu

Figure 1 for MMCosine: Multi-Modal Cosine Loss Towards Balanced Audio-Visual Fine-Grained Learning
Figure 2 for MMCosine: Multi-Modal Cosine Loss Towards Balanced Audio-Visual Fine-Grained Learning
Figure 3 for MMCosine: Multi-Modal Cosine Loss Towards Balanced Audio-Visual Fine-Grained Learning
Figure 4 for MMCosine: Multi-Modal Cosine Loss Towards Balanced Audio-Visual Fine-Grained Learning
Viaarxiv icon

3D Neural Beamforming for Multi-channel Speech Separation Against Location Uncertainty

Add code
Bookmark button
Alert button
Feb 27, 2023
Rongzhi Gu, Shi-Xiong Zhang, Dong Yu

Figure 1 for 3D Neural Beamforming for Multi-channel Speech Separation Against Location Uncertainty
Figure 2 for 3D Neural Beamforming for Multi-channel Speech Separation Against Location Uncertainty
Figure 3 for 3D Neural Beamforming for Multi-channel Speech Separation Against Location Uncertainty
Figure 4 for 3D Neural Beamforming for Multi-channel Speech Separation Against Location Uncertainty
Viaarxiv icon

Towards Unified All-Neural Beamforming for Time and Frequency Domain Speech Separation

Add code
Bookmark button
Alert button
Dec 24, 2022
Rongzhi Gu, Shi-Xiong Zhang, Yuexian Zou, Dong Yu

Figure 1 for Towards Unified All-Neural Beamforming for Time and Frequency Domain Speech Separation
Figure 2 for Towards Unified All-Neural Beamforming for Time and Frequency Domain Speech Separation
Figure 3 for Towards Unified All-Neural Beamforming for Time and Frequency Domain Speech Separation
Figure 4 for Towards Unified All-Neural Beamforming for Time and Frequency Domain Speech Separation
Viaarxiv icon

Deep Neural Mel-Subband Beamformer for In-car Speech Separation

Add code
Bookmark button
Alert button
Nov 22, 2022
Vinay Kothapally, Yong Xu, Meng Yu, Shi-Xiong Zhang, Dong Yu

Figure 1 for Deep Neural Mel-Subband Beamformer for In-car Speech Separation
Figure 2 for Deep Neural Mel-Subband Beamformer for In-car Speech Separation
Figure 3 for Deep Neural Mel-Subband Beamformer for In-car Speech Separation
Viaarxiv icon

NeuralEcho: A Self-Attentive Recurrent Neural Network For Unified Acoustic Echo Suppression And Speech Enhancement

Add code
Bookmark button
Alert button
May 20, 2022
Meng Yu, Yong Xu, Chunlei Zhang, Shi-Xiong Zhang, Dong Yu

Figure 1 for NeuralEcho: A Self-Attentive Recurrent Neural Network For Unified Acoustic Echo Suppression And Speech Enhancement
Figure 2 for NeuralEcho: A Self-Attentive Recurrent Neural Network For Unified Acoustic Echo Suppression And Speech Enhancement
Figure 3 for NeuralEcho: A Self-Attentive Recurrent Neural Network For Unified Acoustic Echo Suppression And Speech Enhancement
Figure 4 for NeuralEcho: A Self-Attentive Recurrent Neural Network For Unified Acoustic Echo Suppression And Speech Enhancement
Viaarxiv icon

EEND-SS: Joint End-to-End Neural Speaker Diarization and Speech Separation for Flexible Number of Speakers

Add code
Bookmark button
Alert button
Mar 31, 2022
Yushi Ueda, Soumi Maiti, Shinji Watanabe, Chunlei Zhang, Meng Yu, Shi-Xiong Zhang, Yong Xu

Figure 1 for EEND-SS: Joint End-to-End Neural Speaker Diarization and Speech Separation for Flexible Number of Speakers
Figure 2 for EEND-SS: Joint End-to-End Neural Speaker Diarization and Speech Separation for Flexible Number of Speakers
Figure 3 for EEND-SS: Joint End-to-End Neural Speaker Diarization and Speech Separation for Flexible Number of Speakers
Figure 4 for EEND-SS: Joint End-to-End Neural Speaker Diarization and Speech Separation for Flexible Number of Speakers
Viaarxiv icon

Consistent Training and Decoding For End-to-end Speech Recognition Using Lattice-free MMI

Add code
Bookmark button
Alert button
Dec 30, 2021
Jinchuan Tian, Jianwei Yu, Chao Weng, Shi-Xiong Zhang, Dan Su, Dong Yu, Yuexian Zou

Figure 1 for Consistent Training and Decoding For End-to-end Speech Recognition Using Lattice-free MMI
Figure 2 for Consistent Training and Decoding For End-to-end Speech Recognition Using Lattice-free MMI
Figure 3 for Consistent Training and Decoding For End-to-end Speech Recognition Using Lattice-free MMI
Figure 4 for Consistent Training and Decoding For End-to-end Speech Recognition Using Lattice-free MMI
Viaarxiv icon