Shi-Xiong Zhang

RIR-SF: Room Impulse Response Based Spatial Feature for Multi-channel Multi-talker ASR

Oct 31, 2023
Yiwen Shao, Shi-Xiong Zhang, Dong Yu

UniX-Encoder: A Universal $X$-Channel Speech Encoder for Ad-Hoc Microphone Array Speech Processing

Oct 25, 2023
Zili Huang, Yiwen Shao, Shi-Xiong Zhang, Dong Yu

M3-AUDIODEC: Multi-channel multi-speaker multi-spatial audio codec

Sep 23, 2023
Anton Ratnarajah, Shi-Xiong Zhang, Dong Yu

MMCosine: Multi-Modal Cosine Loss Towards Balanced Audio-Visual Fine-Grained Learning

Mar 11, 2023
Ruize Xu, Ruoxuan Feng, Shi-Xiong Zhang, Di Hu

3D Neural Beamforming for Multi-channel Speech Separation Against Location Uncertainty

Feb 27, 2023
Rongzhi Gu, Shi-Xiong Zhang, Dong Yu

Towards Unified All-Neural Beamforming for Time and Frequency Domain Speech Separation

Dec 24, 2022
Rongzhi Gu, Shi-Xiong Zhang, Yuexian Zou, Dong Yu

Deep Neural Mel-Subband Beamformer for In-car Speech Separation

Nov 22, 2022
Vinay Kothapally, Yong Xu, Meng Yu, Shi-Xiong Zhang, Dong Yu
