Alert button
Picture for Longbiao Wang

Longbiao Wang

Alert button

Locate and Beamform: Two-dimensional Locating All-neural Beamformer for Multi-channel Speech Separation

Add code
Bookmark button
Alert button
May 18, 2023
Yanjie Fu, Meng Ge, Honglong Wang, Nan Li, Haoran Yin, Longbiao Wang, Gaoyan Zhang, Jianwu Dang, Chengyun Deng, Fei Wang

Figure 1 for Locate and Beamform: Two-dimensional Locating All-neural Beamformer for Multi-channel Speech Separation
Figure 2 for Locate and Beamform: Two-dimensional Locating All-neural Beamformer for Multi-channel Speech Separation
Figure 3 for Locate and Beamform: Two-dimensional Locating All-neural Beamformer for Multi-channel Speech Separation
Figure 4 for Locate and Beamform: Two-dimensional Locating All-neural Beamformer for Multi-channel Speech Separation
Viaarxiv icon

Time-domain Speech Enhancement Assisted by Multi-resolution Frequency Encoder and Decoder

Add code
Bookmark button
Alert button
Mar 26, 2023
Hao Shi, Masato Mimura, Longbiao Wang, Jianwu Dang, Tatsuya Kawahara

Figure 1 for Time-domain Speech Enhancement Assisted by Multi-resolution Frequency Encoder and Decoder
Figure 2 for Time-domain Speech Enhancement Assisted by Multi-resolution Frequency Encoder and Decoder
Figure 3 for Time-domain Speech Enhancement Assisted by Multi-resolution Frequency Encoder and Decoder
Figure 4 for Time-domain Speech Enhancement Assisted by Multi-resolution Frequency Encoder and Decoder
Viaarxiv icon

Cross-modal Audio-visual Co-learning for Text-independent Speaker Verification

Add code
Bookmark button
Alert button
Feb 22, 2023
Meng Liu, Kong Aik Lee, Longbiao Wang, Hanyi Zhang, Chang Zeng, Jianwu Dang

Figure 1 for Cross-modal Audio-visual Co-learning for Text-independent Speaker Verification
Figure 2 for Cross-modal Audio-visual Co-learning for Text-independent Speaker Verification
Figure 3 for Cross-modal Audio-visual Co-learning for Text-independent Speaker Verification
Figure 4 for Cross-modal Audio-visual Co-learning for Text-independent Speaker Verification
Viaarxiv icon

MIMO-DBnet: Multi-channel Input and Multiple Outputs DOA-aware Beamforming Network for Speech Separation

Add code
Bookmark button
Alert button
Dec 07, 2022
Yanjie Fu, Haoran Yin, Meng Ge, Longbiao Wang, Gaoyan Zhang, Jianwu Dang, Chengyun Deng, Fei Wang

Figure 1 for MIMO-DBnet: Multi-channel Input and Multiple Outputs DOA-aware Beamforming Network for Speech Separation
Figure 2 for MIMO-DBnet: Multi-channel Input and Multiple Outputs DOA-aware Beamforming Network for Speech Separation
Figure 3 for MIMO-DBnet: Multi-channel Input and Multiple Outputs DOA-aware Beamforming Network for Speech Separation
Figure 4 for MIMO-DBnet: Multi-channel Input and Multiple Outputs DOA-aware Beamforming Network for Speech Separation
Viaarxiv icon

The ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge (ICSRC): Dataset, Tracks, Baseline and Results

Add code
Bookmark button
Alert button
Nov 03, 2022
Ao Zhang, Fan Yu, Kaixun Huang, Lei Xie, Longbiao Wang, Eng Siong Chng, Hui Bu, Binbin Zhang, Wei Chen, Xin Xu

Figure 1 for The ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge (ICSRC): Dataset, Tracks, Baseline and Results
Figure 2 for The ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge (ICSRC): Dataset, Tracks, Baseline and Results
Figure 3 for The ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge (ICSRC): Dataset, Tracks, Baseline and Results
Figure 4 for The ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge (ICSRC): Dataset, Tracks, Baseline and Results
Viaarxiv icon

I4U System Description for NIST SRE'20 CTS Challenge

Add code
Bookmark button
Alert button
Nov 02, 2022
Kong Aik Lee, Tomi Kinnunen, Daniele Colibro, Claudio Vair, Andreas Nautsch, Hanwu Sun, Liang He, Tianyu Liang, Qiongqiong Wang, Mickael Rouvier, Pierre-Michel Bousquet, Rohan Kumar Das, Ignacio Viñals Bailo, Meng Liu, Héctor Deldago, Xuechen Liu, Md Sahidullah, Sandro Cumani, Boning Zhang, Koji Okabe, Hitoshi Yamamoto, Ruijie Tao, Haizhou Li, Alfonso Ortega Giménez, Longbiao Wang, Luis Buera

Figure 1 for I4U System Description for NIST SRE'20 CTS Challenge
Figure 2 for I4U System Description for NIST SRE'20 CTS Challenge
Figure 3 for I4U System Description for NIST SRE'20 CTS Challenge
Figure 4 for I4U System Description for NIST SRE'20 CTS Challenge
Viaarxiv icon

Monolingual Recognizers Fusion for Code-switching Speech Recognition

Add code
Bookmark button
Alert button
Nov 02, 2022
Tongtong Song, Qiang Xu, Haoyu Lu, Longbiao Wang, Hao Shi, Yuqin Lin, Yanbing Yang, Jianwu Dang

Figure 1 for Monolingual Recognizers Fusion for Code-switching Speech Recognition
Figure 2 for Monolingual Recognizers Fusion for Code-switching Speech Recognition
Figure 3 for Monolingual Recognizers Fusion for Code-switching Speech Recognition
Figure 4 for Monolingual Recognizers Fusion for Code-switching Speech Recognition
Viaarxiv icon

Deep Spectro-temporal Artifacts for Detecting Synthesized Speech

Add code
Bookmark button
Alert button
Oct 11, 2022
Xiaohui Liu, Meng Liu, Lin Zhang, Linjuan Zhang, Chang Zeng, Kai Li, Nan Li, Kong Aik Lee, Longbiao Wang, Jianwu Dang

Figure 1 for Deep Spectro-temporal Artifacts for Detecting Synthesized Speech
Figure 2 for Deep Spectro-temporal Artifacts for Detecting Synthesized Speech
Figure 3 for Deep Spectro-temporal Artifacts for Detecting Synthesized Speech
Figure 4 for Deep Spectro-temporal Artifacts for Detecting Synthesized Speech
Viaarxiv icon

VCSE: Time-Domain Visual-Contextual Speaker Extraction Network

Add code
Bookmark button
Alert button
Oct 09, 2022
Junjie Li, Meng Ge, Zexu Pan, Longbiao Wang, Jianwu Dang

Figure 1 for VCSE: Time-Domain Visual-Contextual Speaker Extraction Network
Figure 2 for VCSE: Time-Domain Visual-Contextual Speaker Extraction Network
Figure 3 for VCSE: Time-Domain Visual-Contextual Speaker Extraction Network
Figure 4 for VCSE: Time-Domain Visual-Contextual Speaker Extraction Network
Viaarxiv icon

MIMO-DoAnet: Multi-channel Input and Multiple Outputs DoA Network with Unknown Number of Sound Sources

Add code
Bookmark button
Alert button
Jul 15, 2022
Haoran Yin, Meng Ge, Yanjie Fu, Gaoyan Zhang, Longbiao Wang, Lei Zhang, Lin Qiu, Jianwu Dang

Figure 1 for MIMO-DoAnet: Multi-channel Input and Multiple Outputs DoA Network with Unknown Number of Sound Sources
Figure 2 for MIMO-DoAnet: Multi-channel Input and Multiple Outputs DoA Network with Unknown Number of Sound Sources
Figure 3 for MIMO-DoAnet: Multi-channel Input and Multiple Outputs DoA Network with Unknown Number of Sound Sources
Figure 4 for MIMO-DoAnet: Multi-channel Input and Multiple Outputs DoA Network with Unknown Number of Sound Sources
Viaarxiv icon