Jianwu Dang

Time-domain Speech Enhancement Assisted by Multi-resolution Frequency Encoder and Decoder

Mar 26, 2023
Hao Shi, Masato Mimura, Longbiao Wang, Jianwu Dang, Tatsuya Kawahara

Cross-modal Audio-visual Co-learning for Text-independent Speaker Verification

Feb 22, 2023
Meng Liu, Kong Aik Lee, Longbiao Wang, Hanyi Zhang, Chang Zeng, Jianwu Dang

MIMO-DBnet: Multi-channel Input and Multiple Outputs DOA-aware Beamforming Network for Speech Separation

Dec 07, 2022
Yanjie Fu, Haoran Yin, Meng Ge, Longbiao Wang, Gaoyan Zhang, Jianwu Dang, Chengyun Deng, Fei Wang

Monolingual Recognizers Fusion for Code-switching Speech Recognition

Nov 02, 2022
Tongtong Song, Qiang Xu, Haoyu Lu, Longbiao Wang, Hao Shi, Yuqin Lin, Yanbing Yang, Jianwu Dang

Deep Spectro-temporal Artifacts for Detecting Synthesized Speech

Oct 11, 2022
Xiaohui Liu, Meng Liu, Lin Zhang, Linjuan Zhang, Chang Zeng, Kai Li, Nan Li, Kong Aik Lee, Longbiao Wang, Jianwu Dang

VCSE: Time-Domain Visual-Contextual Speaker Extraction Network

Oct 09, 2022
Junjie Li, Meng Ge, Zexu Pan, Longbiao Wang, Jianwu Dang

MIMO-DoAnet: Multi-channel Input and Multiple Outputs DoA Network with Unknown Number of Sound Sources

Jul 15, 2022
Haoran Yin, Meng Ge, Yanjie Fu, Gaoyan Zhang, Longbiao Wang, Lei Zhang, Lin Qiu, Jianwu Dang

Language-specific Characteristic Assistance for Code-switching Speech Recognition

Jul 05, 2022
Tongtong Song, Qiang Xu, Meng Ge, Longbiao Wang, Hao Shi, Yongjie Lv, Yuqin Lin, Jianwu Dang

Iterative Sound Source Localization for Unknown Number of Sources

Jun 24, 2022
Yanjie Fu, Meng Ge, Haoran Yin, Xinyuan Qian, Longbiao Wang, Gaoyan Zhang, Jianwu Dang
