Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

Locate and Beamform: Two-dimensional Locating All-neural Beamformer for Multi-channel Speech Separation


May 18, 2023
Yanjie Fu, Meng Ge, Honglong Wang, Nan Li, Haoran Yin, Longbiao Wang, Gaoyan Zhang, Jianwu Dang, Chengyun Deng, Fei Wang

Add code

* Accepted by Interspeech 2023. arXiv admin note: substantial text overlap with arXiv:2212.03401 

   Access Paper or Ask Questions

Time-domain Speech Enhancement Assisted by Multi-resolution Frequency Encoder and Decoder


Mar 26, 2023
Hao Shi, Masato Mimura, Longbiao Wang, Jianwu Dang, Tatsuya Kawahara

Add code


   Access Paper or Ask Questions

Cross-modal Audio-visual Co-learning for Text-independent Speaker Verification


Feb 22, 2023
Meng Liu, Kong Aik Lee, Longbiao Wang, Hanyi Zhang, Chang Zeng, Jianwu Dang

Add code


   Access Paper or Ask Questions

MIMO-DBnet: Multi-channel Input and Multiple Outputs DOA-aware Beamforming Network for Speech Separation


Dec 07, 2022
Yanjie Fu, Haoran Yin, Meng Ge, Longbiao Wang, Gaoyan Zhang, Jianwu Dang, Chengyun Deng, Fei Wang

Add code

* Submitted to ICASSP 2023 

   Access Paper or Ask Questions

Monolingual Recognizers Fusion for Code-switching Speech Recognition


Nov 02, 2022
Tongtong Song, Qiang Xu, Haoyu Lu, Longbiao Wang, Hao Shi, Yuqin Lin, Yanbing Yang, Jianwu Dang

Add code

* Submitted to ICASSP2023 

   Access Paper or Ask Questions

Deep Spectro-temporal Artifacts for Detecting Synthesized Speech


Oct 11, 2022
Xiaohui Liu, Meng Liu, Lin Zhang, Linjuan Zhang, Chang Zeng, Kai Li, Nan Li, Kong Aik Lee, Longbiao Wang, Jianwu Dang

Add code

* 7 pages, 1 figures, Accecpted by Proceedings of the 1st International Workshop on Deepfake Detection for Audio Multimedia 

   Access Paper or Ask Questions

VCSE: Time-Domain Visual-Contextual Speaker Extraction Network


Oct 09, 2022
Junjie Li, Meng Ge, Zexu Pan, Longbiao Wang, Jianwu Dang

Add code


   Access Paper or Ask Questions

MIMO-DoAnet: Multi-channel Input and Multiple Outputs DoA Network with Unknown Number of Sound Sources


Jul 15, 2022
Haoran Yin, Meng Ge, Yanjie Fu, Gaoyan Zhang, Longbiao Wang, Lei Zhang, Lin Qiu, Jianwu Dang

Add code

* Accepted by Interspeech 2022 

   Access Paper or Ask Questions

Language-specific Characteristic Assistance for Code-switching Speech Recognition


Jul 05, 2022
Tongtong Song, Qiang Xu, Meng Ge, Longbiao Wang, Hao Shi, Yongjie Lv, Yuqin Lin, Jianwu Dang

Add code

* Accepted by Interspeech 2022 

   Access Paper or Ask Questions

1
2
3
>>