Alert button
Picture for Yannan Wang

Yannan Wang

Alert button

Joint Training or Not: An Exploration of Pre-trained Speech Models in Audio-Visual Speaker Diarization

Dec 07, 2023
Huan Zhao, Li Zhang, Yue Li, Yannan Wang, Hongji Wang, Wei Rao, Qing Wang, Lei Xie

Viaarxiv icon

The FlySpeech Audio-Visual Speaker Diarization System for MISP Challenge 2022

Jul 28, 2023
Li Zhang, Huan Zhao, Yue Li, Bowen Pang, Yannan Wang, Hongji Wang, Wei Rao, Qing Wang, Lei Xie

Figure 1 for The FlySpeech Audio-Visual Speaker Diarization System for MISP Challenge 2022
Figure 2 for The FlySpeech Audio-Visual Speaker Diarization System for MISP Challenge 2022
Figure 3 for The FlySpeech Audio-Visual Speaker Diarization System for MISP Challenge 2022
Viaarxiv icon

MC-SpEx: Towards Effective Speaker Extraction with Multi-Scale Interfusion and Conditional Speaker Modulation

Jun 28, 2023
Jun Chen, Wei Rao, Zilin Wang, Jiuxin Lin, Yukai Ju, Shulin He, Yannan Wang, Zhiyong Wu

Figure 1 for MC-SpEx: Towards Effective Speaker Extraction with Multi-Scale Interfusion and Conditional Speaker Modulation
Figure 2 for MC-SpEx: Towards Effective Speaker Extraction with Multi-Scale Interfusion and Conditional Speaker Modulation
Figure 3 for MC-SpEx: Towards Effective Speaker Extraction with Multi-Scale Interfusion and Conditional Speaker Modulation
Figure 4 for MC-SpEx: Towards Effective Speaker Extraction with Multi-Scale Interfusion and Conditional Speaker Modulation
Viaarxiv icon

Gesper: A Restoration-Enhancement Framework for General Speech Reconstruction

Jun 14, 2023
Wenzhe Liu, Yupeng Shi, Jun Chen, Wei Rao, Shulin He, Andong Li, Yannan Wang, Zhiyong Wu

Figure 1 for Gesper: A Restoration-Enhancement Framework for General Speech Reconstruction
Figure 2 for Gesper: A Restoration-Enhancement Framework for General Speech Reconstruction
Figure 3 for Gesper: A Restoration-Enhancement Framework for General Speech Reconstruction
Figure 4 for Gesper: A Restoration-Enhancement Framework for General Speech Reconstruction
Viaarxiv icon

Inter-SubNet: Speech Enhancement with Subband Interaction

May 09, 2023
Jun Chen, Wei Rao, Zilin Wang, Jiuxin Lin, Zhiyong Wu, Yannan Wang, Shidong Shang, Helen Meng

Figure 1 for Inter-SubNet: Speech Enhancement with Subband Interaction
Figure 2 for Inter-SubNet: Speech Enhancement with Subband Interaction
Figure 3 for Inter-SubNet: Speech Enhancement with Subband Interaction
Figure 4 for Inter-SubNet: Speech Enhancement with Subband Interaction
Viaarxiv icon

Distance-based Weight Transfer from Near-field to Far-field Speaker Verification

Mar 15, 2023
Li Zhang, Qing Wang, Hongji Wang, Yue Li, Wei Rao, Yannan Wang, Lei Xie

Figure 1 for Distance-based Weight Transfer from Near-field to Far-field Speaker Verification
Figure 2 for Distance-based Weight Transfer from Near-field to Far-field Speaker Verification
Figure 3 for Distance-based Weight Transfer from Near-field to Far-field Speaker Verification
Viaarxiv icon

TEA-PSE 3.0: Tencent-Ethereal-Audio-Lab Personalized Speech Enhancement System For ICASSP 2023 DNS Challenge

Mar 14, 2023
Yukai Ju, Jun Chen, Shimin Zhang, Shulin He, Wei Rao, Weixin Zhu, Yannan Wang, Tao Yu, Shidong Shang

Figure 1 for TEA-PSE 3.0: Tencent-Ethereal-Audio-Lab Personalized Speech Enhancement System For ICASSP 2023 DNS Challenge
Figure 2 for TEA-PSE 3.0: Tencent-Ethereal-Audio-Lab Personalized Speech Enhancement System For ICASSP 2023 DNS Challenge
Figure 3 for TEA-PSE 3.0: Tencent-Ethereal-Audio-Lab Personalized Speech Enhancement System For ICASSP 2023 DNS Challenge
Viaarxiv icon

Distance-based Weight Transfer for Fine-tuning from Near-field to Far-field Speaker Verification

Mar 01, 2023
Li Zhang, Qing Wang, Hongji Wang, Yue Li, Wei Rao, Yannan Wang, Lei Xie

Figure 1 for Distance-based Weight Transfer for Fine-tuning from Near-field to Far-field Speaker Verification
Figure 2 for Distance-based Weight Transfer for Fine-tuning from Near-field to Far-field Speaker Verification
Figure 3 for Distance-based Weight Transfer for Fine-tuning from Near-field to Far-field Speaker Verification
Viaarxiv icon

Speech Enhancement with Fullband-Subband Cross-Attention Network

Nov 10, 2022
Jun Chen, Wei Rao, Zilin Wang, Zhiyong Wu, Yannan Wang, Tao Yu, Shidong Shang, Helen Meng

Figure 1 for Speech Enhancement with Fullband-Subband Cross-Attention Network
Figure 2 for Speech Enhancement with Fullband-Subband Cross-Attention Network
Figure 3 for Speech Enhancement with Fullband-Subband Cross-Attention Network
Viaarxiv icon

Speech Enhancement with Intelligent Neural Homomorphic Synthesis

Oct 28, 2022
Shulin He, Wei Rao, Jinjiang Liu, Jun Chen, Yukai Ju, Xueliang Zhang, Yannan Wang, Shidong Shang

Figure 1 for Speech Enhancement with Intelligent Neural Homomorphic Synthesis
Figure 2 for Speech Enhancement with Intelligent Neural Homomorphic Synthesis
Figure 3 for Speech Enhancement with Intelligent Neural Homomorphic Synthesis
Figure 4 for Speech Enhancement with Intelligent Neural Homomorphic Synthesis
Viaarxiv icon