Yannan Wang

Distance-based Weight Transfer from Near-field to Far-field Speaker Verification

Mar 15, 2023

TEA-PSE 3.0: Tencent-Ethereal-Audio-Lab Personalized Speech Enhancement System For ICASSP 2023 DNS Challenge

Mar 14, 2023

Speech Enhancement with Fullband-Subband Cross-Attention Network

Nov 10, 2022

Speech Enhancement with Intelligent Neural Homomorphic Synthesis

Oct 28, 2022

Local-global speaker representation for target speaker extraction

Oct 28, 2022

Spatial-DCCRN: DCCRN equipped with frame-level angle feature and hybrid filtering for multi-channel speech enhancement

Oct 17, 2022

A study on joint modeling and data augmentation of multi-modalities for audio-visual scene classification

Mar 31, 2022

S-DCCRN: Super Wide Band DCCRN with learnable complex feature for speech enhancement

Nov 16, 2021

A Lottery Ticket Hypothesis Framework for Low-Complexity Device-Robust Neural Acoustic Scene Classification

Jul 03, 2021

Improving Channel Decorrelation for Multi-Channel Target Speech Extraction

Jun 06, 2021