Xuyi Zhuang

GenTSE: Enhancing Target Speaker Extraction via a Coarse-to-Fine Generative Language Model

Dec 24, 2025

FB-MSTCN: A Full-Band Single-Channel Speech Enhancement Method Based on Multi-Scale Temporal Convolutional Network

Mar 15, 2022

Incorporating Multi-Target in Multi-Stage Speech Enhancement Model for Better Generalization

Jul 09, 2021

Deep Interaction between Masking and Mapping Targets for Single-Channel Speech Enhancement

Jun 09, 2021