Picture for Changyan Zheng

Changyan Zheng

DBMIF: a deep balanced multimodal iterative fusion framework for air- and bone-conduction speech enhancement

Add code
Mar 03, 2026
Viaarxiv icon

Purification Before Fusion: Toward Mask-Free Speech Enhancement for Robust Audio-Visual Speech Recognition

Add code
Jan 18, 2026
Viaarxiv icon

Landmark-Guided Cross-Speaker Lip Reading with Mutual Information Regularization

Add code
Mar 24, 2024
Figure 1 for Landmark-Guided Cross-Speaker Lip Reading with Mutual Information Regularization
Figure 2 for Landmark-Guided Cross-Speaker Lip Reading with Mutual Information Regularization
Figure 3 for Landmark-Guided Cross-Speaker Lip Reading with Mutual Information Regularization
Figure 4 for Landmark-Guided Cross-Speaker Lip Reading with Mutual Information Regularization
Viaarxiv icon