Yinfeng Yu

ML-SAN: Multi-Level Speaker-Adaptive Network for Emotion Recognition in Conversations

Apr 28, 2026

EAD-Net: Emotion-Aware Talking Head Generation with Spatial Refinement and Temporal Coherence

Apr 25, 2026

Semantic-Emotional Resonance Embedding: A Semi-Supervised Paradigm for Cross-Lingual Speech Emotion Recognition

Apr 08, 2026

Generalizable Audio-Visual Navigation via Binaural Difference Attention and Action Transition Prediction

Apr 06, 2026

Spatial-Aware Conditioned Fusion for Audio-Visual Navigation

Apr 02, 2026

Reliability-Aware Geometric Fusion for Robust Audio-Visual Navigation

Apr 02, 2026

Audio Spatially-Guided Fusion for Audio-Visual Navigation

Apr 02, 2026

Beyond Textual Knowledge: Leveraging Multimodal Knowledge Bases for Enhancing Vision-and-Language Navigation

Mar 27, 2026

Residual Cross-Modal Fusion Networks for Audio-Visual Navigation

Jan 11, 2026

DGFNet: End-to-End Audio-Visual Source Separation Based on Dynamic Gating Fusion

Apr 30, 2025