Picture for Fan Zhuo

Fan Zhuo

WavAlign: Enhancing Intelligence and Expressiveness in Spoken Dialogue Models via Adaptive Hybrid Post-Training

Add code
Apr 16, 2026
Viaarxiv icon

ImVideoEdit: Image-learning Video Editing via 2D Spatial Difference Attention Blocks

Add code
Apr 09, 2026
Viaarxiv icon

Modeling and Benchmarking Spoken Dialogue Rewards with Modality and Colloquialness

Add code
Mar 16, 2026
Viaarxiv icon

Self-Reinforcing Prototype Evolution with Dual-Knowledge Cooperation for Semi-Supervised Lifelong Person Re-Identification

Add code
Jul 02, 2025
Viaarxiv icon

An End-to-End Approach for Chord-Conditioned Song Generation

Add code
Sep 10, 2024
Figure 1 for An End-to-End Approach for Chord-Conditioned Song Generation
Figure 2 for An End-to-End Approach for Chord-Conditioned Song Generation
Figure 3 for An End-to-End Approach for Chord-Conditioned Song Generation
Viaarxiv icon