Picture for Yuhao Liao

Yuhao Liao

Difference Feedback: Generating Multimodal Process-Level Supervision for VLM Reinforcement Learning

Add code
Mar 29, 2026
Viaarxiv icon