Hardy Chen

SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models

Apr 10, 2025
Via arXiv

ViLBench: A Suite for Vision-Language Process Reward Modeling

Mar 26, 2025
Via arXiv