Picture for Ruochen Zhou

Ruochen Zhou

Learn to Reason Efficiently with Adaptive Length-based Reward Shaping

Add code
May 21, 2025
Viaarxiv icon

Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas

Add code
Mar 04, 2025
Figure 1 for Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas
Figure 2 for Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas
Figure 3 for Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas
Figure 4 for Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas
Viaarxiv icon

Image-of-Thought Prompting for Visual Reasoning Refinement in Multimodal Large Language Models

Add code
May 22, 2024
Viaarxiv icon