Picture for Jianjie Cheng

Jianjie Cheng

How to Train Your Deep Research Agent? Prompt, Reward, and Policy Optimization in Search-R1

Add code
Feb 23, 2026
Viaarxiv icon

Do MLLMs Really Understand Space? A Mathematical Reasoning Evaluation

Add code
Feb 12, 2026
Viaarxiv icon

Reassessing the Role of Supervised Fine-Tuning: An Empirical Study in VLM Reasoning

Add code
Dec 14, 2025
Viaarxiv icon