Picture for Yuchen Fan

Yuchen Fan

Teaching Thinking Models to Reason with Tools: A Full-Pipeline Recipe for Tool-Integrated Reasoning

Add code
May 07, 2026
Viaarxiv icon

Select-then-Solve: Paradigm Routing as Inference-Time Optimization for LLM Agents

Add code
Apr 08, 2026
Viaarxiv icon

Ego to World: Collaborative Spatial Reasoning in Embodied Systems via Reinforcement Learning

Add code
Mar 16, 2026
Viaarxiv icon

How Far Can Unsupervised RLVR Scale LLM Training?

Add code
Mar 09, 2026
Viaarxiv icon

Reading $ eq$ Seeing: Diagnosing and Closing the Typography Gap in Vision-Language Models

Add code
Mar 09, 2026
Viaarxiv icon

Scale-PINN: Learning Efficient Physics-Informed Neural Networks Through Sequential Correction

Add code
Feb 23, 2026
Viaarxiv icon

Toward Efficient Agents: Memory, Tool learning, and Planning

Add code
Jan 20, 2026
Viaarxiv icon

SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

Add code
Sep 11, 2025
Viaarxiv icon

A Survey of Reinforcement Learning for Large Reasoning Models

Add code
Sep 10, 2025
Viaarxiv icon

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Add code
May 28, 2025
Figure 1 for The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models
Figure 2 for The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models
Figure 3 for The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models
Figure 4 for The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models
Viaarxiv icon