Picture for Yuchen Zhang

Yuchen Zhang

Jack

Toward Efficient Influence Function: Dropout as a Compression Tool

Add code
Sep 19, 2025
Viaarxiv icon

SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

Add code
Sep 11, 2025
Viaarxiv icon

A Survey of Reinforcement Learning for Large Reasoning Models

Add code
Sep 10, 2025
Viaarxiv icon

HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?

Add code
Sep 10, 2025
Figure 1 for HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?
Figure 2 for HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?
Figure 3 for HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?
Figure 4 for HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?
Viaarxiv icon

Echo: Decoupling Inference and Training for Large-Scale RL Alignment on Heterogeneous Swarms

Add code
Aug 07, 2025
Viaarxiv icon

xbench: Tracking Agents Productivity Scaling with Profession-Aligned Real-World Evaluations

Add code
Jun 16, 2025
Viaarxiv icon

Foundation Models in Autonomous Driving: A Survey on Scenario Generation and Scenario Analysis

Add code
Jun 13, 2025
Viaarxiv icon

UFM: A Simple Path towards Unified Dense Correspondence with Flow

Add code
Jun 10, 2025
Viaarxiv icon

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Add code
May 28, 2025
Viaarxiv icon

The Coherence Trap: When MLLM-Crafted Narratives Exploit Manipulated Visual Contexts

Add code
May 23, 2025
Viaarxiv icon