Picture for Haodi Lei

Haodi Lei

$π$-Bench: Evaluating Proactive Personal Assistant Agents in Long-Horizon Workflows

Add code
May 14, 2026
Viaarxiv icon

Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling

Add code
May 13, 2026
Viaarxiv icon

P1-VL: Bridging Visual Perception and Scientific Reasoning in Physics Olympiads

Add code
Feb 10, 2026
Viaarxiv icon