Picture for Xiaodong Ai

Xiaodong Ai

SHAPE: Stage-aware Hierarchical Advantage via Potential Estimation for LLM Reasoning

Add code
Apr 08, 2026
Viaarxiv icon