Picture for Allan Zhang

Allan Zhang

Beyond What Seems Necessary: Hidden Gains from Scaling Training-Time Reasoning Length under Outcome Supervision

Add code
Jan 31, 2026
Viaarxiv icon