Haozhi Xie

Stable Adaptive Thinking via Advantage Shaping and Length-Aware Gradient Regulation

Feb 26, 2026
Via arXiv