Picture for Xuhang Zhu

Xuhang Zhu

Momentum for Reasoning: Dense Intrinsic Signals in Policy Optimization

Add code
Jun 07, 2026
Viaarxiv icon