Huichuan Fan

Hidden States Know Where Reasoning Diverges: Credit Assignment via Span-Level Wasserstein Distance

Apr 25, 2026
Via arXiv

From Absolute to Relative: Rethinking Reward Shaping in Group-Based Reinforcement Learning

Jan 30, 2026
Via arXiv