Gufeng Zhang

Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning

Oct 29, 2025
Via arXiv