Picture for Dengdong Fan

Dengdong Fan

PowerStep: Memory-Efficient Adaptive Optimization via $\ell_p$-Norm Steepest Descent

Add code
May 11, 2026
Viaarxiv icon

PCL-Reasoner-V1.5: Advancing Math Reasoning with Offline Reinforcement Learning

Add code
Jan 21, 2026
Viaarxiv icon