Picture for Shuxian Liang

Shuxian Liang

Self-Guided Process Reward Optimization with Redefined Step-wise Advantage for Process Reinforcement Learning

Add code
Jul 03, 2025
Viaarxiv icon

A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond

Add code
Mar 27, 2025
Viaarxiv icon