Fuwei Cui

KTAE: A Model-Free Algorithm to Key-Tokens Advantage Estimation in Mathematical Reasoning

May 22, 2025
Via arXiv

An Efficient and Precise Training Data Construction Framework for Process-supervised Reward Model in Mathematical Reasoning

Mar 04, 2025
Via arXiv