Picture for Fuwei Cui

Fuwei Cui

KTAE: A Model-Free Algorithm to Key-Tokens Advantage Estimation in Mathematical Reasoning

Add code
May 22, 2025
Viaarxiv icon

An Efficient and Precise Training Data Construction Framework for Process-supervised Reward Model in Mathematical Reasoning

Add code
Mar 04, 2025
Figure 1 for An Efficient and Precise Training Data Construction Framework for Process-supervised Reward Model in Mathematical Reasoning
Figure 2 for An Efficient and Precise Training Data Construction Framework for Process-supervised Reward Model in Mathematical Reasoning
Figure 3 for An Efficient and Precise Training Data Construction Framework for Process-supervised Reward Model in Mathematical Reasoning
Figure 4 for An Efficient and Precise Training Data Construction Framework for Process-supervised Reward Model in Mathematical Reasoning
Viaarxiv icon