Picture for Tongshan Xu

Tongshan Xu

Right Is Not Enough: The Pitfalls of Outcome Supervision in Training LLMs for Math Reasoning

Add code
Jun 07, 2025
Viaarxiv icon