Picture for Hongyi James Cai

Hongyi James Cai

How Much Backtracking is Enough? Exploring the Interplay of SFT and RL in Enhancing LLM Reasoning

Add code
May 30, 2025
Viaarxiv icon