Picture for Qingzhi Chen

Qingzhi Chen

Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning

Add code
Feb 01, 2026
Viaarxiv icon