Yixuan Even Xu

Not All Rollouts are Useful: Down-Sampling Rollouts in LLM Reinforcement Learning

Apr 18, 2025
Via arXiv