Picture for Qian Qiu

Qian Qiu

Beyond Uniform Token-Level Trust Region in LLM Reinforcement Learning

Add code
Jun 09, 2026
Viaarxiv icon