Picture for Qingnan Ren

Qingnan Ren

Echo: Decoupling Inference and Training for Large-Scale RL Alignment on Heterogeneous Swarms

Add code
Aug 07, 2025
Viaarxiv icon

SafeWork-R1: Coevolving Safety and Intelligence under the AI-45$^{\circ}$ Law

Add code
Jul 24, 2025
Viaarxiv icon

Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning

Add code
Feb 20, 2025
Viaarxiv icon