Hongzhi Luan

Ring-lite: Scalable Reasoning via C3PO-Stabilized Reinforcement Learning for LLMs

Jun 18, 2025
Via arXiv