Picture for Shaomian Zheng

Shaomian Zheng

On Representation Redundancy in Large-Scale Instruction Tuning Data Selection

Add code
Feb 14, 2026
Viaarxiv icon

Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model

Add code
Oct 21, 2025
Figure 1 for Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model
Figure 2 for Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model
Figure 3 for Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model
Figure 4 for Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model
Viaarxiv icon

Ring-lite: Scalable Reasoning via C3PO-Stabilized Reinforcement Learning for LLMs

Add code
Jun 18, 2025
Viaarxiv icon

Holistic Capability Preservation: Towards Compact Yet Comprehensive Reasoning Models

Add code
Apr 09, 2025
Viaarxiv icon