Ganqu Cui

Draft-OPD: On-Policy Distillation for Speculative Draft Models

May 28, 2026
Via arXiv

Post-Trained MoE Can Skip Half Experts via Self-Distillation

May 18, 2026
Via arXiv

Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling

May 13, 2026
Via arXiv

Teaching Thinking Models to Reason with Tools: A Full-Pipeline Recipe for Tool-Integrated Reasoning

May 07, 2026
Via arXiv

TEMPO: Scaling Test-time Training for Large Reasoning Models

Apr 21, 2026
Via arXiv

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

Mar 26, 2026
Via arXiv

InCoder-32B: Code Foundation Model for Industrial Scenarios

Mar 17, 2026
Via arXiv

How Far Can Unsupervised RLVR Scale LLM Training?

Mar 09, 2026
Via arXiv

Think Longer to Explore Deeper: Learn to Explore In-Context via Length-Incentivized Reinforcement Learning

Feb 12, 2026
Via arXiv

P1-VL: Bridging Visual Perception and Scientific Reasoning in Physics Olympiads

Feb 10, 2026
Via arXiv