Yuepeng Sheng

Each Prompt Matters: Scaling Reinforcement Learning Without Wasting Rollouts on Hundred-Billion-Scale MoE

Dec 08, 2025
Via arXiv

Compass-Thinker-7B Technical Report

Aug 12, 2025
Via arXiv