Shuangshuang Tian

Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters

Feb 11, 2026
Via arXiv

Rethinking Expert Trajectory Utilization in LLM Post-training

Dec 12, 2025
Via arXiv

GlobalRAG: Enhancing Global Reasoning in Multi-hop Question Answering via Reinforcement Learning

Oct 23, 2025
Via arXiv