Picture for Shuangshuang Tian

Shuangshuang Tian

Rethinking Expert Trajectory Utilization in LLM Post-training

Add code
Dec 12, 2025
Viaarxiv icon

GlobalRAG: Enhancing Global Reasoning in Multi-hop Question Answering via Reinforcement Learning

Add code
Oct 23, 2025
Viaarxiv icon