Picture for Lifeng Shang

Lifeng Shang

LoopMoE: Unifying Iterative Computation with Mixture-of-Experts for Language Modeling

Add code
Jun 03, 2026
Viaarxiv icon

What Makes Interaction Trajectories Effective for Training Terminal Agents?

Add code
Jun 02, 2026
Viaarxiv icon

You Live More Than Once: Towards Hierarchical Skill Meta-Evolving

Add code
May 27, 2026
Viaarxiv icon

EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL

Add code
May 18, 2026
Viaarxiv icon

GRPO-VPS: Enhancing Group Relative Policy Optimization with Verifiable Process Supervision for Effective Reasoning

Add code
Apr 22, 2026
Viaarxiv icon

Switch Attention: Towards Dynamic and Fine-grained Hybrid Transformers

Add code
Mar 27, 2026
Viaarxiv icon

UIS-Digger: Towards Comprehensive Research Agent Systems for Real-world Unindexed Information Seeking

Add code
Mar 11, 2026
Viaarxiv icon

Gradually Excavating External Knowledge for Implicit Complex Question Answering

Add code
Mar 09, 2026
Viaarxiv icon

ARTIS: Agentic Risk-Aware Test-Time Scaling via Iterative Simulation

Add code
Feb 03, 2026
Viaarxiv icon

InfMem: Learning System-2 Memory Control for Long-Context Agent

Add code
Feb 02, 2026
Viaarxiv icon