Picture for Lifeng Shang

Lifeng Shang

You Live More Than Once: Towards Hierarchical Skill Meta-Evolving

Add code
May 27, 2026
Viaarxiv icon

EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL

Add code
May 18, 2026
Viaarxiv icon

GRPO-VPS: Enhancing Group Relative Policy Optimization with Verifiable Process Supervision for Effective Reasoning

Add code
Apr 22, 2026
Viaarxiv icon

Switch Attention: Towards Dynamic and Fine-grained Hybrid Transformers

Add code
Mar 27, 2026
Viaarxiv icon

UIS-Digger: Towards Comprehensive Research Agent Systems for Real-world Unindexed Information Seeking

Add code
Mar 11, 2026
Viaarxiv icon

Gradually Excavating External Knowledge for Implicit Complex Question Answering

Add code
Mar 09, 2026
Viaarxiv icon

ARTIS: Agentic Risk-Aware Test-Time Scaling via Iterative Simulation

Add code
Feb 03, 2026
Viaarxiv icon

InfMem: Learning System-2 Memory Control for Long-Context Agent

Add code
Feb 02, 2026
Viaarxiv icon

OVD: On-policy Verbal Distillation

Add code
Jan 29, 2026
Viaarxiv icon

From Verifiable Dot to Reward Chain: Harnessing Verifiable Reference-based Rewards for Reinforcement Learning of Open-ended Generation

Add code
Jan 26, 2026
Viaarxiv icon