Shuo Lu

HarnessX: A Composable, Adaptive, and Evolvable Agent Harness Foundry

Jun 12, 2026
Via arXiv

Agents' Last Exam

Jun 03, 2026
Via arXiv

WorldCoder-Bench: Benchmarking Physically Grounded 3D World Synthesis

Jun 01, 2026
Via arXiv

What If Consensus Lies? Selective-Complementary Reinforcement Learning at Test Time

Mar 20, 2026
Via arXiv

How to Train Your Deep Research Agent? Prompt, Reward, and Policy Optimization in Search-R1

Feb 23, 2026
Via arXiv

Do MLLMs Really Understand Space? A Mathematical Reasoning Evaluation

Feb 12, 2026
Via arXiv

Implicit Strategic Optimization: Rethinking Long-Horizon Decision-Making in Adversarial Poker Environments

Feb 08, 2026
Via arXiv

ECHO-2: A Large-Scale Distributed Rollout Framework for Cost-Efficient Reinforcement Learning

Feb 03, 2026
Via arXiv

One Size, Many Fits: Aligning Diverse Group-Wise Click Preferences in Large-Scale Advertising Image Generation

Feb 02, 2026
Via arXiv

Reassessing the Role of Supervised Fine-Tuning: An Empirical Study in VLM Reasoning

Dec 14, 2025
Via arXiv