Picture for Shuo Lu

Shuo Lu

What If Consensus Lies? Selective-Complementary Reinforcement Learning at Test Time

Add code
Mar 20, 2026
Viaarxiv icon

How to Train Your Deep Research Agent? Prompt, Reward, and Policy Optimization in Search-R1

Add code
Feb 23, 2026
Viaarxiv icon

Do MLLMs Really Understand Space? A Mathematical Reasoning Evaluation

Add code
Feb 12, 2026
Viaarxiv icon

Implicit Strategic Optimization: Rethinking Long-Horizon Decision-Making in Adversarial Poker Environments

Add code
Feb 08, 2026
Viaarxiv icon

ECHO-2: A Large-Scale Distributed Rollout Framework for Cost-Efficient Reinforcement Learning

Add code
Feb 03, 2026
Viaarxiv icon

One Size, Many Fits: Aligning Diverse Group-Wise Click Preferences in Large-Scale Advertising Image Generation

Add code
Feb 02, 2026
Viaarxiv icon

Reassessing the Role of Supervised Fine-Tuning: An Empirical Study in VLM Reasoning

Add code
Dec 14, 2025
Viaarxiv icon

Beyond Boundaries: Leveraging Vision Foundation Models for Source-Free Object Detection

Add code
Nov 10, 2025
Viaarxiv icon

EgoDemoGen: Novel Egocentric Demonstration Generation Enables Viewpoint-Robust Manipulation

Add code
Sep 26, 2025
Figure 1 for EgoDemoGen: Novel Egocentric Demonstration Generation Enables Viewpoint-Robust Manipulation
Figure 2 for EgoDemoGen: Novel Egocentric Demonstration Generation Enables Viewpoint-Robust Manipulation
Figure 3 for EgoDemoGen: Novel Egocentric Demonstration Generation Enables Viewpoint-Robust Manipulation
Figure 4 for EgoDemoGen: Novel Egocentric Demonstration Generation Enables Viewpoint-Robust Manipulation
Viaarxiv icon

GitTaskBench: A Benchmark for Code Agents Solving Real-World Tasks Through Code Repository Leveraging

Add code
Aug 26, 2025
Figure 1 for GitTaskBench: A Benchmark for Code Agents Solving Real-World Tasks Through Code Repository Leveraging
Figure 2 for GitTaskBench: A Benchmark for Code Agents Solving Real-World Tasks Through Code Repository Leveraging
Figure 3 for GitTaskBench: A Benchmark for Code Agents Solving Real-World Tasks Through Code Repository Leveraging
Figure 4 for GitTaskBench: A Benchmark for Code Agents Solving Real-World Tasks Through Code Repository Leveraging
Viaarxiv icon