Picture for Shu Liu

Shu Liu

ARPO:End-to-End Policy Optimization for GUI Agents with Experience Replay

Add code
May 22, 2025
Viaarxiv icon

Accelerated Markov Chain Monte Carlo Algorithms on Discrete States

Add code
May 19, 2025
Viaarxiv icon

VisionReasoner: Unified Visual Perception and Reasoning via Reinforcement Learning

Add code
May 17, 2025
Viaarxiv icon

Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model

Add code
Apr 11, 2025
Viaarxiv icon

Fùxì: A Benchmark for Evaluating Language Models on Ancient Chinese Text Understanding and Generation

Add code
Mar 20, 2025
Viaarxiv icon

STEVE: AStep Verification Pipeline for Computer-use Agent Training

Add code
Mar 16, 2025
Viaarxiv icon

The Danger of Overthinking: Examining the Reasoning-Action Dilemma in Agentic Tasks

Add code
Feb 12, 2025
Viaarxiv icon

LLMs Can Easily Learn to Reason from Demonstrations Structure, not content, is what matters!

Add code
Feb 11, 2025
Viaarxiv icon

Adaptive Semantic Prompt Caching with VectorQ

Add code
Feb 06, 2025
Viaarxiv icon

Locality-aware Fair Scheduling in LLM Serving

Add code
Jan 24, 2025
Viaarxiv icon