Picture for Shilong Liu

Shilong Liu

EEVEE: Towards Test-time Prompt Learning in the Real World for Self-Improving Agents

Add code
Jun 09, 2026
Viaarxiv icon

Any2Poster: Any-Source Poster Generation Across Modalities and Domains

Add code
Jun 01, 2026
Viaarxiv icon

LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

Add code
May 27, 2026
Viaarxiv icon

MemEye: A Visual-Centric Evaluation Framework for Multimodal Agent Memory

Add code
May 14, 2026
Viaarxiv icon

Wan-R1: Verifiable-Reinforcement Learning for Video Reasoning

Add code
Mar 29, 2026
Viaarxiv icon

UI-Mem: Self-Evolving Experience Memory for Online Reinforcement Learning in Mobile GUI Agents

Add code
Feb 05, 2026
Viaarxiv icon

Avenir-Web: Human-Experience-Imitating Multimodal Web Agents with Mixture of Grounding Experts

Add code
Feb 02, 2026
Viaarxiv icon

MiLDEdit: Reasoning-Based Multi-Layer Design Document Editing

Add code
Jan 08, 2026
Viaarxiv icon

CubeBench: Diagnosing Interactive, Long-Horizon Spatial Reasoning Under Partial Observations

Add code
Dec 30, 2025
Viaarxiv icon

Web World Models

Add code
Dec 29, 2025
Viaarxiv icon