Mike Zheng Shou

SWEET: Sparse World Modeling with Image Editing for Embodied Task Execution

May 19, 2026
Via arXiv

AnyFlow: Any-Step Video Diffusion Model with On-Policy Flow Map Distillation

May 13, 2026
Via arXiv

World Action Models: The Next Frontier in Embodied AI

May 12, 2026
Via arXiv

OmniHumanoid: Streaming Cross-Embodiment Video Generation with Paired-Free Adaptation

May 12, 2026
Via arXiv

Sparkle: Realizing Lively Instruction-Guided Video Background Replacement via Decoupled Guidance

May 07, 2026
Via arXiv

Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond

Apr 24, 2026
Via arXiv

GameWorld: Towards Standardized and Verifiable Evaluation of Multimodal Game Agents

Apr 08, 2026
Via arXiv

UENR-600K: A Large-Scale Physically Grounded Dataset for Nighttime Video Deraining

Apr 06, 2026
Via arXiv

OpenWorldLib: A Unified Codebase and Definition of Advanced World Models

Apr 06, 2026
Via arXiv

P-Flow: Prompting Visual Effects Generation

Mar 23, 2026
Via arXiv