Picture for Mike Zheng Shou

Mike Zheng Shou

Supervise What Survives: Geometry-Guided VLA Adaptation from Synthetic Robot Videos

Add code
Jun 23, 2026
Viaarxiv icon

ActionMap: Robot Policy Learning via Voxel Action Heatmap

Add code
Jun 05, 2026
Viaarxiv icon

Dream.exe: Can Video Generation Models Dream Executable Robot Manipulation?

Add code
Jun 04, 2026
Viaarxiv icon

Demo2Tutorial: From Human Experience to Multimodal Software Tutorials

Add code
Jun 02, 2026
Viaarxiv icon

PAI-Studio: Cinematic Video Background Replacement with Camera-Aware Motion

Add code
May 31, 2026
Viaarxiv icon

SWEET: Sparse World Modeling with Image Editing for Embodied Task Execution

Add code
May 19, 2026
Viaarxiv icon

AnyFlow: Any-Step Video Diffusion Model with On-Policy Flow Map Distillation

Add code
May 13, 2026
Viaarxiv icon

World Action Models: The Next Frontier in Embodied AI

Add code
May 12, 2026
Viaarxiv icon

OmniHumanoid: Streaming Cross-Embodiment Video Generation with Paired-Free Adaptation

Add code
May 12, 2026
Viaarxiv icon

Sparkle: Realizing Lively Instruction-Guided Video Background Replacement via Decoupled Guidance

Add code
May 07, 2026
Viaarxiv icon