Picture for Shenyuan Gao

Shenyuan Gao

Seeing Farther and Smarter: Value-Guided Multi-Path Reflection for VLM Policy Optimization

Add code
Feb 22, 2026
Viaarxiv icon

World Action Models are Zero-shot Policies

Add code
Feb 17, 2026
Viaarxiv icon

DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos

Add code
Feb 06, 2026
Viaarxiv icon

StaMo: Unsupervised Learning of Generalizable Robot Motion from Compact State Representation

Add code
Oct 06, 2025
Viaarxiv icon

ReSim: Reliable World Simulation for Autonomous Driving

Add code
Jun 11, 2025
Viaarxiv icon

UniVLA: Learning to Act Anywhere with Task-centric Latent Actions

Add code
May 09, 2025
Viaarxiv icon

AdaWorld: Learning Adaptable World Models with Latent Actions

Add code
Mar 24, 2025
Figure 1 for AdaWorld: Learning Adaptable World Models with Latent Actions
Figure 2 for AdaWorld: Learning Adaptable World Models with Latent Actions
Figure 3 for AdaWorld: Learning Adaptable World Models with Latent Actions
Figure 4 for AdaWorld: Learning Adaptable World Models with Latent Actions
Viaarxiv icon

AgiBot World Colosseo: A Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems

Add code
Mar 09, 2025
Viaarxiv icon

Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability

Add code
May 27, 2024
Figure 1 for Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability
Figure 2 for Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability
Figure 3 for Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability
Figure 4 for Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability
Viaarxiv icon

Content-aware Masked Image Modeling Transformer for Stereo Image Compression

Add code
Mar 20, 2024
Viaarxiv icon