Picture for Modi Shi

Modi Shi

EgoHumanoid: Unlocking In-the-Wild Loco-Manipulation with Robot-Free Egocentric Demonstration

Add code
Feb 10, 2026
Viaarxiv icon

$χ_{0}$: Resource-Aware Robust Manipulation via Taming Distributional Inconsistencies

Add code
Feb 09, 2026
Viaarxiv icon

WholeBodyVLA: Towards Unified Latent VLA for Whole-Body Loco-Manipulation Control

Add code
Dec 15, 2025
Viaarxiv icon

EmbodiedOneVision: Interleaved Vision-Text-Action Pretraining for General Robot Control

Add code
Aug 28, 2025
Viaarxiv icon

Is Diversity All You Need for Scalable Robotic Manipulation?

Add code
Jul 08, 2025
Viaarxiv icon

Hume: Introducing System-2 Thinking in Visual-Language-Action Model

Add code
May 29, 2025
Figure 1 for Hume: Introducing System-2 Thinking in Visual-Language-Action Model
Figure 2 for Hume: Introducing System-2 Thinking in Visual-Language-Action Model
Figure 3 for Hume: Introducing System-2 Thinking in Visual-Language-Action Model
Figure 4 for Hume: Introducing System-2 Thinking in Visual-Language-Action Model
Viaarxiv icon

Genie Centurion: Accelerating Scalable Real-World Robot Training with Human Rewind-and-Refine Guidance

Add code
May 24, 2025
Figure 1 for Genie Centurion: Accelerating Scalable Real-World Robot Training with Human Rewind-and-Refine Guidance
Figure 2 for Genie Centurion: Accelerating Scalable Real-World Robot Training with Human Rewind-and-Refine Guidance
Figure 3 for Genie Centurion: Accelerating Scalable Real-World Robot Training with Human Rewind-and-Refine Guidance
Figure 4 for Genie Centurion: Accelerating Scalable Real-World Robot Training with Human Rewind-and-Refine Guidance
Viaarxiv icon

AgiBot World Colosseo: A Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems

Add code
Mar 09, 2025
Viaarxiv icon

Generalizing 6-DoF Grasp Detection via Domain Prior Knowledge

Add code
Apr 02, 2024
Figure 1 for Generalizing 6-DoF Grasp Detection via Domain Prior Knowledge
Figure 2 for Generalizing 6-DoF Grasp Detection via Domain Prior Knowledge
Figure 3 for Generalizing 6-DoF Grasp Detection via Domain Prior Knowledge
Figure 4 for Generalizing 6-DoF Grasp Detection via Domain Prior Knowledge
Viaarxiv icon