Picture for Xinyi Mao

Xinyi Mao

Towards Multimodal Lifelong Understanding: A Dataset and Agentic Baseline

Add code
Mar 05, 2026
Viaarxiv icon

Vidarc: Embodied Video Diffusion Model for Closed-loop Control

Add code
Dec 19, 2025
Figure 1 for Vidarc: Embodied Video Diffusion Model for Closed-loop Control
Figure 2 for Vidarc: Embodied Video Diffusion Model for Closed-loop Control
Figure 3 for Vidarc: Embodied Video Diffusion Model for Closed-loop Control
Figure 4 for Vidarc: Embodied Video Diffusion Model for Closed-loop Control
Viaarxiv icon

ManiBox: Enhancing Spatial Grasping Generalization via Scalable Simulation Data Generation

Add code
Nov 04, 2024
Figure 1 for ManiBox: Enhancing Spatial Grasping Generalization via Scalable Simulation Data Generation
Figure 2 for ManiBox: Enhancing Spatial Grasping Generalization via Scalable Simulation Data Generation
Figure 3 for ManiBox: Enhancing Spatial Grasping Generalization via Scalable Simulation Data Generation
Figure 4 for ManiBox: Enhancing Spatial Grasping Generalization via Scalable Simulation Data Generation
Viaarxiv icon