Jingyang He

Demo-JEPA: Joint-Embedding Predictive Architecture for One-shot Cross-Embodiment Imitation

May 20, 2026, via arXiv

VEGA: Visual Encoder Grounding Alignment for Spatially-Aware Vision-Language-Action Models

May 11, 2026, via arXiv

RoboMIND 2.0: A Multimodal, Bimanual Mobile Manipulation Dataset for Generalizable Embodied Intelligence

Dec 31, 2025, via arXiv

RoboMIND: Benchmark on Multi-embodiment Intelligence Normative Data for Robot Manipulation

Dec 18, 2024, via arXiv