Picture for Chengen Xie

Chengen Xie

LatentVLA: Efficient Vision-Language Models for Autonomous Driving via Latent Action Prediction

Add code
Jan 09, 2026
Viaarxiv icon

AgiBot World Colosseo: A Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems

Add code
Mar 09, 2025
Viaarxiv icon

DriveLM: Driving with Graph Visual Question Answering

Add code
Dec 21, 2023
Figure 1 for DriveLM: Driving with Graph Visual Question Answering
Figure 2 for DriveLM: Driving with Graph Visual Question Answering
Figure 3 for DriveLM: Driving with Graph Visual Question Answering
Figure 4 for DriveLM: Driving with Graph Visual Question Answering
Viaarxiv icon