Picture for Shaoqing Xu

Shaoqing Xu

DriveFuture: Future-Aware Latent World Models for Autonomous Driving

Add code
May 10, 2026
Viaarxiv icon

OneVL: One-Step Latent Reasoning and Planning with Vision-Language Explanation

Add code
Apr 20, 2026
Viaarxiv icon

Think before Go: Hierarchical Reasoning for Image-goal Navigation

Add code
Apr 19, 2026
Viaarxiv icon

DVGT-2: Vision-Geometry-Action Model for Autonomous Driving at Scale

Add code
Apr 01, 2026
Viaarxiv icon

LaST-VLA: Thinking in Latent Spatio-Temporal Space for Vision-Language-Action in Autonomous Driving

Add code
Mar 02, 2026
Viaarxiv icon

Unleashing VLA Potentials in Autonomous Driving via Explicit Learning from Failures

Add code
Mar 01, 2026
Viaarxiv icon

VILTA: A VLM-in-the-Loop Adversary for Enhancing Driving Policy Robustness

Add code
Jan 19, 2026
Viaarxiv icon

DVGT: Driving Visual Geometry Transformer

Add code
Dec 18, 2025
Figure 1 for DVGT: Driving Visual Geometry Transformer
Figure 2 for DVGT: Driving Visual Geometry Transformer
Figure 3 for DVGT: Driving Visual Geometry Transformer
Figure 4 for DVGT: Driving Visual Geometry Transformer
Viaarxiv icon

DGFusion: Dual-guided Fusion for Robust Multi-Modal 3D Object Detection

Add code
Nov 13, 2025
Viaarxiv icon

FPC-VLA: A Vision-Language-Action Framework with a Supervisor for Failure Prediction and Correction

Add code
Sep 04, 2025
Figure 1 for FPC-VLA: A Vision-Language-Action Framework with a Supervisor for Failure Prediction and Correction
Figure 2 for FPC-VLA: A Vision-Language-Action Framework with a Supervisor for Failure Prediction and Correction
Figure 3 for FPC-VLA: A Vision-Language-Action Framework with a Supervisor for Failure Prediction and Correction
Figure 4 for FPC-VLA: A Vision-Language-Action Framework with a Supervisor for Failure Prediction and Correction
Viaarxiv icon