Picture for Chenxu Dang

Chenxu Dang

SAMoE-VLA: A Scene Adaptive Mixture-of-Experts Vision-Language-Action Model for Autonomous Driving

Add code
Mar 09, 2026
Viaarxiv icon

VGGDrive: Empowering Vision-Language Models with Cross-View Geometric Grounding for Autonomous Driving

Add code
Feb 24, 2026
Viaarxiv icon

DriveFine: Refining-Augmented Masked Diffusion VLA for Precise and Robust Driving

Add code
Feb 16, 2026
Viaarxiv icon

From Representational Complementarity to Dual Systems: Synergizing VLM and Vision-Only Backbones for End-to-End Driving

Add code
Feb 11, 2026
Viaarxiv icon

SparseOccVLA: Bridging Occupancy and Vision-Language Models via Sparse Queries for Unified 4D Scene Understanding and Planning

Add code
Jan 10, 2026
Viaarxiv icon

SDGOCC: Semantic and Depth-Guided Bird's-Eye View Transformation for 3D Multimodal Occupancy Prediction

Add code
Jul 22, 2025
Viaarxiv icon