Picture for Guang Chen

Guang Chen

VEBench:Benchmarking Large Multimodal Models for Real-World Video Editing

Add code
May 05, 2026
Viaarxiv icon

MU-GeNeRF: Multi-view Uncertainty-guided Generalizable Neural Radiance Fields for Distractor-aware Scene

Add code
Apr 20, 2026
Viaarxiv icon

XEmbodied: A Foundation Model with Enhanced Geometric and Physical Cues for Large-Scale Embodied Environments

Add code
Apr 20, 2026
Viaarxiv icon

OneVL: One-Step Latent Reasoning and Planning with Vision-Language Explanation

Add code
Apr 20, 2026
Viaarxiv icon

DeCoNav: Dialog enhanced Long-Horizon Collaborative Vision-Language Navigation

Add code
Apr 14, 2026
Viaarxiv icon

DriveVA: Video Action Models are Zero-Shot Drivers

Add code
Apr 05, 2026
Viaarxiv icon

UniDriveVLA: Unifying Understanding, Perception, and Action Planning for Autonomous Driving

Add code
Apr 02, 2026
Viaarxiv icon

Energy-Aware Imitation Learning for Steering Prediction Using Events and Frames

Add code
Mar 30, 2026
Viaarxiv icon

Toward Physically Consistent Driving Video World Models under Challenging Trajectories

Add code
Mar 25, 2026
Viaarxiv icon

PerlAD: Towards Enhanced Closed-loop End-to-end Autonomous Driving with Pseudo-simulation-based Reinforcement Learning

Add code
Mar 16, 2026
Viaarxiv icon