Picture for Zongchuang Zhao

Zongchuang Zhao

MindDrive: A Vision-Language-Action Model for Autonomous Driving via Online Reinforcement Learning

Add code
Dec 16, 2025
Figure 1 for MindDrive: A Vision-Language-Action Model for Autonomous Driving via Online Reinforcement Learning
Figure 2 for MindDrive: A Vision-Language-Action Model for Autonomous Driving via Online Reinforcement Learning
Figure 3 for MindDrive: A Vision-Language-Action Model for Autonomous Driving via Online Reinforcement Learning
Figure 4 for MindDrive: A Vision-Language-Action Model for Autonomous Driving via Online Reinforcement Learning
Viaarxiv icon

NAUTILUS: A Large Multimodal Model for Underwater Scene Understanding

Add code
Oct 31, 2025
Viaarxiv icon

Extending Large Vision-Language Model for Diverse Interactive Tasks in Autonomous Driving

Add code
May 13, 2025
Figure 1 for Extending Large Vision-Language Model for Diverse Interactive Tasks in Autonomous Driving
Figure 2 for Extending Large Vision-Language Model for Diverse Interactive Tasks in Autonomous Driving
Figure 3 for Extending Large Vision-Language Model for Diverse Interactive Tasks in Autonomous Driving
Figure 4 for Extending Large Vision-Language Model for Diverse Interactive Tasks in Autonomous Driving
Viaarxiv icon

ORION: A Holistic End-to-End Autonomous Driving Framework by Vision-Language Instructed Action Generation

Add code
Mar 25, 2025
Viaarxiv icon