Zichong Gu

HiST-VLA: A Hierarchical Spatio-Temporal Vision-Language-Action Model for End-to-End Autonomous Driving

Feb 11, 2026
Via arXiv