Bolei Zhou

AutoVLA: A Vision-Language-Action Model for End-to-End Autonomous Driving with Adaptive Reasoning and Reinforcement Fine-Tuning

Jun 16, 2025
Via arXiv

Adv-BMT: Bidirectional Motion Transformer for Safety-Critical Traffic Scenario Generation

Jun 11, 2025
Via arXiv

Robot-Gated Interactive Imitation Learning with Adaptive Intervention Mechanism

Jun 10, 2025
Via arXiv

Dreamland: Controllable World Creation with Simulator and Generative Models

Jun 09, 2025
Via arXiv

Towards Autonomous Micromobility through Scalable Urban Simulation

May 01, 2025
Via arXiv

X-Fusion: Introducing New Modality to Frozen Large Language Models

Apr 29, 2025
Via arXiv

Data-Efficient Learning from Human Interventions for Mobile Robots

Mar 06, 2025
Via arXiv

Learning from Active Human Involvement through Proxy Value Propagation

Feb 05, 2025
Via arXiv

Embodied Scene Understanding for Vision Language Models via MetaVQA

Jan 15, 2025
Via arXiv

Vid2Sim: Realistic and Interactive Simulation from Video for Urban Navigation

Jan 14, 2025
Via arXiv