Picture for Dongbin Zhao

Dongbin Zhao

Learning from Mistakes: Post-Training for Driving VLA with Takeover Data

Add code
Mar 16, 2026
Viaarxiv icon

PerlAD: Towards Enhanced Closed-loop End-to-end Autonomous Driving with Pseudo-simulation-based Reinforcement Learning

Add code
Mar 16, 2026
Viaarxiv icon

InCoM: Intent-Driven Perception and Structured Coordination for Whole-Body Mobile Manipulation

Add code
Feb 26, 2026
Viaarxiv icon

WoVR: World Models as Reliable Simulators for Post-Training VLA Policies with RL

Add code
Feb 15, 2026
Viaarxiv icon

Dual-Granularity Contrastive Reward via Generated Episodic Guidance for Efficient Embodied RL

Add code
Feb 13, 2026
Viaarxiv icon

LDA-1B: Scaling Latent Dynamics Action Model via Universal Embodied Data Ingestion

Add code
Feb 12, 2026
Viaarxiv icon

Towards Long-Lived Robots: Continual Learning VLA Models via Reinforcement Fine-Tuning

Add code
Feb 11, 2026
Viaarxiv icon

Spec-o3: A Tool-Augmented Vision-Language Agent for Rare Celestial Object Candidate Vetting via Automated Spectral Inspection

Add code
Jan 10, 2026
Viaarxiv icon

Videos are Sample-Efficient Supervisions: Behavior Cloning from Videos via Latent Representations

Add code
Dec 25, 2025
Figure 1 for Videos are Sample-Efficient Supervisions: Behavior Cloning from Videos via Latent Representations
Figure 2 for Videos are Sample-Efficient Supervisions: Behavior Cloning from Videos via Latent Representations
Figure 3 for Videos are Sample-Efficient Supervisions: Behavior Cloning from Videos via Latent Representations
Figure 4 for Videos are Sample-Efficient Supervisions: Behavior Cloning from Videos via Latent Representations
Viaarxiv icon

TakeAD: Preference-based Post-optimization for End-to-end Autonomous Driving with Expert Takeover Data

Add code
Dec 22, 2025
Figure 1 for TakeAD: Preference-based Post-optimization for End-to-end Autonomous Driving with Expert Takeover Data
Figure 2 for TakeAD: Preference-based Post-optimization for End-to-end Autonomous Driving with Expert Takeover Data
Figure 3 for TakeAD: Preference-based Post-optimization for End-to-end Autonomous Driving with Expert Takeover Data
Figure 4 for TakeAD: Preference-based Post-optimization for End-to-end Autonomous Driving with Expert Takeover Data
Viaarxiv icon