Picture for Dongbin Zhao

Dongbin Zhao

WoVR: World Models as Reliable Simulators for Post-Training VLA Policies with RL

Add code
Feb 15, 2026
Viaarxiv icon

Dual-Granularity Contrastive Reward via Generated Episodic Guidance for Efficient Embodied RL

Add code
Feb 13, 2026
Viaarxiv icon

LDA-1B: Scaling Latent Dynamics Action Model via Universal Embodied Data Ingestion

Add code
Feb 12, 2026
Viaarxiv icon

Towards Long-Lived Robots: Continual Learning VLA Models via Reinforcement Fine-Tuning

Add code
Feb 11, 2026
Viaarxiv icon

Spec-o3: A Tool-Augmented Vision-Language Agent for Rare Celestial Object Candidate Vetting via Automated Spectral Inspection

Add code
Jan 10, 2026
Viaarxiv icon

Videos are Sample-Efficient Supervisions: Behavior Cloning from Videos via Latent Representations

Add code
Dec 25, 2025
Figure 1 for Videos are Sample-Efficient Supervisions: Behavior Cloning from Videos via Latent Representations
Figure 2 for Videos are Sample-Efficient Supervisions: Behavior Cloning from Videos via Latent Representations
Figure 3 for Videos are Sample-Efficient Supervisions: Behavior Cloning from Videos via Latent Representations
Figure 4 for Videos are Sample-Efficient Supervisions: Behavior Cloning from Videos via Latent Representations
Viaarxiv icon

TakeAD: Preference-based Post-optimization for End-to-end Autonomous Driving with Expert Takeover Data

Add code
Dec 22, 2025
Figure 1 for TakeAD: Preference-based Post-optimization for End-to-end Autonomous Driving with Expert Takeover Data
Figure 2 for TakeAD: Preference-based Post-optimization for End-to-end Autonomous Driving with Expert Takeover Data
Figure 3 for TakeAD: Preference-based Post-optimization for End-to-end Autonomous Driving with Expert Takeover Data
Figure 4 for TakeAD: Preference-based Post-optimization for End-to-end Autonomous Driving with Expert Takeover Data
Viaarxiv icon

DiffuDepGrasp: Diffusion-based Depth Noise Modeling Empowers Sim2Real Robotic Grasping

Add code
Nov 17, 2025
Figure 1 for DiffuDepGrasp: Diffusion-based Depth Noise Modeling Empowers Sim2Real Robotic Grasping
Figure 2 for DiffuDepGrasp: Diffusion-based Depth Noise Modeling Empowers Sim2Real Robotic Grasping
Figure 3 for DiffuDepGrasp: Diffusion-based Depth Noise Modeling Empowers Sim2Real Robotic Grasping
Figure 4 for DiffuDepGrasp: Diffusion-based Depth Noise Modeling Empowers Sim2Real Robotic Grasping
Viaarxiv icon

CriticSearch: Fine-Grained Credit Assignment for Search Agents via a Retrospective Critic

Add code
Nov 15, 2025
Viaarxiv icon

ARAC: Adaptive Regularized Multi-Agent Soft Actor-Critic in Graph-Structured Adversarial Games

Add code
Nov 11, 2025
Figure 1 for ARAC: Adaptive Regularized Multi-Agent Soft Actor-Critic in Graph-Structured Adversarial Games
Figure 2 for ARAC: Adaptive Regularized Multi-Agent Soft Actor-Critic in Graph-Structured Adversarial Games
Figure 3 for ARAC: Adaptive Regularized Multi-Agent Soft Actor-Critic in Graph-Structured Adversarial Games
Figure 4 for ARAC: Adaptive Regularized Multi-Agent Soft Actor-Critic in Graph-Structured Adversarial Games
Viaarxiv icon