Picture for Dongbin Zhao

Dongbin Zhao

Saliency-Guided Representation with Consistency Policy Learning for Visual Unsupervised Reinforcement Learning

Add code
Apr 07, 2026
Viaarxiv icon

Posterior Optimization with Clipped Objective for Bridging Efficiency and Stability in Generative Policy Learning

Add code
Apr 02, 2026
Viaarxiv icon

Dynamic Dual-Granularity Skill Bank for Agentic RL

Add code
Mar 30, 2026
Viaarxiv icon

Revisiting On-Policy Distillation: Empirical Failure Modes and Simple Fixes

Add code
Mar 26, 2026
Viaarxiv icon

Latent-WAM: Latent World Action Modeling for End-to-End Autonomous Driving

Add code
Mar 25, 2026
Viaarxiv icon

DreamerAD: Efficient Reinforcement Learning via Latent World Model for Autonomous Driving

Add code
Mar 25, 2026
Viaarxiv icon

Learning from Mistakes: Post-Training for Driving VLA with Takeover Data

Add code
Mar 16, 2026
Viaarxiv icon

PerlAD: Towards Enhanced Closed-loop End-to-end Autonomous Driving with Pseudo-simulation-based Reinforcement Learning

Add code
Mar 16, 2026
Viaarxiv icon

InCoM: Intent-Driven Perception and Structured Coordination for Whole-Body Mobile Manipulation

Add code
Feb 26, 2026
Viaarxiv icon

WoVR: World Models as Reliable Simulators for Post-Training VLA Policies with RL

Add code
Feb 15, 2026
Viaarxiv icon