Picture for Dongbin Zhao

Dongbin Zhao

Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World Model Disentanglement

Add code
Oct 15, 2024
Viaarxiv icon

SELU: Self-Learning Embodied MLLMs in Unknown Environments

Add code
Oct 04, 2024
Viaarxiv icon

Discretizing Continuous Action Space with Unimodal Probability Distributions for On-Policy Reinforcement Learning

Add code
Aug 01, 2024
Viaarxiv icon

PlanAgent: A Multi-modal Large Language Agent for Closed-loop Vehicle Motion Planning

Add code
Jun 04, 2024
Viaarxiv icon

Learning Future Representation with Synthetic Observations for Sample-efficient Reinforcement Learning

Add code
May 20, 2024
Viaarxiv icon

Advancing Object Goal Navigation Through LLM-enhanced Object Affinities Transfer

Add code
Mar 15, 2024
Viaarxiv icon

FM3Q: Factorized Multi-Agent MiniMax Q-Learning for Two-Team Zero-Sum Markov Game

Add code
Feb 01, 2024
Viaarxiv icon

RoboGPT: an intelligent agent of making embodied long-term decisions for daily instruction tasks

Add code
Nov 27, 2023
Viaarxiv icon

Boosting Continuous Control with Consistency Policy

Add code
Oct 10, 2023
Viaarxiv icon

ComSD: Balancing Behavioral Quality and Diversity in Unsupervised Skill Discovery

Add code
Sep 29, 2023
Viaarxiv icon