Picture for Chongjie Zhang

Chongjie Zhang

Rethinking Goal-conditioned Supervised Learning and Its Connection to Offline RL

Add code
Feb 14, 2022
Figure 1 for Rethinking Goal-conditioned Supervised Learning and Its Connection to Offline RL
Figure 2 for Rethinking Goal-conditioned Supervised Learning and Its Connection to Offline RL
Figure 3 for Rethinking Goal-conditioned Supervised Learning and Its Connection to Offline RL
Figure 4 for Rethinking Goal-conditioned Supervised Learning and Its Connection to Offline RL
Viaarxiv icon

MOORe: Model-based Offline-to-Online Reinforcement Learning

Add code
Jan 25, 2022
Figure 1 for MOORe: Model-based Offline-to-Online Reinforcement Learning
Figure 2 for MOORe: Model-based Offline-to-Online Reinforcement Learning
Figure 3 for MOORe: Model-based Offline-to-Online Reinforcement Learning
Figure 4 for MOORe: Model-based Offline-to-Online Reinforcement Learning
Viaarxiv icon

Self-Organized Polynomial-Time Coordination Graphs

Add code
Dec 07, 2021
Figure 1 for Self-Organized Polynomial-Time Coordination Graphs
Figure 2 for Self-Organized Polynomial-Time Coordination Graphs
Figure 3 for Self-Organized Polynomial-Time Coordination Graphs
Figure 4 for Self-Organized Polynomial-Time Coordination Graphs
Viaarxiv icon

Episodic Multi-agent Reinforcement Learning with Curiosity-Driven Exploration

Add code
Nov 22, 2021
Figure 1 for Episodic Multi-agent Reinforcement Learning with Curiosity-Driven Exploration
Figure 2 for Episodic Multi-agent Reinforcement Learning with Curiosity-Driven Exploration
Figure 3 for Episodic Multi-agent Reinforcement Learning with Curiosity-Driven Exploration
Figure 4 for Episodic Multi-agent Reinforcement Learning with Curiosity-Driven Exploration
Viaarxiv icon

Offline Reinforcement Learning with Value-based Episodic Memory

Add code
Oct 19, 2021
Figure 1 for Offline Reinforcement Learning with Value-based Episodic Memory
Figure 2 for Offline Reinforcement Learning with Value-based Episodic Memory
Figure 3 for Offline Reinforcement Learning with Value-based Episodic Memory
Figure 4 for Offline Reinforcement Learning with Value-based Episodic Memory
Viaarxiv icon

Containerized Distributed Value-Based Multi-Agent Reinforcement Learning

Add code
Oct 15, 2021
Figure 1 for Containerized Distributed Value-Based Multi-Agent Reinforcement Learning
Figure 2 for Containerized Distributed Value-Based Multi-Agent Reinforcement Learning
Figure 3 for Containerized Distributed Value-Based Multi-Agent Reinforcement Learning
Figure 4 for Containerized Distributed Value-Based Multi-Agent Reinforcement Learning
Viaarxiv icon

LINDA: Multi-Agent Local Information Decomposition for Awareness of Teammates

Add code
Oct 15, 2021
Figure 1 for LINDA: Multi-Agent Local Information Decomposition for Awareness of Teammates
Figure 2 for LINDA: Multi-Agent Local Information Decomposition for Awareness of Teammates
Figure 3 for LINDA: Multi-Agent Local Information Decomposition for Awareness of Teammates
Figure 4 for LINDA: Multi-Agent Local Information Decomposition for Awareness of Teammates
Viaarxiv icon

Offline Reinforcement Learning with Reverse Model-based Imagination

Add code
Oct 01, 2021
Figure 1 for Offline Reinforcement Learning with Reverse Model-based Imagination
Figure 2 for Offline Reinforcement Learning with Reverse Model-based Imagination
Figure 3 for Offline Reinforcement Learning with Reverse Model-based Imagination
Figure 4 for Offline Reinforcement Learning with Reverse Model-based Imagination
Viaarxiv icon

On the Estimation Bias in Double Q-Learning

Add code
Sep 29, 2021
Figure 1 for On the Estimation Bias in Double Q-Learning
Figure 2 for On the Estimation Bias in Double Q-Learning
Figure 3 for On the Estimation Bias in Double Q-Learning
Figure 4 for On the Estimation Bias in Double Q-Learning
Viaarxiv icon

Context-Aware Sparse Deep Coordination Graphs

Add code
Jun 05, 2021
Figure 1 for Context-Aware Sparse Deep Coordination Graphs
Figure 2 for Context-Aware Sparse Deep Coordination Graphs
Figure 3 for Context-Aware Sparse Deep Coordination Graphs
Figure 4 for Context-Aware Sparse Deep Coordination Graphs
Viaarxiv icon