Picture for Chongjie Zhang

Chongjie Zhang

GOMAA-Geo: GOal Modality Agnostic Active Geo-localization

Add code
Jun 04, 2024
Viaarxiv icon

Bayesian Design Principles for Offline-to-Online Reinforcement Learning

Add code
May 31, 2024
Viaarxiv icon

Efficient Multi-agent Reinforcement Learning by Planning

Add code
May 20, 2024
Figure 1 for Efficient Multi-agent Reinforcement Learning by Planning
Figure 2 for Efficient Multi-agent Reinforcement Learning by Planning
Figure 3 for Efficient Multi-agent Reinforcement Learning by Planning
Figure 4 for Efficient Multi-agent Reinforcement Learning by Planning
Viaarxiv icon

Learning Interpretable Policies in Hindsight-Observable POMDPs through Partially Supervised Reinforcement Learning

Add code
Feb 14, 2024
Figure 1 for Learning Interpretable Policies in Hindsight-Observable POMDPs through Partially Supervised Reinforcement Learning
Figure 2 for Learning Interpretable Policies in Hindsight-Observable POMDPs through Partially Supervised Reinforcement Learning
Figure 3 for Learning Interpretable Policies in Hindsight-Observable POMDPs through Partially Supervised Reinforcement Learning
Figure 4 for Learning Interpretable Policies in Hindsight-Observable POMDPs through Partially Supervised Reinforcement Learning
Viaarxiv icon

Leveraging Hyperbolic Embeddings for Coarse-to-Fine Robot Design

Add code
Nov 02, 2023
Viaarxiv icon

Unsupervised Behavior Extraction via Random Intent Priors

Add code
Oct 28, 2023
Figure 1 for Unsupervised Behavior Extraction via Random Intent Priors
Figure 2 for Unsupervised Behavior Extraction via Random Intent Priors
Figure 3 for Unsupervised Behavior Extraction via Random Intent Priors
Figure 4 for Unsupervised Behavior Extraction via Random Intent Priors
Viaarxiv icon

Towards Robust Offline Reinforcement Learning under Diverse Data Corruption

Add code
Oct 19, 2023
Figure 1 for Towards Robust Offline Reinforcement Learning under Diverse Data Corruption
Figure 2 for Towards Robust Offline Reinforcement Learning under Diverse Data Corruption
Figure 3 for Towards Robust Offline Reinforcement Learning under Diverse Data Corruption
Figure 4 for Towards Robust Offline Reinforcement Learning under Diverse Data Corruption
Viaarxiv icon

Imitation Learning from Observation with Automatic Discount Scheduling

Add code
Oct 12, 2023
Figure 1 for Imitation Learning from Observation with Automatic Discount Scheduling
Figure 2 for Imitation Learning from Observation with Automatic Discount Scheduling
Figure 3 for Imitation Learning from Observation with Automatic Discount Scheduling
Figure 4 for Imitation Learning from Observation with Automatic Discount Scheduling
Viaarxiv icon

Never Explore Repeatedly in Multi-Agent Reinforcement Learning

Add code
Aug 19, 2023
Figure 1 for Never Explore Repeatedly in Multi-Agent Reinforcement Learning
Figure 2 for Never Explore Repeatedly in Multi-Agent Reinforcement Learning
Figure 3 for Never Explore Repeatedly in Multi-Agent Reinforcement Learning
Figure 4 for Never Explore Repeatedly in Multi-Agent Reinforcement Learning
Viaarxiv icon

IOB: Integrating Optimization Transfer and Behavior Transfer for Multi-Policy Reuse

Add code
Aug 14, 2023
Figure 1 for IOB: Integrating Optimization Transfer and Behavior Transfer for Multi-Policy Reuse
Figure 2 for IOB: Integrating Optimization Transfer and Behavior Transfer for Multi-Policy Reuse
Figure 3 for IOB: Integrating Optimization Transfer and Behavior Transfer for Multi-Policy Reuse
Figure 4 for IOB: Integrating Optimization Transfer and Behavior Transfer for Multi-Policy Reuse
Viaarxiv icon