Vitchyr H. Pong

Offline Meta-Reinforcement Learning with Online Self-Supervision

Jul 19, 2021
Via arXiv

DisCo RL: Distribution-Conditioned Reinforcement Learning for General-Purpose Policies

Apr 23, 2021
Via arXiv

Outcome-Driven Reinforcement Learning via Variational Inference

Apr 20, 2021
Via arXiv

Planning with Goal-Conditioned Policies

Nov 19, 2019
Via arXiv

Skew-Fit: State-Covering Self-Supervised Reinforcement Learning

Mar 08, 2019
Via arXiv