Picture for Pieter Abbeel

Pieter Abbeel

UC Berkeley

Model-Ensemble Trust-Region Policy Optimization

Add code
Oct 05, 2018
Figure 1 for Model-Ensemble Trust-Region Policy Optimization
Figure 2 for Model-Ensemble Trust-Region Policy Optimization
Figure 3 for Model-Ensemble Trust-Region Policy Optimization
Figure 4 for Model-Ensemble Trust-Region Policy Optimization
Viaarxiv icon

Variational Discriminator Bottleneck: Improving Imitation Learning, Inverse RL, and GANs by Constraining Information Flow

Add code
Oct 01, 2018
Figure 1 for Variational Discriminator Bottleneck: Improving Imitation Learning, Inverse RL, and GANs by Constraining Information Flow
Figure 2 for Variational Discriminator Bottleneck: Improving Imitation Learning, Inverse RL, and GANs by Constraining Information Flow
Figure 3 for Variational Discriminator Bottleneck: Improving Imitation Learning, Inverse RL, and GANs by Constraining Information Flow
Figure 4 for Variational Discriminator Bottleneck: Improving Imitation Learning, Inverse RL, and GANs by Constraining Information Flow
Viaarxiv icon

Learning with Opponent-Learning Awareness

Add code
Sep 19, 2018
Figure 1 for Learning with Opponent-Learning Awareness
Figure 2 for Learning with Opponent-Learning Awareness
Figure 3 for Learning with Opponent-Learning Awareness
Figure 4 for Learning with Opponent-Learning Awareness
Viaarxiv icon

Model-Based Reinforcement Learning via Meta-Policy Optimization

Add code
Sep 14, 2018
Figure 1 for Model-Based Reinforcement Learning via Meta-Policy Optimization
Figure 2 for Model-Based Reinforcement Learning via Meta-Policy Optimization
Figure 3 for Model-Based Reinforcement Learning via Meta-Policy Optimization
Figure 4 for Model-Based Reinforcement Learning via Meta-Policy Optimization
Viaarxiv icon

Latent Space Policies for Hierarchical Reinforcement Learning

Add code
Sep 03, 2018
Figure 1 for Latent Space Policies for Hierarchical Reinforcement Learning
Figure 2 for Latent Space Policies for Hierarchical Reinforcement Learning
Figure 3 for Latent Space Policies for Hierarchical Reinforcement Learning
Figure 4 for Latent Space Policies for Hierarchical Reinforcement Learning
Viaarxiv icon

Transfer Learning for Estimating Causal Effects using Neural Networks

Add code
Aug 23, 2018
Figure 1 for Transfer Learning for Estimating Causal Effects using Neural Networks
Figure 2 for Transfer Learning for Estimating Causal Effects using Neural Networks
Figure 3 for Transfer Learning for Estimating Causal Effects using Neural Networks
Figure 4 for Transfer Learning for Estimating Causal Effects using Neural Networks
Viaarxiv icon

Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor

Add code
Aug 08, 2018
Figure 1 for Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Figure 2 for Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Figure 3 for Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Figure 4 for Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Viaarxiv icon

DeepMimic: Example-Guided Deep Reinforcement Learning of Physics-Based Character Skills

Add code
Jul 27, 2018
Figure 1 for DeepMimic: Example-Guided Deep Reinforcement Learning of Physics-Based Character Skills
Figure 2 for DeepMimic: Example-Guided Deep Reinforcement Learning of Physics-Based Character Skills
Figure 3 for DeepMimic: Example-Guided Deep Reinforcement Learning of Physics-Based Character Skills
Figure 4 for DeepMimic: Example-Guided Deep Reinforcement Learning of Physics-Based Character Skills
Viaarxiv icon

Variational Option Discovery Algorithms

Add code
Jul 26, 2018
Figure 1 for Variational Option Discovery Algorithms
Figure 2 for Variational Option Discovery Algorithms
Figure 3 for Variational Option Discovery Algorithms
Figure 4 for Variational Option Discovery Algorithms
Viaarxiv icon

Learning Generalized Reactive Policies using Deep Neural Networks

Add code
Jul 25, 2018
Figure 1 for Learning Generalized Reactive Policies using Deep Neural Networks
Figure 2 for Learning Generalized Reactive Policies using Deep Neural Networks
Figure 3 for Learning Generalized Reactive Policies using Deep Neural Networks
Figure 4 for Learning Generalized Reactive Policies using Deep Neural Networks
Viaarxiv icon