Pieter Abbeel

Model-Ensemble Trust-Region Policy Optimization

Oct 05, 2018
Thanard Kurutach, Ignasi Clavera, Yan Duan, Aviv Tamar, Pieter Abbeel

Variational Discriminator Bottleneck: Improving Imitation Learning, Inverse RL, and GANs by Constraining Information Flow

Oct 01, 2018
Xue Bin Peng, Angjoo Kanazawa, Sam Toyer, Pieter Abbeel, Sergey Levine

Learning with Opponent-Learning Awareness

Sep 19, 2018
Jakob N. Foerster, Richard Y. Chen, Maruan Al-Shedivat, Shimon Whiteson, Pieter Abbeel, Igor Mordatch

Model-Based Reinforcement Learning via Meta-Policy Optimization

Sep 14, 2018
Ignasi Clavera, Jonas Rothfuss, John Schulman, Yasuhiro Fujita, Tamim Asfour, Pieter Abbeel

Latent Space Policies for Hierarchical Reinforcement Learning

Sep 03, 2018
Tuomas Haarnoja, Kristian Hartikainen, Pieter Abbeel, Sergey Levine

SOLAR: Deep Structured Latent Representations for Model-Based Reinforcement Learning

Aug 28, 2018
Marvin Zhang, Sharad Vikram, Laura Smith, Pieter Abbeel, Matthew J. Johnson, Sergey Levine

Transfer Learning for Estimating Causal Effects using Neural Networks

Aug 23, 2018
Sören R. Künzel, Bradly C. Stadie, Nikita Vemuri, Varsha Ramakrishnan, Jasjeet S. Sekhon, Pieter Abbeel

Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor

Aug 08, 2018
Tuomas Haarnoja, Aurick Zhou, Pieter Abbeel, Sergey Levine

DeepMimic: Example-Guided Deep Reinforcement Learning of Physics-Based Character Skills

Jul 27, 2018
Xue Bin Peng, Pieter Abbeel, Sergey Levine, Michiel van de Panne

Variational Option Discovery Algorithms

Jul 26, 2018
Joshua Achiam, Harrison Edwards, Dario Amodei, Pieter Abbeel
