Alert button
Picture for Pieter Abbeel

Pieter Abbeel

Alert button

An Algorithmic Perspective on Imitation Learning

Nov 16, 2018
Takayuki Osa, Joni Pajarinen, Gerhard Neumann, J. Andrew Bagnell, Pieter Abbeel, Jan Peters

Figure 1 for An Algorithmic Perspective on Imitation Learning
Figure 2 for An Algorithmic Perspective on Imitation Learning
Figure 3 for An Algorithmic Perspective on Imitation Learning
Figure 4 for An Algorithmic Perspective on Imitation Learning
Viaarxiv icon

Modular Architecture for StarCraft II with Deep Reinforcement Learning

Nov 08, 2018
Dennis Lee, Haoran Tang, Jeffrey O Zhang, Huazhe Xu, Trevor Darrell, Pieter Abbeel

Figure 1 for Modular Architecture for StarCraft II with Deep Reinforcement Learning
Figure 2 for Modular Architecture for StarCraft II with Deep Reinforcement Learning
Figure 3 for Modular Architecture for StarCraft II with Deep Reinforcement Learning
Figure 4 for Modular Architecture for StarCraft II with Deep Reinforcement Learning
Viaarxiv icon

One-Shot Hierarchical Imitation Learning of Compound Visuomotor Tasks

Oct 25, 2018
Tianhe Yu, Pieter Abbeel, Sergey Levine, Chelsea Finn

Figure 1 for One-Shot Hierarchical Imitation Learning of Compound Visuomotor Tasks
Figure 2 for One-Shot Hierarchical Imitation Learning of Compound Visuomotor Tasks
Figure 3 for One-Shot Hierarchical Imitation Learning of Compound Visuomotor Tasks
Figure 4 for One-Shot Hierarchical Imitation Learning of Compound Visuomotor Tasks
Viaarxiv icon

High-Dimensional Continuous Control Using Generalized Advantage Estimation

Oct 20, 2018
John Schulman, Philipp Moritz, Sergey Levine, Michael Jordan, Pieter Abbeel

Figure 1 for High-Dimensional Continuous Control Using Generalized Advantage Estimation
Figure 2 for High-Dimensional Continuous Control Using Generalized Advantage Estimation
Figure 3 for High-Dimensional Continuous Control Using Generalized Advantage Estimation
Figure 4 for High-Dimensional Continuous Control Using Generalized Advantage Estimation
Viaarxiv icon

Enabling Robots to Communicate their Objectives

Oct 18, 2018
Sandy H. Huang, David Held, Pieter Abbeel, Anca D. Dragan

Figure 1 for Enabling Robots to Communicate their Objectives
Figure 2 for Enabling Robots to Communicate their Objectives
Figure 3 for Enabling Robots to Communicate their Objectives
Figure 4 for Enabling Robots to Communicate their Objectives
Viaarxiv icon

Establishing Appropriate Trust via Critical States

Oct 18, 2018
Sandy H. Huang, Kush Bhatia, Pieter Abbeel, Anca D. Dragan

Figure 1 for Establishing Appropriate Trust via Critical States
Figure 2 for Establishing Appropriate Trust via Critical States
Figure 3 for Establishing Appropriate Trust via Critical States
Figure 4 for Establishing Appropriate Trust via Critical States
Viaarxiv icon

ProMP: Proximal Meta-Policy Search

Oct 17, 2018
Jonas Rothfuss, Dennis Lee, Ignasi Clavera, Tamim Asfour, Pieter Abbeel

Figure 1 for ProMP: Proximal Meta-Policy Search
Figure 2 for ProMP: Proximal Meta-Policy Search
Figure 3 for ProMP: Proximal Meta-Policy Search
Viaarxiv icon

Composable Action-Conditioned Predictors: Flexible Off-Policy Learning for Robot Navigation

Oct 16, 2018
Gregory Kahn, Adam Villaflor, Pieter Abbeel, Sergey Levine

Figure 1 for Composable Action-Conditioned Predictors: Flexible Off-Policy Learning for Robot Navigation
Figure 2 for Composable Action-Conditioned Predictors: Flexible Off-Policy Learning for Robot Navigation
Figure 3 for Composable Action-Conditioned Predictors: Flexible Off-Policy Learning for Robot Navigation
Figure 4 for Composable Action-Conditioned Predictors: Flexible Off-Policy Learning for Robot Navigation
Viaarxiv icon

SFV: Reinforcement Learning of Physical Skills from Videos

Oct 15, 2018
Xue Bin Peng, Angjoo Kanazawa, Jitendra Malik, Pieter Abbeel, Sergey Levine

Figure 1 for SFV: Reinforcement Learning of Physical Skills from Videos
Figure 2 for SFV: Reinforcement Learning of Physical Skills from Videos
Figure 3 for SFV: Reinforcement Learning of Physical Skills from Videos
Figure 4 for SFV: Reinforcement Learning of Physical Skills from Videos
Viaarxiv icon

Equivalence Between Policy Gradients and Soft Q-Learning

Oct 14, 2018
John Schulman, Xi Chen, Pieter Abbeel

Figure 1 for Equivalence Between Policy Gradients and Soft Q-Learning
Figure 2 for Equivalence Between Policy Gradients and Soft Q-Learning
Figure 3 for Equivalence Between Policy Gradients and Soft Q-Learning
Viaarxiv icon