Pieter Abbeel

Some Considerations on Learning to Explore via Meta-Reinforcement Learning

Mar 03, 2018
Bradly C. Stadie, Ge Yang, Rein Houthooft, Xi Chen, Yan Duan, Yuhuai Wu, Pieter Abbeel, Ilya Sutskever

Sim-to-Real Transfer of Robotic Control with Dynamics Randomization

Mar 03, 2018
Xue Bin Peng, Marcin Andrychowicz, Wojciech Zaremba, Pieter Abbeel

Overcoming Exploration in Reinforcement Learning with Demonstrations

Feb 25, 2018
Ashvin Nair, Bob McGrew, Marcin Andrychowicz, Wojciech Zaremba, Pieter Abbeel

A Simple Neural Attentive Meta-Learner

Feb 25, 2018
Nikhil Mishra, Mostafa Rohaninejad, Xi Chen, Pieter Abbeel

Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments

Feb 23, 2018
Maruan Al-Shedivat, Trapit Bansal, Yuri Burda, Ilya Sutskever, Igor Mordatch, Pieter Abbeel

Hindsight Experience Replay

Feb 23, 2018
Marcin Andrychowicz, Filip Wolski, Alex Ray, Jonas Schneider, Rachel Fong, Peter Welinder, Bob McGrew, Josh Tobin, Pieter Abbeel, Wojciech Zaremba

Meta-Reinforcement Learning of Structured Exploration Strategies

Feb 20, 2018
Abhishek Gupta, Russell Mendonca, YuXuan Liu, Pieter Abbeel, Sergey Levine

Interpretable and Pedagogical Examples

Feb 14, 2018
Smitha Milli, Pieter Abbeel, Igor Mordatch

One-Shot Imitation from Observing Humans via Domain-Adaptive Meta-Learning

Feb 05, 2018
Tianhe Yu, Chelsea Finn, Annie Xie, Sudeep Dasari, Tianhao Zhang, Pieter Abbeel, Sergey Levine

Parameter Space Noise for Exploration

Jan 31, 2018
Matthias Plappert, Rein Houthooft, Prafulla Dhariwal, Szymon Sidor, Richard Y. Chen, Xi Chen, Tamim Asfour, Pieter Abbeel, Marcin Andrychowicz
