Alert button
Picture for Pieter Abbeel

Pieter Abbeel

Alert button

Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments

Add code
Bookmark button
Alert button
Jan 16, 2018
Ryan Lowe, Yi Wu, Aviv Tamar, Jean Harb, Pieter Abbeel, Igor Mordatch

Figure 1 for Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments
Figure 2 for Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments
Figure 3 for Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments
Figure 4 for Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments
Viaarxiv icon

PixelSNAIL: An Improved Autoregressive Generative Model

Add code
Bookmark button
Alert button
Dec 28, 2017
Xi Chen, Nikhil Mishra, Mostafa Rohaninejad, Pieter Abbeel

Figure 1 for PixelSNAIL: An Improved Autoregressive Generative Model
Figure 2 for PixelSNAIL: An Improved Autoregressive Generative Model
Figure 3 for PixelSNAIL: An Improved Autoregressive Generative Model
Figure 4 for PixelSNAIL: An Improved Autoregressive Generative Model
Viaarxiv icon

A Berkeley View of Systems Challenges for AI

Add code
Bookmark button
Alert button
Dec 15, 2017
Ion Stoica, Dawn Song, Raluca Ada Popa, David Patterson, Michael W. Mahoney, Randy Katz, Anthony D. Joseph, Michael Jordan, Joseph M. Hellerstein, Joseph E. Gonzalez, Ken Goldberg, Ali Ghodsi, David Culler, Pieter Abbeel

Figure 1 for A Berkeley View of Systems Challenges for AI
Viaarxiv icon

#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning

Add code
Bookmark button
Alert button
Dec 05, 2017
Haoran Tang, Rein Houthooft, Davis Foote, Adam Stooke, Xi Chen, Yan Duan, John Schulman, Filip De Turck, Pieter Abbeel

Figure 1 for #Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning
Figure 2 for #Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning
Figure 3 for #Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning
Figure 4 for #Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning
Viaarxiv icon

One-Shot Imitation Learning

Add code
Bookmark button
Alert button
Dec 04, 2017
Yan Duan, Marcin Andrychowicz, Bradly C. Stadie, Jonathan Ho, Jonas Schneider, Ilya Sutskever, Pieter Abbeel, Wojciech Zaremba

Figure 1 for One-Shot Imitation Learning
Figure 2 for One-Shot Imitation Learning
Figure 3 for One-Shot Imitation Learning
Figure 4 for One-Shot Imitation Learning
Viaarxiv icon

Inverse Reward Design

Add code
Bookmark button
Alert button
Nov 08, 2017
Dylan Hadfield-Menell, Smitha Milli, Pieter Abbeel, Stuart Russell, Anca Dragan

Figure 1 for Inverse Reward Design
Figure 2 for Inverse Reward Design
Figure 3 for Inverse Reward Design
Figure 4 for Inverse Reward Design
Viaarxiv icon

UCB Exploration via Q-Ensembles

Add code
Bookmark button
Alert button
Nov 07, 2017
Richard Y. Chen, Szymon Sidor, Pieter Abbeel, John Schulman

Figure 1 for UCB Exploration via Q-Ensembles
Figure 2 for UCB Exploration via Q-Ensembles
Figure 3 for UCB Exploration via Q-Ensembles
Figure 4 for UCB Exploration via Q-Ensembles
Viaarxiv icon

Meta Learning Shared Hierarchies

Add code
Bookmark button
Alert button
Oct 26, 2017
Kevin Frans, Jonathan Ho, Xi Chen, Pieter Abbeel, John Schulman

Figure 1 for Meta Learning Shared Hierarchies
Figure 2 for Meta Learning Shared Hierarchies
Figure 3 for Meta Learning Shared Hierarchies
Figure 4 for Meta Learning Shared Hierarchies
Viaarxiv icon

Asymmetric Actor Critic for Image-Based Robot Learning

Add code
Bookmark button
Alert button
Oct 18, 2017
Lerrel Pinto, Marcin Andrychowicz, Peter Welinder, Wojciech Zaremba, Pieter Abbeel

Figure 1 for Asymmetric Actor Critic for Image-Based Robot Learning
Figure 2 for Asymmetric Actor Critic for Image-Based Robot Learning
Figure 3 for Asymmetric Actor Critic for Image-Based Robot Learning
Figure 4 for Asymmetric Actor Critic for Image-Based Robot Learning
Viaarxiv icon

Synkhronos: a Multi-GPU Theano Extension for Data Parallelism

Add code
Bookmark button
Alert button
Oct 11, 2017
Adam Stooke, Pieter Abbeel

Figure 1 for Synkhronos: a Multi-GPU Theano Extension for Data Parallelism
Figure 2 for Synkhronos: a Multi-GPU Theano Extension for Data Parallelism
Figure 3 for Synkhronos: a Multi-GPU Theano Extension for Data Parallelism
Figure 4 for Synkhronos: a Multi-GPU Theano Extension for Data Parallelism
Viaarxiv icon