Picture for Pieter Abbeel

Pieter Abbeel

UC Berkeley

Automatic Goal Generation for Reinforcement Learning Agents

Add code
Jul 23, 2018
Figure 1 for Automatic Goal Generation for Reinforcement Learning Agents
Figure 2 for Automatic Goal Generation for Reinforcement Learning Agents
Figure 3 for Automatic Goal Generation for Reinforcement Learning Agents
Figure 4 for Automatic Goal Generation for Reinforcement Learning Agents
Viaarxiv icon

Imitation from Observation: Learning to Imitate Behaviors from Raw Video via Context Translation

Add code
Jun 18, 2018
Figure 1 for Imitation from Observation: Learning to Imitate Behaviors from Raw Video via Context Translation
Figure 2 for Imitation from Observation: Learning to Imitate Behaviors from Raw Video via Context Translation
Figure 3 for Imitation from Observation: Learning to Imitate Behaviors from Raw Video via Context Translation
Figure 4 for Imitation from Observation: Learning to Imitate Behaviors from Raw Video via Context Translation
Viaarxiv icon

Self-Consistent Trajectory Autoencoder: Hierarchical Reinforcement Learning with Trajectory Embeddings

Add code
Jun 07, 2018
Figure 1 for Self-Consistent Trajectory Autoencoder: Hierarchical Reinforcement Learning with Trajectory Embeddings
Figure 2 for Self-Consistent Trajectory Autoencoder: Hierarchical Reinforcement Learning with Trajectory Embeddings
Figure 3 for Self-Consistent Trajectory Autoencoder: Hierarchical Reinforcement Learning with Trajectory Embeddings
Figure 4 for Self-Consistent Trajectory Autoencoder: Hierarchical Reinforcement Learning with Trajectory Embeddings
Viaarxiv icon

Self-supervised Deep Reinforcement Learning with Generalized Computation Graphs for Robot Navigation

Add code
May 17, 2018
Figure 1 for Self-supervised Deep Reinforcement Learning with Generalized Computation Graphs for Robot Navigation
Figure 2 for Self-supervised Deep Reinforcement Learning with Generalized Computation Graphs for Robot Navigation
Figure 3 for Self-supervised Deep Reinforcement Learning with Generalized Computation Graphs for Robot Navigation
Figure 4 for Self-supervised Deep Reinforcement Learning with Generalized Computation Graphs for Robot Navigation
Viaarxiv icon

Evolved Policy Gradients

Add code
Apr 29, 2018
Figure 1 for Evolved Policy Gradients
Figure 2 for Evolved Policy Gradients
Figure 3 for Evolved Policy Gradients
Figure 4 for Evolved Policy Gradients
Viaarxiv icon

The Limits and Potentials of Deep Learning for Robotics

Add code
Apr 18, 2018
Figure 1 for The Limits and Potentials of Deep Learning for Robotics
Figure 2 for The Limits and Potentials of Deep Learning for Robotics
Figure 3 for The Limits and Potentials of Deep Learning for Robotics
Figure 4 for The Limits and Potentials of Deep Learning for Robotics
Viaarxiv icon

Stochastic Adversarial Video Prediction

Add code
Apr 04, 2018
Figure 1 for Stochastic Adversarial Video Prediction
Figure 2 for Stochastic Adversarial Video Prediction
Figure 3 for Stochastic Adversarial Video Prediction
Figure 4 for Stochastic Adversarial Video Prediction
Viaarxiv icon

Universal Planning Networks

Add code
Apr 04, 2018
Figure 1 for Universal Planning Networks
Figure 2 for Universal Planning Networks
Figure 3 for Universal Planning Networks
Figure 4 for Universal Planning Networks
Viaarxiv icon

Domain Randomization and Generative Models for Robotic Grasping

Add code
Apr 03, 2018
Figure 1 for Domain Randomization and Generative Models for Robotic Grasping
Figure 2 for Domain Randomization and Generative Models for Robotic Grasping
Figure 3 for Domain Randomization and Generative Models for Robotic Grasping
Figure 4 for Domain Randomization and Generative Models for Robotic Grasping
Viaarxiv icon

Variance Reduction for Policy Gradient with Action-Dependent Factorized Baselines

Add code
Mar 20, 2018
Figure 1 for Variance Reduction for Policy Gradient with Action-Dependent Factorized Baselines
Figure 2 for Variance Reduction for Policy Gradient with Action-Dependent Factorized Baselines
Figure 3 for Variance Reduction for Policy Gradient with Action-Dependent Factorized Baselines
Figure 4 for Variance Reduction for Policy Gradient with Action-Dependent Factorized Baselines
Viaarxiv icon