Picture for Sergey Levine

Sergey Levine

Stanford University

Why Does Hierarchy Work So Well in Reinforcement Learning?

Add code
Sep 23, 2019
Figure 1 for Why Does Hierarchy  Work So Well in Reinforcement Learning?
Figure 2 for Why Does Hierarchy  Work So Well in Reinforcement Learning?
Figure 3 for Why Does Hierarchy  Work So Well in Reinforcement Learning?
Figure 4 for Why Does Hierarchy  Work So Well in Reinforcement Learning?
Viaarxiv icon

Scaled Autonomy: Enabling Human Operators to Control Robot Fleets

Add code
Sep 22, 2019
Figure 1 for Scaled Autonomy: Enabling Human Operators to Control Robot Fleets
Figure 2 for Scaled Autonomy: Enabling Human Operators to Control Robot Fleets
Figure 3 for Scaled Autonomy: Enabling Human Operators to Control Robot Fleets
Figure 4 for Scaled Autonomy: Enabling Human Operators to Control Robot Fleets
Viaarxiv icon

Meta-Learning with Implicit Gradients

Add code
Sep 10, 2019
Figure 1 for Meta-Learning with Implicit Gradients
Figure 2 for Meta-Learning with Implicit Gradients
Figure 3 for Meta-Learning with Implicit Gradients
Figure 4 for Meta-Learning with Implicit Gradients
Viaarxiv icon

Dynamical Distance Learning for Unsupervised and Semi-Supervised Skill Discovery

Add code
Jul 18, 2019
Figure 1 for Dynamical Distance Learning for Unsupervised and Semi-Supervised Skill Discovery
Figure 2 for Dynamical Distance Learning for Unsupervised and Semi-Supervised Skill Discovery
Figure 3 for Dynamical Distance Learning for Unsupervised and Semi-Supervised Skill Discovery
Figure 4 for Dynamical Distance Learning for Unsupervised and Semi-Supervised Skill Discovery
Viaarxiv icon

Dynamics-Aware Unsupervised Discovery of Skills

Add code
Jul 02, 2019
Figure 1 for Dynamics-Aware Unsupervised Discovery of Skills
Figure 2 for Dynamics-Aware Unsupervised Discovery of Skills
Figure 3 for Dynamics-Aware Unsupervised Discovery of Skills
Figure 4 for Dynamics-Aware Unsupervised Discovery of Skills
Viaarxiv icon

Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model

Add code
Jul 01, 2019
Figure 1 for Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model
Figure 2 for Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model
Figure 3 for Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model
Figure 4 for Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model
Viaarxiv icon

Reinforcement Learning with Competitive Ensembles of Information-Constrained Primitives

Add code
Jun 25, 2019
Figure 1 for Reinforcement Learning with Competitive Ensembles of Information-Constrained Primitives
Figure 2 for Reinforcement Learning with Competitive Ensembles of Information-Constrained Primitives
Figure 3 for Reinforcement Learning with Competitive Ensembles of Information-Constrained Primitives
Figure 4 for Reinforcement Learning with Competitive Ensembles of Information-Constrained Primitives
Viaarxiv icon

Off-Policy Evaluation via Off-Policy Classification

Add code
Jun 20, 2019
Figure 1 for Off-Policy Evaluation via Off-Policy Classification
Figure 2 for Off-Policy Evaluation via Off-Policy Classification
Figure 3 for Off-Policy Evaluation via Off-Policy Classification
Figure 4 for Off-Policy Evaluation via Off-Policy Classification
Viaarxiv icon

When to Trust Your Model: Model-Based Policy Optimization

Add code
Jun 19, 2019
Figure 1 for When to Trust Your Model: Model-Based Policy Optimization
Figure 2 for When to Trust Your Model: Model-Based Policy Optimization
Figure 3 for When to Trust Your Model: Model-Based Policy Optimization
Figure 4 for When to Trust Your Model: Model-Based Policy Optimization
Viaarxiv icon

SQIL: Imitation Learning via Regularized Behavioral Cloning

Add code
Jun 14, 2019
Figure 1 for SQIL: Imitation Learning via Regularized Behavioral Cloning
Figure 2 for SQIL: Imitation Learning via Regularized Behavioral Cloning
Figure 3 for SQIL: Imitation Learning via Regularized Behavioral Cloning
Figure 4 for SQIL: Imitation Learning via Regularized Behavioral Cloning
Viaarxiv icon