Sergey Levine

Stanford University

Dynamical Distance Learning for Unsupervised and Semi-Supervised Skill Discovery

Jul 18, 2019
Via arXiv

Dynamics-Aware Unsupervised Discovery of Skills

Jul 02, 2019
Via arXiv

Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model

Jul 01, 2019
Via arXiv

Reinforcement Learning with Competitive Ensembles of Information-Constrained Primitives

Jun 25, 2019
Via arXiv

Off-Policy Evaluation via Off-Policy Classification

Jun 20, 2019
Via arXiv

When to Trust Your Model: Model-Based Policy Optimization

Jun 19, 2019
Via arXiv

SQIL: Imitation Learning via Regularized Behavioral Cloning

Jun 14, 2019
Via arXiv

Deep Reinforcement Learning for Industrial Insertion Tasks with Visual Inputs and Natural Rewards

Jun 13, 2019
Via arXiv

Efficient Exploration via State Marginal Matching

Jun 12, 2019
Via arXiv

Search on the Replay Buffer: Bridging Planning and Reinforcement Learning

Jun 12, 2019
Via arXiv