Shane Legg

IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures

Jun 28, 2018

Measuring and avoiding side effects using relative reachability

Jun 04, 2018

Agents and Devices: A Relative Definition of Agency

May 31, 2018

Noisy Networks for Exploration

Feb 15, 2018

Psychlab: A Psychology Laboratory for Deep Reinforcement Learning Agents

Feb 04, 2018

AI Safety Gridworlds

Nov 28, 2017

Reinforcement Learning with a Corrupted Reward Channel

Aug 19, 2017

Deep reinforcement learning from human preferences

Jul 13, 2017

DeepMind Lab

Dec 13, 2016

Massively Parallel Methods for Deep Reinforcement Learning

Jul 16, 2015