Alert button
Picture for Shane Legg

Shane Legg

Alert button

Measuring and avoiding side effects using relative reachability

Add code
Bookmark button
Alert button
Jun 04, 2018
Victoria Krakovna, Laurent Orseau, Miljan Martic, Shane Legg

Figure 1 for Measuring and avoiding side effects using relative reachability
Figure 2 for Measuring and avoiding side effects using relative reachability
Figure 3 for Measuring and avoiding side effects using relative reachability
Figure 4 for Measuring and avoiding side effects using relative reachability
Viaarxiv icon

Agents and Devices: A Relative Definition of Agency

Add code
Bookmark button
Alert button
May 31, 2018
Laurent Orseau, Simon McGregor McGill, Shane Legg

Figure 1 for Agents and Devices: A Relative Definition of Agency
Figure 2 for Agents and Devices: A Relative Definition of Agency
Figure 3 for Agents and Devices: A Relative Definition of Agency
Figure 4 for Agents and Devices: A Relative Definition of Agency
Viaarxiv icon

Noisy Networks for Exploration

Add code
Bookmark button
Alert button
Feb 15, 2018
Meire Fortunato, Mohammad Gheshlaghi Azar, Bilal Piot, Jacob Menick, Ian Osband, Alex Graves, Vlad Mnih, Remi Munos, Demis Hassabis, Olivier Pietquin, Charles Blundell, Shane Legg

Figure 1 for Noisy Networks for Exploration
Figure 2 for Noisy Networks for Exploration
Figure 3 for Noisy Networks for Exploration
Figure 4 for Noisy Networks for Exploration
Viaarxiv icon

Psychlab: A Psychology Laboratory for Deep Reinforcement Learning Agents

Add code
Bookmark button
Alert button
Feb 04, 2018
Joel Z. Leibo, Cyprien de Masson d'Autume, Daniel Zoran, David Amos, Charles Beattie, Keith Anderson, Antonio García Castañeda, Manuel Sanchez, Simon Green, Audrunas Gruslys, Shane Legg, Demis Hassabis, Matthew M. Botvinick

Figure 1 for Psychlab: A Psychology Laboratory for Deep Reinforcement Learning Agents
Figure 2 for Psychlab: A Psychology Laboratory for Deep Reinforcement Learning Agents
Figure 3 for Psychlab: A Psychology Laboratory for Deep Reinforcement Learning Agents
Figure 4 for Psychlab: A Psychology Laboratory for Deep Reinforcement Learning Agents
Viaarxiv icon

AI Safety Gridworlds

Add code
Bookmark button
Alert button
Nov 28, 2017
Jan Leike, Miljan Martic, Victoria Krakovna, Pedro A. Ortega, Tom Everitt, Andrew Lefrancq, Laurent Orseau, Shane Legg

Figure 1 for AI Safety Gridworlds
Figure 2 for AI Safety Gridworlds
Figure 3 for AI Safety Gridworlds
Figure 4 for AI Safety Gridworlds
Viaarxiv icon

Reinforcement Learning with a Corrupted Reward Channel

Add code
Bookmark button
Alert button
Aug 19, 2017
Tom Everitt, Victoria Krakovna, Laurent Orseau, Marcus Hutter, Shane Legg

Figure 1 for Reinforcement Learning with a Corrupted Reward Channel
Figure 2 for Reinforcement Learning with a Corrupted Reward Channel
Figure 3 for Reinforcement Learning with a Corrupted Reward Channel
Figure 4 for Reinforcement Learning with a Corrupted Reward Channel
Viaarxiv icon

Deep reinforcement learning from human preferences

Add code
Bookmark button
Alert button
Jul 13, 2017
Paul Christiano, Jan Leike, Tom B. Brown, Miljan Martic, Shane Legg, Dario Amodei

Figure 1 for Deep reinforcement learning from human preferences
Figure 2 for Deep reinforcement learning from human preferences
Figure 3 for Deep reinforcement learning from human preferences
Figure 4 for Deep reinforcement learning from human preferences
Viaarxiv icon

DeepMind Lab

Add code
Bookmark button
Alert button
Dec 13, 2016
Charles Beattie, Joel Z. Leibo, Denis Teplyashin, Tom Ward, Marcus Wainwright, Heinrich Küttler, Andrew Lefrancq, Simon Green, Víctor Valdés, Amir Sadik, Julian Schrittwieser, Keith Anderson, Sarah York, Max Cant, Adam Cain, Adrian Bolton, Stephen Gaffney, Helen King, Demis Hassabis, Shane Legg, Stig Petersen

Figure 1 for DeepMind Lab
Figure 2 for DeepMind Lab
Figure 3 for DeepMind Lab
Figure 4 for DeepMind Lab
Viaarxiv icon

Massively Parallel Methods for Deep Reinforcement Learning

Add code
Bookmark button
Alert button
Jul 16, 2015
Arun Nair, Praveen Srinivasan, Sam Blackwell, Cagdas Alcicek, Rory Fearon, Alessandro De Maria, Vedavyas Panneershelvam, Mustafa Suleyman, Charles Beattie, Stig Petersen, Shane Legg, Volodymyr Mnih, Koray Kavukcuoglu, David Silver

Figure 1 for Massively Parallel Methods for Deep Reinforcement Learning
Figure 2 for Massively Parallel Methods for Deep Reinforcement Learning
Figure 3 for Massively Parallel Methods for Deep Reinforcement Learning
Figure 4 for Massively Parallel Methods for Deep Reinforcement Learning
Viaarxiv icon

An Approximation of the Universal Intelligence Measure

Add code
Bookmark button
Alert button
Sep 29, 2011
Shane Legg, Joel Veness

Figure 1 for An Approximation of the Universal Intelligence Measure
Figure 2 for An Approximation of the Universal Intelligence Measure
Figure 3 for An Approximation of the Universal Intelligence Measure
Figure 4 for An Approximation of the Universal Intelligence Measure
Viaarxiv icon