Shane Legg

Agent Incentives: A Causal Perspective

Feb 02, 2021
Tom Everitt, Ryan Carey, Eric Langlois, Pedro A. Ortega, Shane Legg

Via arXiv

Avoiding Tampering Incentives in Deep RL via Decoupled Approval

Nov 17, 2020
Jonathan Uesato, Ramana Kumar, Victoria Krakovna, Tom Everitt, Richard Ngo, Shane Legg

Via arXiv

REALab: An Embedded Perspective on Tampering

Nov 17, 2020
Ramana Kumar, Jonathan Uesato, Richard Ngo, Tom Everitt, Victoria Krakovna, Shane Legg

Via arXiv

Algorithms for Causal Reasoning in Probability Trees

Nov 12, 2020
Tim Genewein, Tom McGrath, Grégoire Delétang, Vladimir Mikulik, Miljan Martic, Shane Legg, Pedro A. Ortega

Via arXiv

Meta-trained agents implement Bayes-optimal agents

Oct 21, 2020
Vladimir Mikulik, Grégoire Delétang, Tom McGrath, Tim Genewein, Miljan Martic, Shane Legg, Pedro A. Ortega

Via arXiv

Avoiding Side Effects By Considering Future Tasks

Oct 15, 2020
Victoria Krakovna, Laurent Orseau, Richard Ngo, Miljan Martic, Shane Legg

Via arXiv

Quantifying Differences in Reward Functions

Jun 24, 2020
Adam Gleave, Michael Dennis, Shane Legg, Stuart Russell, Jan Leike

Via arXiv

Pitfalls of learning a reward function online

Apr 28, 2020
Stuart Armstrong, Jan Leike, Laurent Orseau, Shane Legg

Via arXiv

The Incentives that Shape Behaviour

Jan 20, 2020
Ryan Carey, Eric Langlois, Tom Everitt, Shane Legg

Via arXiv