Picture for Shane Legg

Shane Legg

Shaking the foundations: delusions in sequence models for interaction and control

Add code
Oct 20, 2021
Figure 1 for Shaking the foundations: delusions in sequence models for interaction and control
Figure 2 for Shaking the foundations: delusions in sequence models for interaction and control
Figure 3 for Shaking the foundations: delusions in sequence models for interaction and control
Figure 4 for Shaking the foundations: delusions in sequence models for interaction and control
Viaarxiv icon

Causal Analysis of Agent Behavior for AI Safety

Add code
Mar 05, 2021
Figure 1 for Causal Analysis of Agent Behavior for AI Safety
Figure 2 for Causal Analysis of Agent Behavior for AI Safety
Figure 3 for Causal Analysis of Agent Behavior for AI Safety
Figure 4 for Causal Analysis of Agent Behavior for AI Safety
Viaarxiv icon

Agent Incentives: A Causal Perspective

Add code
Feb 02, 2021
Figure 1 for Agent Incentives: A Causal Perspective
Figure 2 for Agent Incentives: A Causal Perspective
Figure 3 for Agent Incentives: A Causal Perspective
Figure 4 for Agent Incentives: A Causal Perspective
Viaarxiv icon

Avoiding Tampering Incentives in Deep RL via Decoupled Approval

Add code
Nov 17, 2020
Figure 1 for Avoiding Tampering Incentives in Deep RL via Decoupled Approval
Figure 2 for Avoiding Tampering Incentives in Deep RL via Decoupled Approval
Figure 3 for Avoiding Tampering Incentives in Deep RL via Decoupled Approval
Figure 4 for Avoiding Tampering Incentives in Deep RL via Decoupled Approval
Viaarxiv icon

REALab: An Embedded Perspective on Tampering

Add code
Nov 17, 2020
Figure 1 for REALab: An Embedded Perspective on Tampering
Figure 2 for REALab: An Embedded Perspective on Tampering
Figure 3 for REALab: An Embedded Perspective on Tampering
Figure 4 for REALab: An Embedded Perspective on Tampering
Viaarxiv icon

Algorithms for Causal Reasoning in Probability Trees

Add code
Nov 12, 2020
Figure 1 for Algorithms for Causal Reasoning in Probability Trees
Figure 2 for Algorithms for Causal Reasoning in Probability Trees
Figure 3 for Algorithms for Causal Reasoning in Probability Trees
Figure 4 for Algorithms for Causal Reasoning in Probability Trees
Viaarxiv icon

Meta-trained agents implement Bayes-optimal agents

Add code
Oct 21, 2020
Figure 1 for Meta-trained agents implement Bayes-optimal agents
Figure 2 for Meta-trained agents implement Bayes-optimal agents
Figure 3 for Meta-trained agents implement Bayes-optimal agents
Figure 4 for Meta-trained agents implement Bayes-optimal agents
Viaarxiv icon

Avoiding Side Effects By Considering Future Tasks

Add code
Oct 15, 2020
Figure 1 for Avoiding Side Effects By Considering Future Tasks
Figure 2 for Avoiding Side Effects By Considering Future Tasks
Figure 3 for Avoiding Side Effects By Considering Future Tasks
Figure 4 for Avoiding Side Effects By Considering Future Tasks
Viaarxiv icon

Quantifying Differences in Reward Functions

Add code
Jun 24, 2020
Figure 1 for Quantifying Differences in Reward Functions
Figure 2 for Quantifying Differences in Reward Functions
Figure 3 for Quantifying Differences in Reward Functions
Figure 4 for Quantifying Differences in Reward Functions
Viaarxiv icon

Pitfalls of learning a reward function online

Add code
Apr 28, 2020
Figure 1 for Pitfalls of learning a reward function online
Figure 2 for Pitfalls of learning a reward function online
Figure 3 for Pitfalls of learning a reward function online
Figure 4 for Pitfalls of learning a reward function online
Viaarxiv icon