Alert button
Picture for Tom Everitt

Tom Everitt

Alert button

Reward Tampering Problems and Solutions in Reinforcement Learning: A Causal Influence Diagram Perspective

Add code
Bookmark button
Alert button
Aug 20, 2019
Tom Everitt, Marcus Hutter

Figure 1 for Reward Tampering Problems and Solutions in Reinforcement Learning: A Causal Influence Diagram Perspective
Figure 2 for Reward Tampering Problems and Solutions in Reinforcement Learning: A Causal Influence Diagram Perspective
Figure 3 for Reward Tampering Problems and Solutions in Reinforcement Learning: A Causal Influence Diagram Perspective
Figure 4 for Reward Tampering Problems and Solutions in Reinforcement Learning: A Causal Influence Diagram Perspective
Viaarxiv icon

Modeling AGI Safety Frameworks with Causal Influence Diagrams

Add code
Bookmark button
Alert button
Jun 20, 2019
Tom Everitt, Ramana Kumar, Victoria Krakovna, Shane Legg

Figure 1 for Modeling AGI Safety Frameworks with Causal Influence Diagrams
Figure 2 for Modeling AGI Safety Frameworks with Causal Influence Diagrams
Figure 3 for Modeling AGI Safety Frameworks with Causal Influence Diagrams
Figure 4 for Modeling AGI Safety Frameworks with Causal Influence Diagrams
Viaarxiv icon

Understanding Agent Incentives using Causal Influence Diagrams. Part I: Single Action Settings

Add code
Bookmark button
Alert button
Mar 12, 2019
Tom Everitt, Pedro A. Ortega, Elizabeth Barnes, Shane Legg

Figure 1 for Understanding Agent Incentives using Causal Influence Diagrams. Part I: Single Action Settings
Figure 2 for Understanding Agent Incentives using Causal Influence Diagrams. Part I: Single Action Settings
Figure 3 for Understanding Agent Incentives using Causal Influence Diagrams. Part I: Single Action Settings
Figure 4 for Understanding Agent Incentives using Causal Influence Diagrams. Part I: Single Action Settings
Viaarxiv icon

Scalable agent alignment via reward modeling: a research direction

Add code
Bookmark button
Alert button
Nov 19, 2018
Jan Leike, David Krueger, Tom Everitt, Miljan Martic, Vishal Maini, Shane Legg

Figure 1 for Scalable agent alignment via reward modeling: a research direction
Figure 2 for Scalable agent alignment via reward modeling: a research direction
Figure 3 for Scalable agent alignment via reward modeling: a research direction
Figure 4 for Scalable agent alignment via reward modeling: a research direction
Viaarxiv icon

AGI Safety Literature Review

Add code
Bookmark button
Alert button
May 21, 2018
Tom Everitt, Gary Lea, Marcus Hutter

Figure 1 for AGI Safety Literature Review
Viaarxiv icon

A Topological Approach to Meta-heuristics: Analytical Results on the BFS vs. DFS Algorithm Selection Problem

Add code
Bookmark button
Alert button
Apr 12, 2018
Tom Everitt, Marcus Hutter

Figure 1 for A Topological Approach to Meta-heuristics: Analytical Results on the BFS vs. DFS Algorithm Selection Problem
Figure 2 for A Topological Approach to Meta-heuristics: Analytical Results on the BFS vs. DFS Algorithm Selection Problem
Figure 3 for A Topological Approach to Meta-heuristics: Analytical Results on the BFS vs. DFS Algorithm Selection Problem
Figure 4 for A Topological Approach to Meta-heuristics: Analytical Results on the BFS vs. DFS Algorithm Selection Problem
Viaarxiv icon

AI Safety Gridworlds

Add code
Bookmark button
Alert button
Nov 28, 2017
Jan Leike, Miljan Martic, Victoria Krakovna, Pedro A. Ortega, Tom Everitt, Andrew Lefrancq, Laurent Orseau, Shane Legg

Figure 1 for AI Safety Gridworlds
Figure 2 for AI Safety Gridworlds
Figure 3 for AI Safety Gridworlds
Figure 4 for AI Safety Gridworlds
Viaarxiv icon

Reinforcement Learning with a Corrupted Reward Channel

Add code
Bookmark button
Alert button
Aug 19, 2017
Tom Everitt, Victoria Krakovna, Laurent Orseau, Marcus Hutter, Shane Legg

Figure 1 for Reinforcement Learning with a Corrupted Reward Channel
Figure 2 for Reinforcement Learning with a Corrupted Reward Channel
Figure 3 for Reinforcement Learning with a Corrupted Reward Channel
Figure 4 for Reinforcement Learning with a Corrupted Reward Channel
Viaarxiv icon

Count-Based Exploration in Feature Space for Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 25, 2017
Jarryd Martin, Suraj Narayanan Sasikumar, Tom Everitt, Marcus Hutter

Figure 1 for Count-Based Exploration in Feature Space for Reinforcement Learning
Figure 2 for Count-Based Exploration in Feature Space for Reinforcement Learning
Viaarxiv icon

Free Lunch for Optimisation under the Universal Distribution

Add code
Bookmark button
Alert button
Aug 16, 2016
Tom Everitt, Tor Lattimore, Marcus Hutter

Figure 1 for Free Lunch for Optimisation under the Universal Distribution
Viaarxiv icon