Alert button
Picture for Tom Everitt

Tom Everitt

Alert button

Robust agents learn causal world models

Feb 26, 2024
Jonathan Richens, Tom Everitt

Viaarxiv icon

The Reasons that Agents Act: Intention and Instrumental Goals

Feb 15, 2024
Francis Rhys Ward, Matt MacDermott, Francesco Belardinelli, Francesca Toni, Tom Everitt

Viaarxiv icon

Honesty Is the Best Policy: Defining and Mitigating AI Deception

Dec 03, 2023
Francis Rhys Ward, Francesco Belardinelli, Francesca Toni, Tom Everitt

Viaarxiv icon

Characterising Decision Theories with Mechanised Causal Graphs

Jul 20, 2023
Matt MacDermott, Tom Everitt, Francesco Belardinelli

Figure 1 for Characterising Decision Theories with Mechanised Causal Graphs
Figure 2 for Characterising Decision Theories with Mechanised Causal Graphs
Figure 3 for Characterising Decision Theories with Mechanised Causal Graphs
Figure 4 for Characterising Decision Theories with Mechanised Causal Graphs
Viaarxiv icon

Human Control: Definitions and Algorithms

May 31, 2023
Ryan Carey, Tom Everitt

Figure 1 for Human Control: Definitions and Algorithms
Figure 2 for Human Control: Definitions and Algorithms
Figure 3 for Human Control: Definitions and Algorithms
Figure 4 for Human Control: Definitions and Algorithms
Viaarxiv icon

Reasoning about Causality in Games

Jan 05, 2023
Lewis Hammond, James Fox, Tom Everitt, Ryan Carey, Alessandro Abate, Michael Wooldridge

Figure 1 for Reasoning about Causality in Games
Figure 2 for Reasoning about Causality in Games
Figure 3 for Reasoning about Causality in Games
Figure 4 for Reasoning about Causality in Games
Viaarxiv icon

Discovering Agents

Aug 24, 2022
Zachary Kenton, Ramana Kumar, Sebastian Farquhar, Jonathan Richens, Matt MacDermott, Tom Everitt

Figure 1 for Discovering Agents
Figure 2 for Discovering Agents
Figure 3 for Discovering Agents
Figure 4 for Discovering Agents
Viaarxiv icon

Path-Specific Objectives for Safer Agent Incentives

Apr 21, 2022
Sebastian Farquhar, Ryan Carey, Tom Everitt

Figure 1 for Path-Specific Objectives for Safer Agent Incentives
Figure 2 for Path-Specific Objectives for Safer Agent Incentives
Figure 3 for Path-Specific Objectives for Safer Agent Incentives
Figure 4 for Path-Specific Objectives for Safer Agent Incentives
Viaarxiv icon