Stuart Armstrong

CoinRun: Solving Goal Misgeneralisation

Sep 28, 2023
Stuart Armstrong, Alexandre Maranhão, Oliver Daniels-Koch, Patrick Leask, Rebecca Gorman

Concept Extrapolation: A Conceptual Primer

Jun 19, 2023
Matija Franklin, Rebecca Gorman, Hal Ashton, Stuart Armstrong

Recognising the importance of preference change: A call for a coordinated multidisciplinary research effort in the age of AI

Mar 30, 2022
Matija Franklin, Hal Ashton, Rebecca Gorman, Stuart Armstrong

The dangers in algorithms learning humans' values and irrationalities

Mar 01, 2022
Rebecca Gorman, Stuart Armstrong

Chess as a Testing Grounds for the Oracle Approach to AI Safety

Oct 06, 2020
James D. Miller, Roman Yampolskiy, Olle Haggstrom, Stuart Armstrong

Pitfalls of learning a reward function online

Apr 28, 2020
Stuart Armstrong, Jan Leike, Laurent Orseau, Shane Legg

Occam's razor is insufficient to infer the preferences of irrational agents

Oct 29, 2018
Stuart Armstrong, Sören Mindermann

Good and safe uses of AI Oracles

Jun 05, 2018
Stuart Armstrong, Xavier O'Rourke

'Indifference' methods for managing agent rewards

Jun 05, 2018
Stuart Armstrong, Xavier O'Rourke

Counterfactual equivalence for POMDPs, and underlying deterministic environments

Jan 14, 2018
Stuart Armstrong
