Andrew Critch

TASRA: a Taxonomy and Analysis of Societal-Scale Risks from AI

Jun 14, 2023
Andrew Critch, Stuart Russell

TASRA: A Taxonomy and Analysis of Societal-Scale Risks from AI

Jun 12, 2023
Andrew Critch, Stuart Russell

WordSig: QR streams enabling platform-independent self-identification that's impossible to deepfake

Jul 15, 2022
Andrew Critch

For Learning in Symmetric Teams, Local Optima are Global Nash Equilibria

Jul 07, 2022
Scott Emmons, Caspar Oesterheld, Andrew Critch, Vincent Conitzer, Stuart Russell

Human irrationality: both bad and good for reward inference

Nov 12, 2021
Lawrence Chan, Andrew Critch, Anca Dragan

Detecting Modularity in Deep Neural Networks

Oct 13, 2021
Shlomi Hod, Stephen Casper, Daniel Filan, Cody Wild, Andrew Critch, Stuart Russell

Clusterability in Neural Networks

Mar 04, 2021
Daniel Filan, Stephen Casper, Shlomi Hod, Cody Wild, Andrew Critch, Stuart Russell

Accumulating Risk Capital Through Investing in Cooperation

Jan 25, 2021
Charlotte Roman, Michael Dennis, Andrew Critch, Stuart Russell

Multi-Principal Assistance Games: Definition and Collegial Mechanisms

Dec 29, 2020
Arnaud Fickinger, Simon Zhuang, Andrew Critch, Dylan Hadfield-Menell, Stuart Russell

Emergent Complexity and Zero-shot Transfer via Unsupervised Environment Design

Dec 03, 2020
Michael Dennis, Natasha Jaques, Eugene Vinitsky, Alexandre Bayen, Stuart Russell, Andrew Critch, Sergey Levine
