Ben Garfinkel

Model evaluation for extreme risks

May 24, 2023
Toby Shevlane, Sebastian Farquhar, Ben Garfinkel, Mary Phuong, Jess Whittlestone, Jade Leung, Daniel Kokotajlo, Nahema Marchal, Markus Anderljung, Noam Kolt, Lewis Ho, Divya Siddarth, Shahar Avin, Will Hawkins, Been Kim, Iason Gabriel, Vijay Bolina, Jack Clark, Yoshua Bengio, Paul Christiano, Allan Dafoe

Via arXiv

Democratising AI: Multiple Meanings, Goals, and Methods

Mar 27, 2023
Elizabeth Seger, Aviv Ovadya, Ben Garfinkel, Divya Siddarth, Allan Dafoe

Via arXiv

Exploring the Relevance of Data Privacy-Enhancing Technologies for AI Governance Use Cases

Mar 20, 2023
Emma Bluemke, Tantum Collins, Ben Garfinkel, Andrew Trask

Via arXiv

The Windfall Clause: Distributing the Benefits of AI for the Common Good

Jan 24, 2020
Cullen O'Keefe, Peter Cihon, Ben Garfinkel, Carrick Flynn, Jade Leung, Allan Dafoe

Via arXiv

The Malicious Use of Artificial Intelligence: Forecasting, Prevention, and Mitigation

Feb 20, 2018
Miles Brundage, Shahar Avin, Jack Clark, Helen Toner, Peter Eckersley, Ben Garfinkel, Allan Dafoe, Paul Scharre, Thomas Zeitzoff, Bobby Filar, Hyrum Anderson, Heather Roff, Gregory C. Allen, Jacob Steinhardt, Carrick Flynn, Seán Ó hÉigeartaigh, Simon Beard, Haydn Belfield, Sebastian Farquhar, Clare Lyle, Rebecca Crootof, Owain Evans, Michael Page, Joanna Bryson, Roman Yampolskiy, Dario Amodei

Via arXiv