Shahar Avin

AI Systems of Concern

Oct 09, 2023
Kayla Matteucci, Shahar Avin, Fazl Barez, Seán Ó hÉigeartaigh

Frontier AI Regulation: Managing Emerging Risks to Public Safety

Jul 11, 2023
Markus Anderljung, Joslyn Barnhart, Anton Korinek, Jade Leung, Cullen O'Keefe, Jess Whittlestone, Shahar Avin, Miles Brundage, Justin Bullock, Duncan Cass-Beggs, Ben Chang, Tantum Collins, Tim Fist, Gillian Hadfield, Alan Hayes, Lewis Ho, Sara Hooker, Eric Horvitz, Noam Kolt, Jonas Schuett, Yonadav Shavit, Divya Siddarth, Robert Trager, Kevin Wolf

Model evaluation for extreme risks

May 24, 2023
Toby Shevlane, Sebastian Farquhar, Ben Garfinkel, Mary Phuong, Jess Whittlestone, Jade Leung, Daniel Kokotajlo, Nahema Marchal, Markus Anderljung, Noam Kolt, Lewis Ho, Divya Siddarth, Shahar Avin, Will Hawkins, Been Kim, Iason Gabriel, Vijay Bolina, Jack Clark, Yoshua Bengio, Paul Christiano, Allan Dafoe

Filling gaps in trustworthy development of AI

Dec 14, 2021
Shahar Avin, Haydn Belfield, Miles Brundage, Gretchen Krueger, Jasmine Wang, Adrian Weller, Markus Anderljung, Igor Krawczuk, David Krueger, Jonathan Lebensold, Tegan Maharaj, Noa Zilberman

Exploring AI Futures Through Role Play

Dec 19, 2019
Shahar Avin, Ross Gruetzemacher, James Fox

Accounting for the Neglected Dimensions of AI Progress

Jun 02, 2018
Fernando Martínez-Plumed, Shahar Avin, Miles Brundage, Allan Dafoe, Sean Ó hÉigeartaigh, José Hernández-Orallo

The Malicious Use of Artificial Intelligence: Forecasting, Prevention, and Mitigation

Feb 20, 2018
Miles Brundage, Shahar Avin, Jack Clark, Helen Toner, Peter Eckersley, Ben Garfinkel, Allan Dafoe, Paul Scharre, Thomas Zeitzoff, Bobby Filar, Hyrum Anderson, Heather Roff, Gregory C. Allen, Jacob Steinhardt, Carrick Flynn, Seán Ó hÉigeartaigh, Simon Beard, Haydn Belfield, Sebastian Farquhar, Clare Lyle, Rebecca Crootof, Owain Evans, Michael Page, Joanna Bryson, Roman Yampolskiy, Dario Amodei
