Alert button
Picture for Lewis Ho

Lewis Ho

Alert button

Evaluating Frontier Models for Dangerous Capabilities

Add code
Bookmark button
Alert button
Mar 20, 2024
Mary Phuong, Matthew Aitchison, Elliot Catt, Sarah Cogan, Alexandre Kaskasoli, Victoria Krakovna, David Lindner, Matthew Rahtz, Yannis Assael, Sarah Hodkinson, Heidi Howard, Tom Lieberum, Ramana Kumar, Maria Abi Raad, Albert Webson, Lewis Ho, Sharon Lin, Sebastian Farquhar, Marcus Hutter, Gregoire Deletang, Anian Ruoss, Seliem El-Sayed, Sasha Brown, Anca Dragan, Rohin Shah, Allan Dafoe, Toby Shevlane

Figure 1 for Evaluating Frontier Models for Dangerous Capabilities
Figure 2 for Evaluating Frontier Models for Dangerous Capabilities
Figure 3 for Evaluating Frontier Models for Dangerous Capabilities
Figure 4 for Evaluating Frontier Models for Dangerous Capabilities
Viaarxiv icon

International Governance of Civilian AI: A Jurisdictional Certification Approach

Add code
Bookmark button
Alert button
Sep 11, 2023
Robert Trager, Ben Harack, Anka Reuel, Allison Carnegie, Lennart Heim, Lewis Ho, Sarah Kreps, Ranjit Lall, Owen Larter, Seán Ó hÉigeartaigh, Simon Staffell, José Jaime Villalobos

Figure 1 for International Governance of Civilian AI: A Jurisdictional Certification Approach
Figure 2 for International Governance of Civilian AI: A Jurisdictional Certification Approach
Figure 3 for International Governance of Civilian AI: A Jurisdictional Certification Approach
Figure 4 for International Governance of Civilian AI: A Jurisdictional Certification Approach
Viaarxiv icon

Frontier AI Regulation: Managing Emerging Risks to Public Safety

Add code
Bookmark button
Alert button
Jul 11, 2023
Markus Anderljung, Joslyn Barnhart, Anton Korinek, Jade Leung, Cullen O'Keefe, Jess Whittlestone, Shahar Avin, Miles Brundage, Justin Bullock, Duncan Cass-Beggs, Ben Chang, Tantum Collins, Tim Fist, Gillian Hadfield, Alan Hayes, Lewis Ho, Sara Hooker, Eric Horvitz, Noam Kolt, Jonas Schuett, Yonadav Shavit, Divya Siddarth, Robert Trager, Kevin Wolf

Figure 1 for Frontier AI Regulation: Managing Emerging Risks to Public Safety
Figure 2 for Frontier AI Regulation: Managing Emerging Risks to Public Safety
Figure 3 for Frontier AI Regulation: Managing Emerging Risks to Public Safety
Figure 4 for Frontier AI Regulation: Managing Emerging Risks to Public Safety
Viaarxiv icon

Model evaluation for extreme risks

Add code
Bookmark button
Alert button
May 24, 2023
Toby Shevlane, Sebastian Farquhar, Ben Garfinkel, Mary Phuong, Jess Whittlestone, Jade Leung, Daniel Kokotajlo, Nahema Marchal, Markus Anderljung, Noam Kolt, Lewis Ho, Divya Siddarth, Shahar Avin, Will Hawkins, Been Kim, Iason Gabriel, Vijay Bolina, Jack Clark, Yoshua Bengio, Paul Christiano, Allan Dafoe

Figure 1 for Model evaluation for extreme risks
Figure 2 for Model evaluation for extreme risks
Figure 3 for Model evaluation for extreme risks
Figure 4 for Model evaluation for extreme risks
Viaarxiv icon