Alert button
Picture for Seán Ó hÉigeartaigh

Seán Ó hÉigeartaigh

Alert button

Foundational Challenges in Assuring Alignment and Safety of Large Language Models

Add code
Bookmark button
Alert button
Apr 15, 2024
Usman Anwar, Abulhair Saparov, Javier Rando, Daniel Paleka, Miles Turpin, Peter Hase, Ekdeep Singh Lubana, Erik Jenner, Stephen Casper, Oliver Sourbut, Benjamin L. Edelman, Zhaowei Zhang, Mario Günther, Anton Korinek, Jose Hernandez-Orallo, Lewis Hammond, Eric Bigelow, Alexander Pan, Lauro Langosco, Tomasz Korbak, Heidi Zhang, Ruiqi Zhong, Seán Ó hÉigeartaigh, Gabriel Recchia, Giulio Corsi, Alan Chan, Markus Anderljung, Lilian Edwards, Yoshua Bengio, Danqi Chen, Samuel Albanie, Tegan Maharaj, Jakob Foerster, Florian Tramer, He He, Atoosa Kasirzadeh, Yejin Choi, David Krueger

Viaarxiv icon

Predictable Artificial Intelligence

Add code
Bookmark button
Alert button
Oct 09, 2023
Lexin Zhou, Pablo A. Moreno-Casares, Fernando Martínez-Plumed, John Burden, Ryan Burnell, Lucy Cheke, Cèsar Ferri, Alexandru Marcoci, Behzad Mehrbakhsh, Yael Moros-Daval, Seán Ó hÉigeartaigh, Danaja Rutar, Wout Schellaert, Konstantinos Voudouris, José Hernández-Orallo

Figure 1 for Predictable Artificial Intelligence
Figure 2 for Predictable Artificial Intelligence
Figure 3 for Predictable Artificial Intelligence
Figure 4 for Predictable Artificial Intelligence
Viaarxiv icon

AI Systems of Concern

Add code
Bookmark button
Alert button
Oct 09, 2023
Kayla Matteucci, Shahar Avin, Fazl Barez, Seán Ó hÉigeartaigh

Figure 1 for AI Systems of Concern
Figure 2 for AI Systems of Concern
Figure 3 for AI Systems of Concern
Viaarxiv icon

International Governance of Civilian AI: A Jurisdictional Certification Approach

Add code
Bookmark button
Alert button
Sep 11, 2023
Robert Trager, Ben Harack, Anka Reuel, Allison Carnegie, Lennart Heim, Lewis Ho, Sarah Kreps, Ranjit Lall, Owen Larter, Seán Ó hÉigeartaigh, Simon Staffell, José Jaime Villalobos

Figure 1 for International Governance of Civilian AI: A Jurisdictional Certification Approach
Figure 2 for International Governance of Civilian AI: A Jurisdictional Certification Approach
Figure 3 for International Governance of Civilian AI: A Jurisdictional Certification Approach
Figure 4 for International Governance of Civilian AI: A Jurisdictional Certification Approach
Viaarxiv icon

The Malicious Use of Artificial Intelligence: Forecasting, Prevention, and Mitigation

Add code
Bookmark button
Alert button
Feb 20, 2018
Miles Brundage, Shahar Avin, Jack Clark, Helen Toner, Peter Eckersley, Ben Garfinkel, Allan Dafoe, Paul Scharre, Thomas Zeitzoff, Bobby Filar, Hyrum Anderson, Heather Roff, Gregory C. Allen, Jacob Steinhardt, Carrick Flynn, Seán Ó hÉigeartaigh, Simon Beard, Haydn Belfield, Sebastian Farquhar, Clare Lyle, Rebecca Crootof, Owain Evans, Michael Page, Joanna Bryson, Roman Yampolskiy, Dario Amodei

Figure 1 for The Malicious Use of Artificial Intelligence: Forecasting, Prevention, and Mitigation
Viaarxiv icon