Alert button
Picture for Anton Korinek

Anton Korinek

Alert button

Foundational Challenges in Assuring Alignment and Safety of Large Language Models

Add code
Bookmark button
Alert button
Apr 15, 2024
Usman Anwar, Abulhair Saparov, Javier Rando, Daniel Paleka, Miles Turpin, Peter Hase, Ekdeep Singh Lubana, Erik Jenner, Stephen Casper, Oliver Sourbut, Benjamin L. Edelman, Zhaowei Zhang, Mario Günther, Anton Korinek, Jose Hernandez-Orallo, Lewis Hammond, Eric Bigelow, Alexander Pan, Lauro Langosco, Tomasz Korbak, Heidi Zhang, Ruiqi Zhong, Seán Ó hÉigeartaigh, Gabriel Recchia, Giulio Corsi, Alan Chan, Markus Anderljung, Lilian Edwards, Yoshua Bengio, Danqi Chen, Samuel Albanie, Tegan Maharaj, Jakob Foerster, Florian Tramer, He He, Atoosa Kasirzadeh, Yejin Choi, David Krueger

Viaarxiv icon

Scenarios for the Transition to AGI

Add code
Bookmark button
Alert button
Mar 17, 2024
Anton Korinek, Donghyun Suh

Figure 1 for Scenarios for the Transition to AGI
Figure 2 for Scenarios for the Transition to AGI
Figure 3 for Scenarios for the Transition to AGI
Figure 4 for Scenarios for the Transition to AGI
Viaarxiv icon

Market Concentration Implications of Foundation Models

Add code
Bookmark button
Alert button
Nov 02, 2023
Jai Vipra, Anton Korinek

Viaarxiv icon

Frontier AI Regulation: Managing Emerging Risks to Public Safety

Add code
Bookmark button
Alert button
Jul 11, 2023
Markus Anderljung, Joslyn Barnhart, Anton Korinek, Jade Leung, Cullen O'Keefe, Jess Whittlestone, Shahar Avin, Miles Brundage, Justin Bullock, Duncan Cass-Beggs, Ben Chang, Tantum Collins, Tim Fist, Gillian Hadfield, Alan Hayes, Lewis Ho, Sara Hooker, Eric Horvitz, Noam Kolt, Jonas Schuett, Yonadav Shavit, Divya Siddarth, Robert Trager, Kevin Wolf

Figure 1 for Frontier AI Regulation: Managing Emerging Risks to Public Safety
Figure 2 for Frontier AI Regulation: Managing Emerging Risks to Public Safety
Figure 3 for Frontier AI Regulation: Managing Emerging Risks to Public Safety
Figure 4 for Frontier AI Regulation: Managing Emerging Risks to Public Safety
Viaarxiv icon

Aligned with Whom? Direct and social goals for AI systems

Add code
Bookmark button
Alert button
May 09, 2022
Anton Korinek, Avital Balwit

Figure 1 for Aligned with Whom? Direct and social goals for AI systems
Figure 2 for Aligned with Whom? Direct and social goals for AI systems
Viaarxiv icon

AI and Shared Prosperity

Add code
Bookmark button
Alert button
May 18, 2021
Katya Klinova, Anton Korinek

Viaarxiv icon