
Usman Anwar


Foundational Challenges in Assuring Alignment and Safety of Large Language Models

Apr 15, 2024
Usman Anwar, Abulhair Saparov, Javier Rando, Daniel Paleka, Miles Turpin, Peter Hase, Ekdeep Singh Lubana, Erik Jenner, Stephen Casper, Oliver Sourbut, Benjamin L. Edelman, Zhaowei Zhang, Mario Günther, Anton Korinek, Jose Hernandez-Orallo, Lewis Hammond, Eric Bigelow, Alexander Pan, Lauro Langosco, Tomasz Korbak, Heidi Zhang, Ruiqi Zhong, Seán Ó hÉigeartaigh, Gabriel Recchia, Giulio Corsi, Alan Chan, Markus Anderljung, Lilian Edwards, Yoshua Bengio, Danqi Chen, Samuel Albanie, Tegan Maharaj, Jakob Foerster, Florian Tramer, He He, Atoosa Kasirzadeh, Yejin Choi, David Krueger


Reward Model Ensembles Help Mitigate Overoptimization

Oct 04, 2023
Thomas Coste, Usman Anwar, Robert Kirk, David Krueger


Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback

Jul 27, 2023
Stephen Casper, Xander Davies, Claudia Shi, Thomas Krendl Gilbert, Jérémy Scheurer, Javier Rando, Rachel Freedman, Tomasz Korbak, David Lindner, Pedro Freire, Tony Wang, Samuel Marks, Charbel-Raphaël Segerie, Micah Carroll, Andi Peng, Phillip Christoffersen, Mehul Damani, Stewart Slocum, Usman Anwar, Anand Siththaranjan, Max Nadeau, Eric J. Michaud, Jacob Pfau, Dmitrii Krasheninnikov, Xin Chen, Lauro Langosco, Peter Hase, Erdem Bıyık, Anca Dragan, David Krueger, Dorsa Sadigh, Dylan Hadfield-Menell


Domain Generalization for Robust Model-Based Offline Reinforcement Learning

Nov 27, 2022
Alan Clark, Shoaib Ahmed Siddiqui, Robert Kirk, Usman Anwar, Stephen Chung, David Krueger


Inverse Constrained Reinforcement Learning

Nov 24, 2020
Usman Anwar, Shehryar Malik, Alireza Aghasi, Ali Ahmed


Learning To Solve Differential Equations Across Initial Conditions

Apr 19, 2020
Shehryar Malik, Usman Anwar, Ali Ahmed, Alireza Aghasi
