Alert button
Picture for Thomas Krendl Gilbert

Thomas Krendl Gilbert

Alert button

Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback

Add code
Bookmark button
Alert button
Jul 27, 2023
Stephen Casper, Xander Davies, Claudia Shi, Thomas Krendl Gilbert, Jérémy Scheurer, Javier Rando, Rachel Freedman, Tomasz Korbak, David Lindner, Pedro Freire, Tony Wang, Samuel Marks, Charbel-Raphaël Segerie, Micah Carroll, Andi Peng, Phillip Christoffersen, Mehul Damani, Stewart Slocum, Usman Anwar, Anand Siththaranjan, Max Nadeau, Eric J. Michaud, Jacob Pfau, Dmitrii Krasheninnikov, Xin Chen, Lauro Langosco, Peter Hase, Erdem Bıyık, Anca Dragan, David Krueger, Dorsa Sadigh, Dylan Hadfield-Menell

Figure 1 for Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
Figure 2 for Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
Figure 3 for Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
Figure 4 for Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
Viaarxiv icon

Optimization's Neglected Normative Commitments

Add code
Bookmark button
Alert button
May 27, 2023
Benjamin Laufer, Thomas Krendl Gilbert, Helen Nissenbaum

Figure 1 for Optimization's Neglected Normative Commitments
Figure 2 for Optimization's Neglected Normative Commitments
Viaarxiv icon

Dynamic Documentation for AI Systems

Add code
Bookmark button
Alert button
Mar 20, 2023
Soham Mehta, Anderson Rogers, Thomas Krendl Gilbert

Viaarxiv icon

Beyond Bias and Compliance: Towards Individual Agency and Plurality of Ethics in AI

Add code
Bookmark button
Alert button
Feb 23, 2023
Thomas Krendl Gilbert, Megan Welle Brozek, Andrew Brozek

Figure 1 for Beyond Bias and Compliance: Towards Individual Agency and Plurality of Ethics in AI
Viaarxiv icon

Reward Reports for Reinforcement Learning

Add code
Bookmark button
Alert button
Apr 25, 2022
Thomas Krendl Gilbert, Sarah Dean, Nathan Lambert, Tom Zick, Aaron Snoswell

Figure 1 for Reward Reports for Reinforcement Learning
Figure 2 for Reward Reports for Reinforcement Learning
Figure 3 for Reward Reports for Reinforcement Learning
Figure 4 for Reward Reports for Reinforcement Learning
Viaarxiv icon

Choices, Risks, and Reward Reports: Charting Public Policy for Reinforcement Learning Systems

Add code
Bookmark button
Alert button
Feb 11, 2022
Thomas Krendl Gilbert, Sarah Dean, Tom Zick, Nathan Lambert

Viaarxiv icon

Hard Choices in Artificial Intelligence

Add code
Bookmark button
Alert button
Jun 10, 2021
Roel Dobbe, Thomas Krendl Gilbert, Yonatan Mintz

Figure 1 for Hard Choices in Artificial Intelligence
Figure 2 for Hard Choices in Artificial Intelligence
Figure 3 for Hard Choices in Artificial Intelligence
Viaarxiv icon

Axes for Sociotechnical Inquiry in AI Research

Add code
Bookmark button
Alert button
Apr 26, 2021
Sarah Dean, Thomas Krendl Gilbert, Nathan Lambert, Tom Zick

Figure 1 for Axes for Sociotechnical Inquiry in AI Research
Figure 2 for Axes for Sociotechnical Inquiry in AI Research
Viaarxiv icon

AI Development for the Public Interest: From Abstraction Traps to Sociotechnical Risks

Add code
Bookmark button
Alert button
Feb 04, 2021
McKane Andrus, Sarah Dean, Thomas Krendl Gilbert, Nathan Lambert, Tom Zick

Figure 1 for AI Development for the Public Interest: From Abstraction Traps to Sociotechnical Risks
Viaarxiv icon

Hard Choices in Artificial Intelligence: Addressing Normative Uncertainty through Sociotechnical Commitments

Add code
Bookmark button
Alert button
Nov 20, 2019
Roel Dobbe, Thomas Krendl Gilbert, Yonatan Mintz

Viaarxiv icon