Stewart Slocum

Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback

Jul 27, 2023
Stephen Casper, Xander Davies, Claudia Shi, Thomas Krendl Gilbert, Jérémy Scheurer, Javier Rando, Rachel Freedman, Tomasz Korbak, David Lindner, Pedro Freire, Tony Wang, Samuel Marks, Charbel-Raphaël Segerie, Micah Carroll, Andi Peng, Phillip Christoffersen, Mehul Damani, Stewart Slocum, Usman Anwar, Anand Siththaranjan, Max Nadeau, Eric J. Michaud, Jacob Pfau, Dmitrii Krasheninnikov, Xin Chen, Lauro Langosco, Peter Hase, Erdem Bıyık, Anca Dragan, David Krueger, Dorsa Sadigh, Dylan Hadfield-Menell

Via arXiv

Interpretable by Design: Learning Predictors by Composing Interpretable Queries

Jul 03, 2022
Aditya Chattopadhyay, Stewart Slocum, Benjamin D. Haeffele, Rene Vidal, Donald Geman

Via arXiv

AdaLead: A simple and robust adaptive greedy search algorithm for sequence design

Oct 05, 2020
Sam Sinai, Richard Wang, Alexander Whatley, Stewart Slocum, Elina Locane, Eric D. Kelsic

Via arXiv