
Micah Carroll


Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback

Jul 27, 2023
Stephen Casper, Xander Davies, Claudia Shi, Thomas Krendl Gilbert, Jérémy Scheurer, Javier Rando, Rachel Freedman, Tomasz Korbak, David Lindner, Pedro Freire, Tony Wang, Samuel Marks, Charbel-Raphaël Segerie, Micah Carroll, Andi Peng, Phillip Christoffersen, Mehul Damani, Stewart Slocum, Usman Anwar, Anand Siththaranjan, Max Nadeau, Eric J. Michaud, Jacob Pfau, Dmitrii Krasheninnikov, Xin Chen, Lauro Langosco, Peter Hase, Erdem Bıyık, Anca Dragan, David Krueger, Dorsa Sadigh, Dylan Hadfield-Menell

Via arXiv

Who Needs to Know? Minimal Knowledge for Optimal Coordination

Jun 15, 2023
Niklas Lauffer, Ameesh Shah, Micah Carroll, Michael Dennis, Stuart Russell

Via arXiv

Time-Efficient Reward Learning via Visually Assisted Cluster Ranking

Nov 30, 2022
David Zhang, Micah Carroll, Andreea Bobu, Anca Dragan

Via arXiv

UniMASK: Unified Inference in Sequential Decision Problems

Nov 20, 2022
Micah Carroll, Orr Paradise, Jessy Lin, Raluca Georgescu, Mingfei Sun, David Bignell, Stephanie Milani, Katja Hofmann, Matthew Hausknecht, Anca Dragan, Sam Devlin

Via arXiv

Optimal Behavior Prior: Data-Efficient Human Models for Improved Human-AI Collaboration

Nov 19, 2022
Mesut Yang, Micah Carroll, Anca Dragan

Via arXiv

Towards Flexible Inference in Sequential Decision Problems via Bidirectional Transformers

Apr 28, 2022
Micah Carroll, Jessy Lin, Orr Paradise, Raluca Georgescu, Mingfei Sun, David Bignell, Stephanie Milani, Katja Hofmann, Matthew Hausknecht, Anca Dragan, Sam Devlin

Via arXiv

Estimating and Penalizing Induced Preference Shifts in Recommender Systems

Apr 25, 2022
Micah Carroll, Dylan Hadfield-Menell, Stuart Russell, Anca Dragan

Via arXiv

Evaluating the Robustness of Collaborative Agents

Jan 14, 2021
Paul Knott, Micah Carroll, Sam Devlin, Kamil Ciosek, Katja Hofmann, A. D. Dragan, Rohin Shah

Via arXiv

On the Utility of Learning about Humans for Human-AI Coordination

Oct 13, 2019
Micah Carroll, Rohin Shah, Mark K. Ho, Thomas L. Griffiths, Sanjit A. Seshia, Pieter Abbeel, Anca Dragan

Via arXiv