Anca Dragan

UniMASK: Unified Inference in Sequential Decision Problems

Nov 20, 2022
Micah Carroll, Orr Paradise, Jessy Lin, Raluca Georgescu, Mingfei Sun, David Bignell, Stephanie Milani, Katja Hofmann, Matthew Hausknecht, Anca Dragan, Sam Devlin

Optimal Behavior Prior: Data-Efficient Human Models for Improved Human-AI Collaboration

Nov 19, 2022
Mesut Yang, Micah Carroll, Anca Dragan

Towards Flexible Inference in Sequential Decision Problems via Bidirectional Transformers

Apr 28, 2022
Micah Carroll, Jessy Lin, Orr Paradise, Raluca Georgescu, Mingfei Sun, David Bignell, Stephanie Milani, Katja Hofmann, Matthew Hausknecht, Anca Dragan, Sam Devlin

Estimating and Penalizing Induced Preference Shifts in Recommender Systems

Apr 25, 2022
Micah Carroll, Dylan Hadfield-Menell, Stuart Russell, Anca Dragan

The Boltzmann Policy Distribution: Accounting for Systematic Suboptimality in Human Models

Apr 22, 2022
Cassidy Laidlaw, Anca Dragan

Inferring Rewards from Language in Context

Apr 05, 2022
Jessy Lin, Daniel Fried, Dan Klein, Anca Dragan

Human irrationality: both bad and good for reward inference

Nov 12, 2021
Lawrence Chan, Andrew Critch, Anca Dragan

B-Pref: Benchmarking Preference-Based Reinforcement Learning

Nov 04, 2021
Kimin Lee, Laura Smith, Anca Dragan, Pieter Abbeel

The MineRL BASALT Competition on Learning from Human Feedback

Jul 05, 2021
Rohin Shah, Cody Wild, Steven H. Wang, Neel Alex, Brandon Houghton, William Guss, Sharada Mohanty, Anssi Kanervisto, Stephanie Milani, Nicholay Topin, Pieter Abbeel, Stuart Russell, Anca Dragan
