Alert button
Picture for Anca Dragan

Anca Dragan

Alert button

Learning to Model the World with Language

Add code
Bookmark button
Alert button
Jul 31, 2023
Jessy Lin, Yuqing Du, Olivia Watkins, Danijar Hafner, Pieter Abbeel, Dan Klein, Anca Dragan

Figure 1 for Learning to Model the World with Language
Figure 2 for Learning to Model the World with Language
Figure 3 for Learning to Model the World with Language
Figure 4 for Learning to Model the World with Language
Viaarxiv icon

Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback

Add code
Bookmark button
Alert button
Jul 27, 2023
Stephen Casper, Xander Davies, Claudia Shi, Thomas Krendl Gilbert, Jérémy Scheurer, Javier Rando, Rachel Freedman, Tomasz Korbak, David Lindner, Pedro Freire, Tony Wang, Samuel Marks, Charbel-Raphaël Segerie, Micah Carroll, Andi Peng, Phillip Christoffersen, Mehul Damani, Stewart Slocum, Usman Anwar, Anand Siththaranjan, Max Nadeau, Eric J. Michaud, Jacob Pfau, Dmitrii Krasheninnikov, Xin Chen, Lauro Langosco, Peter Hase, Erdem Bıyık, Anca Dragan, David Krueger, Dorsa Sadigh, Dylan Hadfield-Menell

Figure 1 for Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
Figure 2 for Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
Figure 3 for Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
Figure 4 for Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
Viaarxiv icon

Goal Representations for Instruction Following: A Semi-Supervised Language Interface to Control

Add code
Bookmark button
Alert button
Jun 30, 2023
Vivek Myers, Andre He, Kuan Fang, Homer Walke, Philippe Hansen-Estruch, Ching-An Cheng, Mihai Jalobeanu, Andrey Kolobov, Anca Dragan, Sergey Levine

Figure 1 for Goal Representations for Instruction Following: A Semi-Supervised Language Interface to Control
Figure 2 for Goal Representations for Instruction Following: A Semi-Supervised Language Interface to Control
Figure 3 for Goal Representations for Instruction Following: A Semi-Supervised Language Interface to Control
Figure 4 for Goal Representations for Instruction Following: A Semi-Supervised Language Interface to Control
Viaarxiv icon

Toward Grounded Social Reasoning

Add code
Bookmark button
Alert button
Jun 14, 2023
Minae Kwon, Hengyuan Hu, Vivek Myers, Siddharth Karamcheti, Anca Dragan, Dorsa Sadigh

Figure 1 for Toward Grounded Social Reasoning
Figure 2 for Toward Grounded Social Reasoning
Figure 3 for Toward Grounded Social Reasoning
Figure 4 for Toward Grounded Social Reasoning
Viaarxiv icon

Bridging RL Theory and Practice with the Effective Horizon

Add code
Bookmark button
Alert button
Apr 19, 2023
Cassidy Laidlaw, Stuart Russell, Anca Dragan

Figure 1 for Bridging RL Theory and Practice with the Effective Horizon
Figure 2 for Bridging RL Theory and Practice with the Effective Horizon
Figure 3 for Bridging RL Theory and Practice with the Effective Horizon
Figure 4 for Bridging RL Theory and Practice with the Effective Horizon
Viaarxiv icon

Learning to Influence Human Behavior with Offline Reinforcement Learning

Add code
Bookmark button
Alert button
Mar 10, 2023
Joey Hong, Anca Dragan, Sergey Levine

Figure 1 for Learning to Influence Human Behavior with Offline Reinforcement Learning
Figure 2 for Learning to Influence Human Behavior with Offline Reinforcement Learning
Figure 3 for Learning to Influence Human Behavior with Offline Reinforcement Learning
Figure 4 for Learning to Influence Human Behavior with Offline Reinforcement Learning
Viaarxiv icon

Automatically Auditing Large Language Models via Discrete Optimization

Add code
Bookmark button
Alert button
Mar 08, 2023
Erik Jones, Anca Dragan, Aditi Raghunathan, Jacob Steinhardt

Figure 1 for Automatically Auditing Large Language Models via Discrete Optimization
Figure 2 for Automatically Auditing Large Language Models via Discrete Optimization
Figure 3 for Automatically Auditing Large Language Models via Discrete Optimization
Figure 4 for Automatically Auditing Large Language Models via Discrete Optimization
Viaarxiv icon

Towards Modeling and Influencing the Dynamics of Human Learning

Add code
Bookmark button
Alert button
Jan 02, 2023
Ran Tian, Masayoshi Tomizuka, Anca Dragan, Andrea Bajcsy

Figure 1 for Towards Modeling and Influencing the Dynamics of Human Learning
Figure 2 for Towards Modeling and Influencing the Dynamics of Human Learning
Figure 3 for Towards Modeling and Influencing the Dynamics of Human Learning
Figure 4 for Towards Modeling and Influencing the Dynamics of Human Learning
Viaarxiv icon

On the Sensitivity of Reward Inference to Misspecified Human Models

Add code
Bookmark button
Alert button
Dec 09, 2022
Joey Hong, Kush Bhatia, Anca Dragan

Figure 1 for On the Sensitivity of Reward Inference to Misspecified Human Models
Figure 2 for On the Sensitivity of Reward Inference to Misspecified Human Models
Figure 3 for On the Sensitivity of Reward Inference to Misspecified Human Models
Figure 4 for On the Sensitivity of Reward Inference to Misspecified Human Models
Viaarxiv icon

Time-Efficient Reward Learning via Visually Assisted Cluster Ranking

Add code
Bookmark button
Alert button
Nov 30, 2022
David Zhang, Micah Carroll, Andreea Bobu, Anca Dragan

Figure 1 for Time-Efficient Reward Learning via Visually Assisted Cluster Ranking
Figure 2 for Time-Efficient Reward Learning via Visually Assisted Cluster Ranking
Figure 3 for Time-Efficient Reward Learning via Visually Assisted Cluster Ranking
Figure 4 for Time-Efficient Reward Learning via Visually Assisted Cluster Ranking
Viaarxiv icon