Alert button
Picture for Dylan Hadfield-Menell

Dylan Hadfield-Menell

Alert button

Estimating and Penalizing Induced Preference Shifts in Recommender Systems

Add code
Bookmark button
Alert button
Apr 25, 2022
Micah Carroll, Dylan Hadfield-Menell, Stuart Russell, Anca Dragan

Figure 1 for Estimating and Penalizing Induced Preference Shifts in Recommender Systems
Figure 2 for Estimating and Penalizing Induced Preference Shifts in Recommender Systems
Figure 3 for Estimating and Penalizing Induced Preference Shifts in Recommender Systems
Figure 4 for Estimating and Penalizing Induced Preference Shifts in Recommender Systems
Viaarxiv icon

Linguistic communication as (inverse) reward design

Add code
Bookmark button
Alert button
Apr 11, 2022
Theodore R. Sumers, Robert D. Hawkins, Mark K. Ho, Thomas L. Griffiths, Dylan Hadfield-Menell

Figure 1 for Linguistic communication as (inverse) reward design
Figure 2 for Linguistic communication as (inverse) reward design
Figure 3 for Linguistic communication as (inverse) reward design
Viaarxiv icon

Guided Imitation of Task and Motion Planning

Add code
Bookmark button
Alert button
Dec 06, 2021
Michael James McDonald, Dylan Hadfield-Menell

Figure 1 for Guided Imitation of Task and Motion Planning
Figure 2 for Guided Imitation of Task and Motion Planning
Figure 3 for Guided Imitation of Task and Motion Planning
Figure 4 for Guided Imitation of Task and Motion Planning
Viaarxiv icon

What are you optimizing for? Aligning Recommender Systems with Human Values

Add code
Bookmark button
Alert button
Jul 22, 2021
Jonathan Stray, Ivan Vendrov, Jeremy Nixon, Steven Adler, Dylan Hadfield-Menell

Viaarxiv icon

Consequences of Misaligned AI

Add code
Bookmark button
Alert button
Feb 07, 2021
Simon Zhuang, Dylan Hadfield-Menell

Figure 1 for Consequences of Misaligned AI
Figure 2 for Consequences of Misaligned AI
Viaarxiv icon

Multi-Principal Assistance Games: Definition and Collegial Mechanisms

Add code
Bookmark button
Alert button
Dec 29, 2020
Arnaud Fickinger, Simon Zhuang, Andrew Critch, Dylan Hadfield-Menell, Stuart Russell

Viaarxiv icon

Multi-Principal Assistance Games

Add code
Bookmark button
Alert button
Jul 19, 2020
Arnaud Fickinger, Simon Zhuang, Dylan Hadfield-Menell, Stuart Russell

Figure 1 for Multi-Principal Assistance Games
Viaarxiv icon

Silly rules improve the capacity of agents to learn stable enforcement and compliance behaviors

Add code
Bookmark button
Alert button
Jan 25, 2020
Raphael Köster, Dylan Hadfield-Menell, Gillian K. Hadfield, Joel Z. Leibo

Figure 1 for Silly rules improve the capacity of agents to learn stable enforcement and compliance behaviors
Figure 2 for Silly rules improve the capacity of agents to learn stable enforcement and compliance behaviors
Figure 3 for Silly rules improve the capacity of agents to learn stable enforcement and compliance behaviors
Figure 4 for Silly rules improve the capacity of agents to learn stable enforcement and compliance behaviors
Viaarxiv icon

An Extensible Interactive Interface for Agent Design

Add code
Bookmark button
Alert button
Jun 10, 2019
Matthew Rahtz, James Fang, Anca D. Dragan, Dylan Hadfield-Menell

Figure 1 for An Extensible Interactive Interface for Agent Design
Figure 2 for An Extensible Interactive Interface for Agent Design
Figure 3 for An Extensible Interactive Interface for Agent Design
Figure 4 for An Extensible Interactive Interface for Agent Design
Viaarxiv icon