Picture for Dylan Hadfield-Menell

Dylan Hadfield-Menell

Get It in Writing: Formal Contracts Mitigate Social Dilemmas in Multi-Agent RL

Add code
Aug 22, 2022
Figure 1 for Get It in Writing: Formal Contracts Mitigate Social Dilemmas in Multi-Agent RL
Figure 2 for Get It in Writing: Formal Contracts Mitigate Social Dilemmas in Multi-Agent RL
Figure 3 for Get It in Writing: Formal Contracts Mitigate Social Dilemmas in Multi-Agent RL
Figure 4 for Get It in Writing: Formal Contracts Mitigate Social Dilemmas in Multi-Agent RL
Viaarxiv icon

Towards Psychologically-Grounded Dynamic Preference Models

Add code
Aug 06, 2022
Figure 1 for Towards Psychologically-Grounded Dynamic Preference Models
Figure 2 for Towards Psychologically-Grounded Dynamic Preference Models
Figure 3 for Towards Psychologically-Grounded Dynamic Preference Models
Figure 4 for Towards Psychologically-Grounded Dynamic Preference Models
Viaarxiv icon

Toward Transparent AI: A Survey on Interpreting the Inner Structures of Deep Neural Networks

Add code
Jul 28, 2022
Figure 1 for Toward Transparent AI: A Survey on Interpreting the Inner Structures of Deep Neural Networks
Figure 2 for Toward Transparent AI: A Survey on Interpreting the Inner Structures of Deep Neural Networks
Figure 3 for Toward Transparent AI: A Survey on Interpreting the Inner Structures of Deep Neural Networks
Figure 4 for Toward Transparent AI: A Survey on Interpreting the Inner Structures of Deep Neural Networks
Viaarxiv icon

Building Human Values into Recommender Systems: An Interdisciplinary Synthesis

Add code
Jul 20, 2022
Figure 1 for Building Human Values into Recommender Systems: An Interdisciplinary Synthesis
Figure 2 for Building Human Values into Recommender Systems: An Interdisciplinary Synthesis
Figure 3 for Building Human Values into Recommender Systems: An Interdisciplinary Synthesis
Figure 4 for Building Human Values into Recommender Systems: An Interdisciplinary Synthesis
Viaarxiv icon

How to talk so your robot will learn: Instructions, descriptions, and pragmatics

Add code
Jun 16, 2022
Figure 1 for How to talk so your robot will learn: Instructions, descriptions, and pragmatics
Figure 2 for How to talk so your robot will learn: Instructions, descriptions, and pragmatics
Figure 3 for How to talk so your robot will learn: Instructions, descriptions, and pragmatics
Figure 4 for How to talk so your robot will learn: Instructions, descriptions, and pragmatics
Viaarxiv icon

Estimating and Penalizing Induced Preference Shifts in Recommender Systems

Add code
Apr 25, 2022
Figure 1 for Estimating and Penalizing Induced Preference Shifts in Recommender Systems
Figure 2 for Estimating and Penalizing Induced Preference Shifts in Recommender Systems
Figure 3 for Estimating and Penalizing Induced Preference Shifts in Recommender Systems
Figure 4 for Estimating and Penalizing Induced Preference Shifts in Recommender Systems
Viaarxiv icon

Linguistic communication as (inverse) reward design

Add code
Apr 11, 2022
Figure 1 for Linguistic communication as (inverse) reward design
Figure 2 for Linguistic communication as (inverse) reward design
Figure 3 for Linguistic communication as (inverse) reward design
Viaarxiv icon

Guided Imitation of Task and Motion Planning

Add code
Dec 06, 2021
Figure 1 for Guided Imitation of Task and Motion Planning
Figure 2 for Guided Imitation of Task and Motion Planning
Figure 3 for Guided Imitation of Task and Motion Planning
Figure 4 for Guided Imitation of Task and Motion Planning
Viaarxiv icon

What are you optimizing for? Aligning Recommender Systems with Human Values

Add code
Jul 22, 2021
Viaarxiv icon

Consequences of Misaligned AI

Add code
Feb 07, 2021
Figure 1 for Consequences of Misaligned AI
Figure 2 for Consequences of Misaligned AI
Viaarxiv icon