W. Bradley Knox

Contrastive Preference Learning: Learning from Human Feedback without RL

Oct 24, 2023
Joey Hejna, Rafael Rafailov, Harshit Sikchi, Chelsea Finn, Scott Niekum, W. Bradley Knox, Dorsa Sadigh

Via arXiv

Contrastive Preference Learning: Learning from Human Feedback without RL

Oct 20, 2023
Joey Hejna, Rafael Rafailov, Harshit Sikchi, Chelsea Finn, Scott Niekum, W. Bradley Knox, Dorsa Sadigh

Via arXiv

Learning Optimal Advantage from Preferences and Mistaking it for Reward

Oct 03, 2023
W. Bradley Knox, Stephane Hatgis-Kessell, Sigurdur Orn Adalgeirsson, Serena Booth, Anca Dragan, Peter Stone, Scott Niekum

Via arXiv

Models of human preference for learning reward functions

Jun 05, 2022
W. Bradley Knox, Stephane Hatgis-Kessell, Serena Booth, Scott Niekum, Peter Stone, Alessandro Allievi

Via arXiv

Reward (Mis)design for Autonomous Driving

Apr 28, 2021
W. Bradley Knox, Alessandro Allievi, Holger Banzhaf, Felix Schmitt, Peter Stone

Via arXiv

The EMPATHIC Framework for Task Learning from Implicit Human Feedback

Sep 28, 2020
Yuchen Cui, Qiping Zhang, Alessandro Allievi, Peter Stone, Scott Niekum, W. Bradley Knox

Via arXiv