Shawn Im

Understanding the Learning Dynamics of Alignment with Human Feedback

Apr 08, 2024
Via arXiv

Evaluating the Utility of Model Explanations for Model Development

Dec 10, 2023
Via arXiv