Alert button

Reward Collapse in Aligning Large Language Models

Add code
Bookmark button
Alert button
May 28, 2023
Ziang Song, Tianle Cai, Jason D. Lee, Weijie J. Su

Figure 1 for Reward Collapse in Aligning Large Language Models
Figure 2 for Reward Collapse in Aligning Large Language Models
Figure 3 for Reward Collapse in Aligning Large Language Models
Figure 4 for Reward Collapse in Aligning Large Language Models

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: