Alert button

RRHF: Rank Responses to Align Language Models with Human Feedback without tears

Add code
Bookmark button
Alert button
Apr 11, 2023
Zheng Yuan, Hongyi Yuan, Chuanqi Tan, Wei Wang, Songfang Huang, Fei Huang

Figure 1 for RRHF: Rank Responses to Align Language Models with Human Feedback without tears
Figure 2 for RRHF: Rank Responses to Align Language Models with Human Feedback without tears
Figure 3 for RRHF: Rank Responses to Align Language Models with Human Feedback without tears
Figure 4 for RRHF: Rank Responses to Align Language Models with Human Feedback without tears

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: