Direct Large Language Model Alignment Through Self-Rewarding Contrastive Prompt Distillation

Feb 19, 2024
Aiwei Liu, Haoping Bai, Zhiyun Lu, Xiang Kong, Simon Wang, Jiulong Shan, Meng Cao, Lijie Wen

View paper on arXiv