Alert button
Picture for Simon Wang

Simon Wang

Alert button

Direct Large Language Model Alignment Through Self-Rewarding Contrastive Prompt Distillation

Add code
Bookmark button
Alert button
Feb 19, 2024
Aiwei Liu, Haoping Bai, Zhiyun Lu, Xiang Kong, Simon Wang, Jiulong Shan, Meng Cao, Lijie Wen

Figure 1 for Direct Large Language Model Alignment Through Self-Rewarding Contrastive Prompt Distillation
Figure 2 for Direct Large Language Model Alignment Through Self-Rewarding Contrastive Prompt Distillation
Figure 3 for Direct Large Language Model Alignment Through Self-Rewarding Contrastive Prompt Distillation
Figure 4 for Direct Large Language Model Alignment Through Self-Rewarding Contrastive Prompt Distillation
Viaarxiv icon