Alert button
Picture for Wenbin Lai

Wenbin Lai

Alert button

Improving Generalization of Alignment with Human Preferences through Group Invariant Learning

Add code
Bookmark button
Alert button
Oct 19, 2023
Rui Zheng, Wei Shen, Yuan Hua, Wenbin Lai, Shihan Dou, Yuhao Zhou, Zhiheng Xi, Xiao Wang, Haoran Huang, Tao Gui, Qi Zhang, Xuanjing Huang

Figure 1 for Improving Generalization of Alignment with Human Preferences through Group Invariant Learning
Figure 2 for Improving Generalization of Alignment with Human Preferences through Group Invariant Learning
Figure 3 for Improving Generalization of Alignment with Human Preferences through Group Invariant Learning
Figure 4 for Improving Generalization of Alignment with Human Preferences through Group Invariant Learning
Viaarxiv icon

Secrets of RLHF in Large Language Models Part I: PPO

Add code
Bookmark button
Alert button
Jul 18, 2023
Rui Zheng, Shihan Dou, Songyang Gao, Yuan Hua, Wei Shen, Binghai Wang, Yan Liu, Senjie Jin, Qin Liu, Yuhao Zhou, Limao Xiong, Lu Chen, Zhiheng Xi, Nuo Xu, Wenbin Lai, Minghao Zhu, Cheng Chang, Zhangyue Yin, Rongxiang Weng, Wensen Cheng, Haoran Huang, Tianxiang Sun, Hang Yan, Tao Gui, Qi Zhang, Xipeng Qiu, Xuanjing Huang

Figure 1 for Secrets of RLHF in Large Language Models Part I: PPO
Figure 2 for Secrets of RLHF in Large Language Models Part I: PPO
Figure 3 for Secrets of RLHF in Large Language Models Part I: PPO
Figure 4 for Secrets of RLHF in Large Language Models Part I: PPO
Viaarxiv icon