Wenqiang Wei

Proxy-RLHF: Decoupling Generation and Alignment in Large Language Model with Proxy

Mar 07, 2024
Yu Zhu, Chuxiong Sun, Wenfei Yang, Wenqiang Wei, Bo Tang, Tianzhu Zhang, Zhiyu Li, Shifeng Zhang, Feiyu Xiong, Jie Hu, Mingchuan Yang

Figures 1-4 for Proxy-RLHF: Decoupling Generation and Alignment in Large Language Model with Proxy
Via arXiv