Kellie Lu

RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback

Sep 01, 2023
Harrison Lee, Samrat Phatale, Hassan Mansoor, Kellie Lu, Thomas Mesnard, Colton Bishop, Victor Carbune, Abhinav Rastogi

Via arXiv