Alert button

AlpacaFarm: A Simulation Framework for Methods that Learn from Human Feedback

Add code
Bookmark button
Alert button
May 22, 2023
Yann Dubois, Xuechen Li, Rohan Taori, Tianyi Zhang, Ishaan Gulrajani, Jimmy Ba, Carlos Guestrin, Percy Liang, Tatsunori B. Hashimoto

Figure 1 for AlpacaFarm: A Simulation Framework for Methods that Learn from Human Feedback
Figure 2 for AlpacaFarm: A Simulation Framework for Methods that Learn from Human Feedback
Figure 3 for AlpacaFarm: A Simulation Framework for Methods that Learn from Human Feedback
Figure 4 for AlpacaFarm: A Simulation Framework for Methods that Learn from Human Feedback

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: