Alert button

Scaling Laws for Reward Model Overoptimization

Oct 19, 2022
Leo Gao, John Schulman, Jacob Hilton

Figure 1 for Scaling Laws for Reward Model Overoptimization
Figure 2 for Scaling Laws for Reward Model Overoptimization
Figure 3 for Scaling Laws for Reward Model Overoptimization
Figure 4 for Scaling Laws for Reward Model Overoptimization

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: