REBEL: A Regularization-Based Solution for Reward Overoptimization in Reinforcement Learning from Human Feedback

Dec 22, 2023
Souradip Chakraborty, Amisha Bhaskar, Anukriti Singh, Pratap Tokekar, Dinesh Manocha, Amrit Singh Bedi
