Alert button

Regularized Best-of-N Sampling to Mitigate Reward Hacking for Language Model Alignment

Add code
Bookmark button
Alert button
Apr 05, 2024
Yuu Jinnai, Tetsuro Morimura, Kaito Ariu, Kenshi Abe

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: