Alert button

Bayesian Reward Models for LLM Alignment

Feb 20, 2024
Adam X. Yang, Maxime Robeyns, Thomas Coste, Jun Wang, Haitham Bou-Ammar, Laurence Aitchison

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: