Alert button

Direct Language Model Alignment from Online AI Feedback

Feb 07, 2024
Shangmin Guo, Biao Zhang, Tianlin Liu, Tianqi Liu, Misha Khalman, Felipe Llinares, Alexandre Rame, Thomas Mesnard, Yao Zhao, Bilal Piot, Johan Ferret, Mathieu Blondel

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: