Transforming and Combining Rewards for Aligning Large Language Models

Feb 01, 2024
Zihao Wang, Chirag Nagpal, Jonathan Berant, Jacob Eisenstein, Alex D'Amour, Sanmi Koyejo, Victor Veitch
