The Past, Present and Better Future of Feedback Learning in Large Language Models for Subjective Human Preferences and Values

Oct 11, 2023
Hannah Rose Kirk, Andrew M. Bean, Bertie Vidgen, Paul Röttger, Scott A. Hale

View paper on arXiv
