When Your AI Deceives You: Challenges with Partial Observability of Human Evaluators in Reward Learning

Feb 27, 2024
Leon Lang, Davis Foote, Stuart Russell, Anca Dragan, Erik Jenner, Scott Emmons

View paper on arXiv