Alert button

Behavior Alignment via Reward Function Optimization

Oct 29, 2023
Dhawal Gupta, Yash Chandak, Scott M. Jordan, Philip S. Thomas, Bruno Castro da Silva

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: