Alert button

Preventing Reward Hacking with Occupancy Measure Regularization

Add code
Bookmark button
Alert button
Mar 05, 2024
Cassidy Laidlaw, Shivam Singhal, Anca Dragan

Figure 1 for Preventing Reward Hacking with Occupancy Measure Regularization
Figure 2 for Preventing Reward Hacking with Occupancy Measure Regularization
Figure 3 for Preventing Reward Hacking with Occupancy Measure Regularization
Figure 4 for Preventing Reward Hacking with Occupancy Measure Regularization

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: