Alert button

Aligning Agent Policy with Externalities: Reward Design via Bilevel RL

Aug 03, 2023
Souradip Chakraborty, Amrit Singh Bedi, Alec Koppel, Dinesh Manocha, Huazheng Wang, Furong Huang, Mengdi Wang

Figure 1 for Aligning Agent Policy with Externalities: Reward Design via Bilevel RL
Figure 2 for Aligning Agent Policy with Externalities: Reward Design via Bilevel RL
Figure 3 for Aligning Agent Policy with Externalities: Reward Design via Bilevel RL
Figure 4 for Aligning Agent Policy with Externalities: Reward Design via Bilevel RL

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: