Alert button

Mind the Gap: Offline Policy Optimization for Imperfect Rewards

Feb 03, 2023
Jianxiong Li, Xiao Hu, Haoran Xu, Jingjing Liu, Xianyuan Zhan, Qing-Shan Jia, Ya-Qin Zhang

Figure 1 for Mind the Gap: Offline Policy Optimization for Imperfect Rewards
Figure 2 for Mind the Gap: Offline Policy Optimization for Imperfect Rewards
Figure 3 for Mind the Gap: Offline Policy Optimization for Imperfect Rewards
Figure 4 for Mind the Gap: Offline Policy Optimization for Imperfect Rewards

Share this with someone who'll enjoy it:

View paper onarxiv iconopen_review iconOpenReview

Share this with someone who'll enjoy it: