Beyond Reward: Offline Preference-guided Policy Optimization

May 25, 2023
Yachen Kang, Diyuan Shi, Jinxin Liu, Li He, Donglin Wang

Figures 1-4 for Beyond Reward: Offline Preference-guided Policy Optimization

View paper on: arXiv, OpenReview