Principled Reinforcement Learning with Human Feedback from Pairwise or $K$-wise Comparisons

Add code
Jan 30, 2023
Figure 1 for Principled Reinforcement Learning with Human Feedback from Pairwise or $K$-wise Comparisons
Figure 2 for Principled Reinforcement Learning with Human Feedback from Pairwise or $K$-wise Comparisons

Share this with someone who'll enjoy it:

View paper onarxiv iconopen_review iconOpenReview

Share this with someone who'll enjoy it: