Alert button

Near-Optimal Regret for Adversarial MDP with Delayed Bandit Feedback

Jan 31, 2022
Tiancheng Jin, Tal Lancewicki, Haipeng Luo, Yishay Mansour, Aviv Rosenberg

Figure 1 for Near-Optimal Regret for Adversarial MDP with Delayed Bandit Feedback

Share this with someone who'll enjoy it:

View paper onarxiv iconopen_review iconOpenReview

Share this with someone who'll enjoy it: