Instance-Dependent Near-Optimal Policy Identification in Linear MDPs via Online Experiment Design

Add code
Jul 06, 2022

Share this with someone who'll enjoy it:

View paper onarxiv iconopen_review iconOpenReview

Share this with someone who'll enjoy it: