Alert button

Instance-Dependent Near-Optimal Policy Identification in Linear MDPs via Online Experiment Design

Jul 06, 2022
Andrew Wagenmaker, Kevin Jamieson

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: