Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

YI Zhu

Efficient Inference and Exploration for Reinforcement Learning

Nov 04, 2019

YI Zhu, Jing Dong, Henry Lam

Figure 1 for Efficient Inference and Exploration for Reinforcement Learning

Figure 2 for Efficient Inference and Exploration for Reinforcement Learning

Figure 3 for Efficient Inference and Exploration for Reinforcement Learning

Figure 4 for Efficient Inference and Exploration for Reinforcement Learning

Abstract:Despite an ever growing literature on reinforcement learning algorithms and applications, much less is known about their statistical inference. In this paper, we investigate the large sample behaviors of the Q-value estimates with closed-form characterizations of the asymptotic variances. This allows us to efficiently construct confidence regions for Q-value and optimal value functions, and to develop policies to minimize their estimation errors. This also leads to a policy exploration strategy that relies on estimating the relative discrepancies among the Q estimates. Numerical experiments show superior performances of our exploration strategy than other benchmark approaches.

Via

Access Paper or Ask Questions