Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Nearly Minimax Optimal Reinforcement Learning for Linear Markov Decision Processes

Dec 12, 2022

Jiafan He, Heyang Zhao, Dongruo Zhou, Quanquan Gu

Figure 1 for Nearly Minimax Optimal Reinforcement Learning for Linear Markov Decision Processes

Share this with someone who'll enjoy it:

Abstract:We study reinforcement learning (RL) with linear function approximation. For episodic time-inhomogeneous linear Markov decision processes (linear MDPs) whose transition dynamic can be parameterized as a linear function of a given feature mapping, we propose the first computationally efficient algorithm that achieves the nearly minimax optimal regret $\tilde O(d\sqrt{H^3K})$, where $d$ is the dimension of the feature mapping, $H$ is the planning horizon, and $K$ is the number of episodes. Our algorithm is based on a weighted linear regression scheme with a carefully designed weight, which depends on a new variance estimator that (1) directly estimates the variance of the \emph{optimal} value function, (2) monotonically decreases with respect to the number of episodes to ensure a better estimation accuracy, and (3) uses a rare-switching policy to update the value function estimator to control the complexity of the estimated value function class. Our work provides a complete answer to optimal RL with linear MDPs, and the developed algorithm and theoretical tools may be of independent interest.

* 44 pages, 1 table

View paper on

Share this with someone who'll enjoy it:

Title:Nearly Minimax Optimal Reinforcement Learning for Linear Markov Decision Processes

Paper and Code