Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Q-Value Weighted Regression: Reinforcement Learning with Limited Data

Feb 12, 2021

Piotr Kozakowski, Łukasz Kaiser, Henryk Michalewski, Afroz Mohiuddin, Katarzyna Kańska

Figure 1 for Q-Value Weighted Regression: Reinforcement Learning with Limited Data

Figure 2 for Q-Value Weighted Regression: Reinforcement Learning with Limited Data

Figure 3 for Q-Value Weighted Regression: Reinforcement Learning with Limited Data

Figure 4 for Q-Value Weighted Regression: Reinforcement Learning with Limited Data

Share this with someone who'll enjoy it:

Abstract:Sample efficiency and performance in the offline setting have emerged as significant challenges of deep reinforcement learning. We introduce Q-Value Weighted Regression (QWR), a simple RL algorithm that excels in these aspects. QWR is an extension of Advantage Weighted Regression (AWR), an off-policy actor-critic algorithm that performs very well on continuous control tasks, also in the offline setting, but has low sample efficiency and struggles with high-dimensional observation spaces. We perform an analysis of AWR that explains its shortcomings and use these insights to motivate QWR. We show experimentally that QWR matches the state-of-the-art algorithms both on tasks with continuous and discrete actions. In particular, QWR yields results on par with SAC on the MuJoCo suite and - with the same set of hyperparameters - yields results on par with a highly tuned Rainbow implementation on a set of Atari games. We also verify that QWR performs well in the offline RL setting.

View paper on

OpenReview

Share this with someone who'll enjoy it:

Title:Q-Value Weighted Regression: Reinforcement Learning with Limited Data

Paper and Code