Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Conditional Importance Sampling for Off-Policy Learning

Oct 16, 2019

Mark Rowland, Anna Harutyunyan, Hado van Hasselt, Diana Borsa, Tom Schaul, Rémi Munos, Will Dabney

Figure 1 for Conditional Importance Sampling for Off-Policy Learning

Figure 2 for Conditional Importance Sampling for Off-Policy Learning

Figure 3 for Conditional Importance Sampling for Off-Policy Learning

Figure 4 for Conditional Importance Sampling for Off-Policy Learning

Share this with someone who'll enjoy it:

Abstract:The principal contribution of this paper is a conceptual framework for off-policy reinforcement learning, based on conditional expectations of importance sampling ratios. This framework yields new perspectives and understanding of existing off-policy algorithms, and reveals a broad space of unexplored algorithms. We theoretically analyse this space, and concretely investigate several algorithms that arise from this framework.

View paper on

Share this with someone who'll enjoy it:

Title:Conditional Importance Sampling for Off-Policy Learning

Paper and Code