
A maximum-entropy approach to off-policy evaluation in average-reward MDPs

Jun 17, 2020
Nevena Lazic, Dong Yin, Mehrdad Farajtabar, Nir Levine, Dilan Gorur, Chris Harris, Dale Schuurmans

Figure 1 for A maximum-entropy approach to off-policy evaluation in average-reward MDPs
Figure 2 for A maximum-entropy approach to off-policy evaluation in average-reward MDPs


View paper on arXiv
