Alan Chan

Greedification Operators for Policy Optimization: Investigating Forward and Reverse KL Divergences

Jul 17, 2021
Alan Chan, Hugo Silva, Sungsu Lim, Tadashi Kozuno, A. Rupam Mahmood, Martha White

* Submitted to JMLR 

Parameter-free Gradient Temporal Difference Learning

May 10, 2021
Andrew Jacobsen, Alan Chan

* 30 pages, 10 figures 

Inverse Policy Evaluation for Value-based Sequential Decision-making

Aug 26, 2020
Alan Chan, Kris de Asis, Richard S. Sutton

* Submitted to NeurIPS 2020 

Efficient decorrelation of features using Gramian in Reinforcement Learning

Nov 19, 2019
Borislav Mavrin, Daniel Graves, Alan Chan

Fixed-Horizon Temporal Difference Methods for Stable Reinforcement Learning

Sep 09, 2019
Kristopher De Asis, Alan Chan, Silviu Pitis, Richard S. Sutton, Daniel Graves
