Alan Chan

Greedification Operators for Policy Optimization: Investigating Forward and Reverse KL Divergences


Jul 17, 2021
Alan Chan, Hugo Silva, Sungsu Lim, Tadashi Kozuno, A. Rupam Mahmood, Martha White

* Submitted to JMLR 


Parameter-free Gradient Temporal Difference Learning


May 10, 2021
Andrew Jacobsen, Alan Chan

* 30 pages, 10 figures 


Inverse Policy Evaluation for Value-based Sequential Decision-making


Aug 26, 2020
Alan Chan, Kris de Asis, Richard S. Sutton

* Submitted to NeurIPS 2020 


Efficient decorrelation of features using Gramian in Reinforcement Learning


Nov 19, 2019
Borislav Mavrin, Daniel Graves, Alan Chan



Fixed-Horizon Temporal Difference Methods for Stable Reinforcement Learning


Sep 09, 2019
Kristopher De Asis, Alan Chan, Silviu Pitis, Richard S. Sutton, Daniel Graves

