Martha White

Greedification Operators for Policy Optimization: Investigating Forward and Reverse KL Divergences

Jul 17, 2021
Alan Chan, Hugo Silva, Sungsu Lim, Tadashi Kozuno, A. Rupam Mahmood, Martha White

* Submitted to JMLR 


Predictive Representation Learning for Language Modeling

May 29, 2021
Qingfeng Lan, Luke Kumar, Martha White, Alona Fyshe


A Generalized Projected Bellman Error for Off-policy Value Estimation in Reinforcement Learning

Apr 28, 2021
Andrew Patterson, Adam White, Sina Ghiassian, Martha White


Scalable Online Recurrent Learning Using Columnar Neural Networks

Mar 09, 2021
Khurram Javed, Martha White, Rich Sutton

* Structural credit-assignment, scalable recurrent learning, scalable meta-learning, backward view credit-assignment 


Perspectives on Sim2Real Transfer for Robotics: A Summary of the R:SS 2020 Workshop

Dec 07, 2020
Sebastian Höfer, Kostas Bekris, Ankur Handa, Juan Camilo Gamboa, Florian Golemo, Melissa Mozifian, Chris Atkeson, Dieter Fox, Ken Goldberg, John Leonard, C. Karen Liu, Jan Peters, Shuran Song, Peter Welinder, Martha White

* Summary of the "2nd Workshop on Closing the Reality Gap in Sim2Real Transfer for Robotics" held in conjunction with "Robotics: Science and Systems 2020". Website: 


Towards Safe Policy Improvement for Non-Stationary MDPs

Oct 23, 2020
Yash Chandak, Scott M. Jordan, Georgios Theocharous, Martha White, Philip S. Thomas

* Thirty-fourth Conference on Neural Information Processing Systems (NeurIPS 2020) 


From Language to Language-ish: How Brain-Like is an LSTM's Representation of Nonsensical Language Stimuli?

Oct 14, 2020
Maryam Hashemzadeh, Greta Kaufeld, Martha White, Andrea E. Martin, Alona Fyshe

* 12 pages 


Beyond Prioritized Replay: Sampling States in Model-Based RL via Simulated Priorities

Jul 19, 2020
Jincheng Mei, Yangchen Pan, Martha White, Amir-massoud Farahmand, Hengshuai Yao

* The paper is under review 


Towards a practical measure of interference for reinforcement learning

Jul 07, 2020
Vincent Liu, Adam White, Hengshuai Yao, Martha White

* 18 pages 


Gradient Temporal-Difference Learning with Regularized Corrections

Jul 07, 2020
Sina Ghiassian, Andrew Patterson, Shivam Garg, Dhawal Gupta, Adam White, Martha White

* 22 pages. Accepted to ICML 2020 


Selective Dyna-style Planning Under Limited Model Capacity

Jul 05, 2020
Muhammad Zaheer, Samuel Sokota, Erin J. Talvitie, Martha White

* Accepted at ICML 2020 


Learning Causal Models Online

Jun 12, 2020
Khurram Javed, Martha White, Yoshua Bengio

* Spurious features, causal models, online learning, random search, non-iid 


Hallucinating Value: A Pitfall of Dyna-style Planning with Imperfect Environment Models

Jun 08, 2020
Taher Jafferjee, Ehsan Imani, Erin Talvitie, Martha White, Michael Bowling

* 9 pages, 7 figures 


Optimizing for the Future in Non-Stationary MDPs

Jun 02, 2020
Yash Chandak, Georgios Theocharous, Shiv Shankar, Martha White, Sridhar Mahadevan, Philip S. Thomas

* Thirty-seventh International Conference on Machine Learning (ICML 2020) 


Maximizing Information Gain in Partially Observable Environments via Prediction Reward

May 11, 2020
Yash Satsangi, Sungsu Lim, Shimon Whiteson, Frans Oliehoek, Martha White

* AAMAS 2020 


Maxmin Q-learning: Controlling the Estimation Bias of Q-learning

Feb 16, 2020
Qingfeng Lan, Yangchen Pan, Alona Fyshe, Martha White


An implicit function learning approach for parametric modal regression

Feb 14, 2020
Yangchen Pan, Ehsan Imani, Martha White, Amir-massoud Farahmand


Is Fast Adaptation All You Need?

Oct 03, 2019
Khurram Javed, Hengshuai Yao, Martha White

* Meta Learning Workshop, NeurIPS 2019, 2 figures, MRCL, MAML 


Meta-descent for Online, Continual Prediction

Jul 17, 2019
Andrew Jacobsen, Matthew Schlegel, Cameron Linke, Thomas Degris, Adam White, Martha White

* AAAI Conference on Artificial Intelligence 2019 


Hill Climbing on Value Estimates for Search-control in Dyna

Jul 04, 2019
Yangchen Pan, Hengshuai Yao, Amir-massoud Farahmand, Martha White

* IJCAI 2019 


Adapting Behaviour via Intrinsic Reward: A Survey and Empirical Study

Jun 19, 2019
Cam Linke, Nadia M. Ady, Martha White, Thomas Degris, Adam White


Importance Resampling for Off-policy Prediction

Jun 11, 2019
Matthew Schlegel, Wesley Chung, Daniel Graves, Jian Qian, Martha White


Meta-Learning Representations for Continual Learning

May 29, 2019
Khurram Javed, Martha White

* 14 pages, 9 figures, open-source, representation learning, continual learning, online learning, under review 


Planning with Expectation Models

Apr 03, 2019
Yi Wan, Muhammad Zaheer, Adam White, Martha White, Richard S. Sutton


Accelerating Large Scale Knowledge Distillation via Dynamic Importance Sampling

Dec 03, 2018
Minghan Li, Tanli Zuo, Ruicheng Li, Martha White, Weishi Zheng


An Off-policy Policy Gradient Theorem Using Emphatic Weightings

Nov 22, 2018
Ehsan Imani, Eric Graves, Martha White
