Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Patrick M. Pilarski

Document-editing Assistants and Model-based Reinforcement Learning as a Path to Conversational AI


Aug 27, 2020
Katya Kudashkina, Patrick M. Pilarski, Richard S. Sutton

* Currently under review 

  Access Paper or Ask Questions

What's a Good Prediction? Issues in Evaluating General Value Functions Through Error


Jan 23, 2020
Alex Kearney, Anna Koop, Patrick M. Pilarski

* Submitted to AAMAS 

  Access Paper or Ask Questions

Gamma-Nets: Generalizing Value Estimation over Timescale


Nov 23, 2019
Craig Sherstan, Shibhansh Dohare, James MacGlashan, Johannes G眉nther, Patrick M. Pilarski

* accepted AAAI 2020 

  Access Paper or Ask Questions

Examining the Use of Temporal-Difference Incremental Delta-Bar-Delta for Real-World Predictive Knowledge Architectures


Aug 15, 2019
Johannes G眉nther, Nadia M. Ady, Alex Kearney, Michael R. Dawson, Patrick M. Pilarski


  Access Paper or Ask Questions

General Dynamic Neural Networks for explainable PID parameter tuning in control engineering: An extensive comparison


May 30, 2019
Johannes G眉nther, Elias Reichensd枚rfer, Patrick M. Pilarski, Klaus Diepold


  Access Paper or Ask Questions

Learned human-agent decision-making, communication and joint action in a virtual reality environment


May 07, 2019
Patrick M. Pilarski, Andrew Butcher, Michael Johanson, Matthew M. Botvinick, Andrew Bolt, Adam S. R. Parker

* 5 pages, 3 figures. Accepted to The 4th Multidisciplinary Conference on Reinforcement Learning and Decision Making, July 7-10, 2019, McGill University, Montreal, Quebec, Canada 

  Access Paper or Ask Questions

When is a Prediction Knowledge?


Apr 18, 2019
Alex Kearney, Patrick M. Pilarski

* Accepted to RLDM 2019 

  Access Paper or Ask Questions

Learning Gentle Object Manipulation with Curiosity-Driven Deep Reinforcement Learning


Mar 20, 2019
Sandy H. Huang, Martina Zambelli, Jackie Kay, Murilo F. Martins, Yuval Tassa, Patrick M. Pilarski, Raia Hadsell


  Access Paper or Ask Questions

Learning Feature Relevance Through Step Size Adaptation in Temporal-Difference Learning


Mar 08, 2019
Alex Kearney, Vivek Veeriah, Jaden Travnik, Patrick M. Pilarski, Richard S. Sutton


  Access Paper or Ask Questions

TIDBD: Adapting Temporal-difference Step-sizes Through Stochastic Meta-descent


Apr 10, 2018
Alex Kearney, Vivek Veeriah, Jaden B. Travnik, Richard S. Sutton, Patrick M. Pilarski

* Version as submitted to the 31st Conference on Neural Information Processing Systems (NIPS 2017) on May 19, 2017. 9 pages, 5 figures. Extended version in preparation for journal submission 

  Access Paper or Ask Questions

Accelerating Learning in Constructive Predictive Frameworks with the Successor Representation


Mar 23, 2018
Craig Sherstan, Marlos C. Machado, Patrick M. Pilarski


  Access Paper or Ask Questions

Reactive Reinforcement Learning in Asynchronous Environments


Feb 16, 2018
Jaden B. Travnik, Kory W. Mathewson, Richard S. Sutton, Patrick M. Pilarski

* 11 pages, 7 figures, currently under journal peer review 

  Access Paper or Ask Questions

Communicative Capital for Prosthetic Agents


Nov 10, 2017
Patrick M. Pilarski, Richard S. Sutton, Kory W. Mathewson, Craig Sherstan, Adam S. R. Parker, Ann L. Edwards

* 33 pages, 10 figures; unpublished technical report undergoing peer review 

  Access Paper or Ask Questions

Actor-Critic Reinforcement Learning with Simultaneous Human Control and Feedback


Mar 15, 2017
Kory W. Mathewson, Patrick M. Pilarski

* 10 pages, 2 pages of references, 8 figures. Under review for the 34th International Conference on Machine Learning, Sydney, Australia, 2017. Copyright 2017 by the authors 

  Access Paper or Ask Questions

Reinforcement Learning based Embodied Agents Modelling Human Users Through Interaction and Multi-Sensory Perception


Jan 26, 2017
Kory W. Mathewson, Patrick M. Pilarski

* 4 pages, 2 figures, Accepted at the 2017 AAAI Spring Symposium on Interactive Multi-Sensory Object Perception for Embodied Agents 

  Access Paper or Ask Questions

True Online Temporal-Difference Learning


Sep 08, 2016
Harm van Seijen, A. Rupam Mahmood, Patrick M. Pilarski, Marlos C. Machado, Richard S. Sutton

* Journal of Machine Learning Research (JMLR), 17(145):1-40, 2016 
* This is the published JMLR version. It is a much improved version. The main changes are: 1) re-structuring of the article; 2) additional analysis on the forward view; 3) empirical comparison of traditional and new forward view; 4) added discussion of other true online papers; 5) updated discussion for non-linear function approximation 

  Access Paper or Ask Questions

Simultaneous Control and Human Feedback in the Training of a Robotic Agent with Actor-Critic Reinforcement Learning


Jun 22, 2016
Kory W. Mathewson, Patrick M. Pilarski

* 7 pages, 3 figures, Accepted at the Interactive Machine Learning Workshop at IJCAI 2016 (IML): Connecting Humans and Machines 

  Access Paper or Ask Questions

Introspective Agents: Confidence Measures for General Value Functions


Jun 17, 2016
Craig Sherstan, Adam White, Marlos C. Machado, Patrick M. Pilarski

* Accepted for presentation at the Ninth Conference on Artificial General Intelligence (AGI 2016), 4 pages, 1 figure 

  Access Paper or Ask Questions

Face valuing: Training user interfaces with facial expressions and reinforcement learning


Jun 09, 2016
Vivek Veeriah, Patrick M. Pilarski, Richard S. Sutton

* 7 pages, 4 figures, IJCAI 2016 - Interactive Machine Learning Workshop 

  Access Paper or Ask Questions

An Empirical Evaluation of True Online TD(位)


Jul 01, 2015
Harm van Seijen, A. Rupam Mahmood, Patrick M. Pilarski, Richard S. Sutton

* European Workshop on Reinforcement Learning (EWRL) 2015 

  Access Paper or Ask Questions

Using Learned Predictions as Feedback to Improve Control and Communication with an Artificial Limb: Preliminary Findings


Aug 08, 2014
Adam S. R. Parker, Ann L. Edwards, Patrick M. Pilarski

* 7 pages, 5 figures 

  Access Paper or Ask Questions

Temporal-Difference Learning to Assist Human Decision Making during the Control of an Artificial Limb


Sep 18, 2013
Ann L. Edwards, Alexandra Kearney, Michael Rory Dawson, Richard S. Sutton, Patrick M. Pilarski

* 5 pages, 4 figures, This version to appear at The 1st Multidisciplinary Conference on Reinforcement Learning and Decision Making, Princeton, NJ, USA, Oct. 25-27, 2013 

  Access Paper or Ask Questions