Satinder Singh

Bootstrapped Meta-Learning


Sep 09, 2021
Sebastian Flennerhag, Yannick Schroecker, Tom Zahavy, Hado van Hasselt, David Silver, Satinder Singh

* 31 pages, 19 figures, 7 tables 


Proper Value Equivalence


Jun 18, 2021
Christopher Grimm, André Barreto, Gregory Farquhar, David Silver, Satinder Singh



Discovering Diverse Nearly Optimal Policies with Successor Features


Jun 01, 2021
Tom Zahavy, Brendan O'Donoghue, Andre Barreto, Volodymyr Mnih, Sebastian Flennerhag, Satinder Singh



Reward is enough for convex MDPs


Jun 01, 2021
Tom Zahavy, Brendan O'Donoghue, Guillaume Desjardins, Satinder Singh



Reinforcement Learning of Implicit and Explicit Control Flow in Instructions


Feb 25, 2021
Ethan A. Brooks, Janarthanan Rajendran, Richard L. Lewis, Satinder Singh



Discovery of Options via Meta-Learned Subgoals


Feb 12, 2021
Vivek Veeriah, Tom Zahavy, Matteo Hessel, Zhongwen Xu, Junhyuk Oh, Iurii Kemaev, Hado van Hasselt, David Silver, Satinder Singh



Pairwise Weights for Temporal Credit Assignment


Feb 09, 2021
Zeyu Zheng, Risto Vuorio, Richard Lewis, Satinder Singh

* The first two authors contributed equally 


Learning State Representations from Random Deep Action-conditional Predictions


Feb 09, 2021
Zeyu Zheng, Vivek Veeriah, Risto Vuorio, Richard Lewis, Satinder Singh



Efficient Querying for Cooperative Probabilistic Commitments


Dec 14, 2020
Qi Zhang, Edmund H. Durfee, Satinder Singh



The Value Equivalence Principle for Model-Based Reinforcement Learning


Nov 06, 2020
Christopher Grimm, André Barreto, Satinder Singh, David Silver

* NeurIPS-2020 


Reinforcement Learning for Sparse-Reward Object-Interaction Tasks in First-person Simulated 3D Environments


Oct 28, 2020
Wilka Carvalho, Anthony Liang, Kimin Lee, Sungryull Sohn, Honglak Lee, Richard L. Lewis, Satinder Singh



Discovering Reinforcement Learning Algorithms


Jul 17, 2020
Junhyuk Oh, Matteo Hessel, Wojciech M. Czarnecki, Zhongwen Xu, Hado van Hasselt, Satinder Singh, David Silver



Meta-Gradient Reinforcement Learning with an Objective Discovered Online


Jul 16, 2020
Zhongwen Xu, Hado van Hasselt, Matteo Hessel, Junhyuk Oh, Satinder Singh, David Silver



Learning to Play No-Press Diplomacy with Best Response Policy Iteration


Jun 17, 2020
Thomas Anthony, Tom Eccles, Andrea Tacchetti, János Kramár, Ian Gemp, Thomas C. Hudson, Nicolas Porcel, Marc Lanctot, Julien Pérolat, Richard Everett, Satinder Singh, Thore Graepel, Yoram Bachrach



Self-Tuning Deep Reinforcement Learning


Mar 02, 2020
Tom Zahavy, Zhongwen Xu, Vivek Veeriah, Matteo Hessel, Junhyuk Oh, Hado van Hasselt, David Silver, Satinder Singh



How Should an Agent Practice?


Dec 15, 2019
Janarthanan Rajendran, Richard Lewis, Vivek Veeriah, Honglak Lee, Satinder Singh

* AAAI-2020 


What Can Learned Intrinsic Rewards Capture?


Dec 11, 2019
Zeyu Zheng, Junhyuk Oh, Matteo Hessel, Zhongwen Xu, Manuel Kroiss, Hado van Hasselt, David Silver, Satinder Singh



Hindsight Credit Assignment


Dec 05, 2019
Anna Harutyunyan, Will Dabney, Thomas Mesnard, Mohammad Azar, Bilal Piot, Nicolas Heess, Hado van Hasselt, Greg Wayne, Satinder Singh, Doina Precup, Remi Munos

* NeurIPS 2019 


Deep Reinforcement Learning for Multi-Driver Vehicle Dispatching and Repositioning Problem


Nov 25, 2019
John Holler, Risto Vuorio, Zhiwei Qin, Xiaocheng Tang, Yan Jiao, Tiancheng Jin, Satinder Singh, Chenxi Wang, Jieping Ye

* ICDM 2019 Short Paper 


Disentangled Cumulants Help Successor Representations Transfer to New Tasks


Nov 25, 2019
Christopher Grimm, Irina Higgins, Andre Barreto, Denis Teplyashin, Markus Wulfmeier, Tim Hertweck, Raia Hadsell, Satinder Singh



Object-oriented state editing for HRL


Oct 31, 2019
Victor Bapst, Alvaro Sanchez-Gonzalez, Omar Shams, Kimberly Stachenfeld, Peter W. Battaglia, Satinder Singh, Jessica B. Hamrick

* 8 pages; accepted to the Perception as Generative Reasoning workshop of the 33rd Conference on Neural Information Processing Systems (NeurIPS 2019) 


Sample Complexity of Reinforcement Learning using Linearly Combined Model Ensembles


Oct 23, 2019
Aditya Modi, Nan Jiang, Ambuj Tewari, Satinder Singh



Discovery of Useful Questions as Auxiliary Tasks


Sep 10, 2019
Vivek Veeriah, Matteo Hessel, Zhongwen Xu, Richard Lewis, Janarthanan Rajendran, Junhyuk Oh, Hado van Hasselt, David Silver, Satinder Singh



No Press Diplomacy: Modeling Multi-Agent Gameplay


Sep 04, 2019
Philip Paquette, Yuchen Lu, Steven Bocco, Max O. Smith, Satya Ortiz-Gagne, Jonathan K. Kummerfeld, Satinder Singh, Joelle Pineau, Aaron Courville

* Accepted at NeurIPS 2019 


Behaviour Suite for Reinforcement Learning


Aug 13, 2019
Ian Osband, Yotam Doron, Matteo Hessel, John Aslanides, Eren Sezener, Andre Saraiva, Katrina McKinney, Tor Lattimore, Csaba Szepesvári, Satinder Singh, Benjamin Van Roy, Richard Sutton, David Silver, Hado Van Hasselt



Learning Independently-Obtainable Reward Functions


Jan 31, 2019
Christopher Grimm, Satinder Singh

