Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Satinder Singh

Reinforcement Learning of Implicit and Explicit Control Flow in Instructions


Feb 25, 2021
Ethan A. Brooks, Janarthanan Rajendran, Richard L. Lewis, Satinder Singh


  Access Paper or Ask Questions

Discovery of Options via Meta-Learned Subgoals


Feb 12, 2021
Vivek Veeriah, Tom Zahavy, Matteo Hessel, Zhongwen Xu, Junhyuk Oh, Iurii Kemaev, Hado van Hasselt, David Silver, Satinder Singh


  Access Paper or Ask Questions

Pairwise Weights for Temporal Credit Assignment


Feb 09, 2021
Zeyu Zheng, Risto Vuorio, Richard Lewis, Satinder Singh

* The first two authors contributed equally 

  Access Paper or Ask Questions

Learning State Representations from Random Deep Action-conditional Predictions


Feb 09, 2021
Zeyu Zheng, Vivek Veeriah, Risto Vuorio, Richard Lewis, Satinder Singh


  Access Paper or Ask Questions

Efficient Querying for Cooperative Probabilistic Commitments


Dec 14, 2020
Qi Zhang, Edmund H. Durfee, Satinder Singh


  Access Paper or Ask Questions

The Value Equivalence Principle for Model-Based Reinforcement Learning


Nov 06, 2020
Christopher Grimm, André Barreto, Satinder Singh, David Silver

* NeurIPS-2020 

  Access Paper or Ask Questions

Reinforcement Learning for Sparse-Reward Object-Interaction Tasks in First-person Simulated 3D Environments


Oct 28, 2020
Wilka Carvalho, Anthony Liang, Kimin Lee, Sungryull Sohn, Honglak Lee, Richard L. Lewis, Satinder Singh


  Access Paper or Ask Questions

Discovering Reinforcement Learning Algorithms


Jul 17, 2020
Junhyuk Oh, Matteo Hessel, Wojciech M. Czarnecki, Zhongwen Xu, Hado van Hasselt, Satinder Singh, David Silver


  Access Paper or Ask Questions

Meta-Gradient Reinforcement Learning with an Objective Discovered Online


Jul 16, 2020
Zhongwen Xu, Hado van Hasselt, Matteo Hessel, Junhyuk Oh, Satinder Singh, David Silver


  Access Paper or Ask Questions

Learning to Play No-Press Diplomacy with Best Response Policy Iteration


Jun 17, 2020
Thomas Anthony, Tom Eccles, Andrea Tacchetti, János Kramár, Ian Gemp, Thomas C. Hudson, Nicolas Porcel, Marc Lanctot, Julien Pérolat, Richard Everett, Satinder Singh, Thore Graepel, Yoram Bachrach


  Access Paper or Ask Questions

Self-Tuning Deep Reinforcement Learning


Mar 02, 2020
Tom Zahavy, Zhongwen Xu, Vivek Veeriah, Matteo Hessel, Junhyuk Oh, Hado van Hasselt, David Silver, Satinder Singh


  Access Paper or Ask Questions

How Should an Agent Practice?


Dec 15, 2019
Janarthanan Rajendran, Richard Lewis, Vivek Veeriah, Honglak Lee, Satinder Singh

* AAAI-2020 

  Access Paper or Ask Questions

What Can Learned Intrinsic Rewards Capture?


Dec 11, 2019
Zeyu Zheng, Junhyuk Oh, Matteo Hessel, Zhongwen Xu, Manuel Kroiss, Hado van Hasselt, David Silver, Satinder Singh


  Access Paper or Ask Questions

Hindsight Credit Assignment


Dec 05, 2019
Anna Harutyunyan, Will Dabney, Thomas Mesnard, Mohammad Azar, Bilal Piot, Nicolas Heess, Hado van Hasselt, Greg Wayne, Satinder Singh, Doina Precup, Remi Munos

* NeurIPS 2019 

  Access Paper or Ask Questions

Deep Reinforcement Learning for Multi-Driver Vehicle Dispatching and Repositioning Problem


Nov 25, 2019
John Holler, Risto Vuorio, Zhiwei Qin, Xiaocheng Tang, Yan Jiao, Tiancheng Jin, Satinder Singh, Chenxi Wang, Jieping Ye

* ICDM 2019 Short Paper 

  Access Paper or Ask Questions

Disentangled Cumulants Help Successor Representations Transfer to New Tasks


Nov 25, 2019
Christopher Grimm, Irina Higgins, Andre Barreto, Denis Teplyashin, Markus Wulfmeier, Tim Hertweck, Raia Hadsell, Satinder Singh


  Access Paper or Ask Questions

Object-oriented state editing for HRL


Oct 31, 2019
Victor Bapst, Alvaro Sanchez-Gonzalez, Omar Shams, Kimberly Stachenfeld, Peter W. Battaglia, Satinder Singh, Jessica B. Hamrick

* 8 pages; accepted to the Perception as Generative Reasoning workshop of the 33rd Conference on Neural InformationProcessing Systems (NeurIPS 2019) 

  Access Paper or Ask Questions

Sample Complexity of Reinforcement Learning using Linearly Combined Model Ensembles


Oct 23, 2019
Aditya Modi, Nan Jiang, Ambuj Tewari, Satinder Singh


  Access Paper or Ask Questions

Discovery of Useful Questions as Auxiliary Tasks


Sep 10, 2019
Vivek Veeriah, Matteo Hessel, Zhongwen Xu, Richard Lewis, Janarthanan Rajendran, Junhyuk Oh, Hado van Hasselt, David Silver, Satinder Singh


  Access Paper or Ask Questions

No Press Diplomacy: Modeling Multi-Agent Gameplay


Sep 04, 2019
Philip Paquette, Yuchen Lu, Steven Bocco, Max O. Smith, Satya Ortiz-Gagne, Jonathan K. Kummerfeld, Satinder Singh, Joelle Pineau, Aaron Courville

* Accepted at NeurIPS 2019 

  Access Paper or Ask Questions

Behaviour Suite for Reinforcement Learning


Aug 13, 2019
Ian Osband, Yotam Doron, Matteo Hessel, John Aslanides, Eren Sezener, Andre Saraiva, Katrina McKinney, Tor Lattimore, Csaba Szepezvari, Satinder Singh, Benjamin Van Roy, Richard Sutton, David Silver, Hado Van Hasselt


  Access Paper or Ask Questions

Learning Independently-Obtainable Reward Functions


Jan 31, 2019
Christopher Grimm, Satinder Singh


  Access Paper or Ask Questions

Generative Adversarial Self-Imitation Learning


Dec 03, 2018
Yijie Guo, Junhyuk Oh, Satinder Singh, Honglak Lee


  Access Paper or Ask Questions

Learning End-to-End Goal-Oriented Dialog with Multiple Answers


Aug 24, 2018
Janarthanan Rajendran, Jatin Ganhotra, Satinder Singh, Lazaros Polymenakos

* EMNLP 2018. permuted-bAbI dialog tasks are available at - https://github.com/IBM/permuted-bAbI-dialog-tasks 

  Access Paper or Ask Questions

Many-Goals Reinforcement Learning


Jun 22, 2018
Vivek Veeriah, Junhyuk Oh, Satinder Singh


  Access Paper or Ask Questions

On Learning Intrinsic Rewards for Policy Gradient Methods


Jun 22, 2018
Zeyu Zheng, Junhyuk Oh, Satinder Singh


  Access Paper or Ask Questions