Alert button
Picture for Satinder Singh

Satinder Singh

Alert button

How Should an Agent Practice?

Add code
Bookmark button
Alert button
Dec 15, 2019
Janarthanan Rajendran, Richard Lewis, Vivek Veeriah, Honglak Lee, Satinder Singh

Figure 1 for How Should an Agent Practice?
Figure 2 for How Should an Agent Practice?
Figure 3 for How Should an Agent Practice?
Figure 4 for How Should an Agent Practice?
Viaarxiv icon

What Can Learned Intrinsic Rewards Capture?

Add code
Bookmark button
Alert button
Dec 11, 2019
Zeyu Zheng, Junhyuk Oh, Matteo Hessel, Zhongwen Xu, Manuel Kroiss, Hado van Hasselt, David Silver, Satinder Singh

Figure 1 for What Can Learned Intrinsic Rewards Capture?
Figure 2 for What Can Learned Intrinsic Rewards Capture?
Figure 3 for What Can Learned Intrinsic Rewards Capture?
Figure 4 for What Can Learned Intrinsic Rewards Capture?
Viaarxiv icon

Hindsight Credit Assignment

Add code
Bookmark button
Alert button
Dec 05, 2019
Anna Harutyunyan, Will Dabney, Thomas Mesnard, Mohammad Azar, Bilal Piot, Nicolas Heess, Hado van Hasselt, Greg Wayne, Satinder Singh, Doina Precup, Remi Munos

Figure 1 for Hindsight Credit Assignment
Figure 2 for Hindsight Credit Assignment
Figure 3 for Hindsight Credit Assignment
Figure 4 for Hindsight Credit Assignment
Viaarxiv icon

Deep Reinforcement Learning for Multi-Driver Vehicle Dispatching and Repositioning Problem

Add code
Bookmark button
Alert button
Nov 25, 2019
John Holler, Risto Vuorio, Zhiwei Qin, Xiaocheng Tang, Yan Jiao, Tiancheng Jin, Satinder Singh, Chenxi Wang, Jieping Ye

Figure 1 for Deep Reinforcement Learning for Multi-Driver Vehicle Dispatching and Repositioning Problem
Figure 2 for Deep Reinforcement Learning for Multi-Driver Vehicle Dispatching and Repositioning Problem
Figure 3 for Deep Reinforcement Learning for Multi-Driver Vehicle Dispatching and Repositioning Problem
Figure 4 for Deep Reinforcement Learning for Multi-Driver Vehicle Dispatching and Repositioning Problem
Viaarxiv icon

Disentangled Cumulants Help Successor Representations Transfer to New Tasks

Add code
Bookmark button
Alert button
Nov 25, 2019
Christopher Grimm, Irina Higgins, Andre Barreto, Denis Teplyashin, Markus Wulfmeier, Tim Hertweck, Raia Hadsell, Satinder Singh

Figure 1 for Disentangled Cumulants Help Successor Representations Transfer to New Tasks
Figure 2 for Disentangled Cumulants Help Successor Representations Transfer to New Tasks
Figure 3 for Disentangled Cumulants Help Successor Representations Transfer to New Tasks
Figure 4 for Disentangled Cumulants Help Successor Representations Transfer to New Tasks
Viaarxiv icon

Object-oriented state editing for HRL

Add code
Bookmark button
Alert button
Oct 31, 2019
Victor Bapst, Alvaro Sanchez-Gonzalez, Omar Shams, Kimberly Stachenfeld, Peter W. Battaglia, Satinder Singh, Jessica B. Hamrick

Figure 1 for Object-oriented state editing for HRL
Figure 2 for Object-oriented state editing for HRL
Figure 3 for Object-oriented state editing for HRL
Figure 4 for Object-oriented state editing for HRL
Viaarxiv icon

Sample Complexity of Reinforcement Learning using Linearly Combined Model Ensembles

Add code
Bookmark button
Alert button
Oct 23, 2019
Aditya Modi, Nan Jiang, Ambuj Tewari, Satinder Singh

Viaarxiv icon

Discovery of Useful Questions as Auxiliary Tasks

Add code
Bookmark button
Alert button
Sep 10, 2019
Vivek Veeriah, Matteo Hessel, Zhongwen Xu, Richard Lewis, Janarthanan Rajendran, Junhyuk Oh, Hado van Hasselt, David Silver, Satinder Singh

Figure 1 for Discovery of Useful Questions as Auxiliary Tasks
Figure 2 for Discovery of Useful Questions as Auxiliary Tasks
Figure 3 for Discovery of Useful Questions as Auxiliary Tasks
Figure 4 for Discovery of Useful Questions as Auxiliary Tasks
Viaarxiv icon

No Press Diplomacy: Modeling Multi-Agent Gameplay

Add code
Bookmark button
Alert button
Sep 04, 2019
Philip Paquette, Yuchen Lu, Steven Bocco, Max O. Smith, Satya Ortiz-Gagne, Jonathan K. Kummerfeld, Satinder Singh, Joelle Pineau, Aaron Courville

Figure 1 for No Press Diplomacy: Modeling Multi-Agent Gameplay
Figure 2 for No Press Diplomacy: Modeling Multi-Agent Gameplay
Figure 3 for No Press Diplomacy: Modeling Multi-Agent Gameplay
Figure 4 for No Press Diplomacy: Modeling Multi-Agent Gameplay
Viaarxiv icon

Behaviour Suite for Reinforcement Learning

Add code
Bookmark button
Alert button
Aug 13, 2019
Ian Osband, Yotam Doron, Matteo Hessel, John Aslanides, Eren Sezener, Andre Saraiva, Katrina McKinney, Tor Lattimore, Csaba Szepezvari, Satinder Singh, Benjamin Van Roy, Richard Sutton, David Silver, Hado Van Hasselt

Figure 1 for Behaviour Suite for Reinforcement Learning
Figure 2 for Behaviour Suite for Reinforcement Learning
Figure 3 for Behaviour Suite for Reinforcement Learning
Figure 4 for Behaviour Suite for Reinforcement Learning
Viaarxiv icon