Satinder Singh

GrASP: Gradient-Based Affordance Selection for Planning

Feb 08, 2022
Vivek Veeriah, Zeyu Zheng, Richard Lewis, Satinder Singh

On the Expressivity of Markov Reward

Nov 01, 2021
David Abel, Will Dabney, Anna Harutyunyan, Mark K. Ho, Michael L. Littman, Doina Precup, Satinder Singh

Learning to Learn End-to-End Goal-Oriented Dialog From Related Dialog Tasks

Oct 10, 2021
Janarthanan Rajendran, Jonathan K. Kummerfeld, Satinder Singh

Bootstrapped Meta-Learning

Sep 09, 2021
Sebastian Flennerhag, Yannick Schroecker, Tom Zahavy, Hado van Hasselt, David Silver, Satinder Singh

Proper Value Equivalence

Jun 18, 2021
Christopher Grimm, André Barreto, Gregory Farquhar, David Silver, Satinder Singh

Discovering Diverse Nearly Optimal Policies with Successor Features

Jun 01, 2021
Tom Zahavy, Brendan O'Donoghue, Andre Barreto, Volodymyr Mnih, Sebastian Flennerhag, Satinder Singh

Reward is enough for convex MDPs

Jun 01, 2021
Tom Zahavy, Brendan O'Donoghue, Guillaume Desjardins, Satinder Singh

Reinforcement Learning of Implicit and Explicit Control Flow in Instructions

Feb 25, 2021
Ethan A. Brooks, Janarthanan Rajendran, Richard L. Lewis, Satinder Singh

Discovery of Options via Meta-Learned Subgoals

Feb 12, 2021
Vivek Veeriah, Tom Zahavy, Matteo Hessel, Zhongwen Xu, Junhyuk Oh, Iurii Kemaev, Hado van Hasselt, David Silver, Satinder Singh

Pairwise Weights for Temporal Credit Assignment

Feb 09, 2021
Zeyu Zheng, Risto Vuorio, Richard Lewis, Satinder Singh
