Satinder Singh

Learning State Representations from Random Deep Action-conditional Predictions

Feb 09, 2021
Zeyu Zheng, Vivek Veeriah, Risto Vuorio, Richard Lewis, Satinder Singh

Efficient Querying for Cooperative Probabilistic Commitments

Dec 14, 2020
Qi Zhang, Edmund H. Durfee, Satinder Singh

The Value Equivalence Principle for Model-Based Reinforcement Learning

Nov 06, 2020
Christopher Grimm, André Barreto, Satinder Singh, David Silver

Reinforcement Learning for Sparse-Reward Object-Interaction Tasks in First-person Simulated 3D Environments

Oct 28, 2020
Wilka Carvalho, Anthony Liang, Kimin Lee, Sungryull Sohn, Honglak Lee, Richard L. Lewis, Satinder Singh

Discovering Reinforcement Learning Algorithms

Jul 17, 2020
Junhyuk Oh, Matteo Hessel, Wojciech M. Czarnecki, Zhongwen Xu, Hado van Hasselt, Satinder Singh, David Silver

Meta-Gradient Reinforcement Learning with an Objective Discovered Online

Jul 16, 2020
Zhongwen Xu, Hado van Hasselt, Matteo Hessel, Junhyuk Oh, Satinder Singh, David Silver

Learning to Play No-Press Diplomacy with Best Response Policy Iteration

Jun 17, 2020
Thomas Anthony, Tom Eccles, Andrea Tacchetti, János Kramár, Ian Gemp, Thomas C. Hudson, Nicolas Porcel, Marc Lanctot, Julien Pérolat, Richard Everett, Satinder Singh, Thore Graepel, Yoram Bachrach

Self-Tuning Deep Reinforcement Learning

Mar 02, 2020
Tom Zahavy, Zhongwen Xu, Vivek Veeriah, Matteo Hessel, Junhyuk Oh, Hado van Hasselt, David Silver, Satinder Singh
