Alert button
Picture for Satinder Singh

Satinder Singh

Alert button

Genie: Generative Interactive Environments

Add code
Bookmark button
Alert button
Feb 23, 2024
Jake Bruce, Michael Dennis, Ashley Edwards, Jack Parker-Holder, Yuge Shi, Edward Hughes, Matthew Lai, Aditi Mavalankar, Richie Steigerwald, Chris Apps, Yusuf Aytar, Sarah Bechtle, Feryal Behbahani, Stephanie Chan, Nicolas Heess, Lucy Gonzalez, Simon Osindero, Sherjil Ozair, Scott Reed, Jingwei Zhang, Konrad Zolna, Jeff Clune, Nando de Freitas, Satinder Singh, Tim Rocktäschel

Viaarxiv icon

Combining Behaviors with the Successor Features Keyboard

Add code
Bookmark button
Alert button
Oct 24, 2023
Wilka Carvalho, Andre Saraiva, Angelos Filos, Andrew Kyle Lampinen, Loic Matthey, Richard L. Lewis, Honglak Lee, Satinder Singh, Danilo J. Rezende, Daniel Zoran

Figure 1 for Combining Behaviors with the Successor Features Keyboard
Figure 2 for Combining Behaviors with the Successor Features Keyboard
Figure 3 for Combining Behaviors with the Successor Features Keyboard
Figure 4 for Combining Behaviors with the Successor Features Keyboard
Viaarxiv icon

Diversifying AI: Towards Creative Chess with AlphaZero

Add code
Bookmark button
Alert button
Aug 29, 2023
Tom Zahavy, Vivek Veeriah, Shaobo Hou, Kevin Waugh, Matthew Lai, Edouard Leurent, Nenad Tomasev, Lisa Schut, Demis Hassabis, Satinder Singh

Figure 1 for Diversifying AI: Towards Creative Chess with AlphaZero
Figure 2 for Diversifying AI: Towards Creative Chess with AlphaZero
Figure 3 for Diversifying AI: Towards Creative Chess with AlphaZero
Figure 4 for Diversifying AI: Towards Creative Chess with AlphaZero
Viaarxiv icon

A Definition of Continual Reinforcement Learning

Add code
Bookmark button
Alert button
Jul 20, 2023
David Abel, André Barreto, Benjamin Van Roy, Doina Precup, Hado van Hasselt, Satinder Singh

Figure 1 for A Definition of Continual Reinforcement Learning
Figure 2 for A Definition of Continual Reinforcement Learning
Figure 3 for A Definition of Continual Reinforcement Learning
Viaarxiv icon

On the Convergence of Bounded Agents

Add code
Bookmark button
Alert button
Jul 20, 2023
David Abel, André Barreto, Hado van Hasselt, Benjamin Van Roy, Doina Precup, Satinder Singh

Figure 1 for On the Convergence of Bounded Agents
Viaarxiv icon

Structured State Space Models for In-Context Reinforcement Learning

Add code
Bookmark button
Alert button
Mar 09, 2023
Chris Lu, Yannick Schroecker, Albert Gu, Emilio Parisotto, Jakob Foerster, Satinder Singh, Feryal Behbahani

Figure 1 for Structured State Space Models for In-Context Reinforcement Learning
Figure 2 for Structured State Space Models for In-Context Reinforcement Learning
Figure 3 for Structured State Space Models for In-Context Reinforcement Learning
Figure 4 for Structured State Space Models for In-Context Reinforcement Learning
Viaarxiv icon

Hierarchical Reinforcement Learning in Complex 3D Environments

Add code
Bookmark button
Alert button
Feb 28, 2023
Bernardo Avila Pires, Feryal Behbahani, Hubert Soyer, Kyriacos Nikiforou, Thomas Keck, Satinder Singh

Figure 1 for Hierarchical Reinforcement Learning in Complex 3D Environments
Figure 2 for Hierarchical Reinforcement Learning in Complex 3D Environments
Figure 3 for Hierarchical Reinforcement Learning in Complex 3D Environments
Figure 4 for Hierarchical Reinforcement Learning in Complex 3D Environments
Viaarxiv icon

ReLOAD: Reinforcement Learning with Optimistic Ascent-Descent for Last-Iterate Convergence in Constrained MDPs

Add code
Bookmark button
Alert button
Feb 02, 2023
Ted Moskovitz, Brendan O'Donoghue, Vivek Veeriah, Sebastian Flennerhag, Satinder Singh, Tom Zahavy

Figure 1 for ReLOAD: Reinforcement Learning with Optimistic Ascent-Descent for Last-Iterate Convergence in Constrained MDPs
Figure 2 for ReLOAD: Reinforcement Learning with Optimistic Ascent-Descent for Last-Iterate Convergence in Constrained MDPs
Figure 3 for ReLOAD: Reinforcement Learning with Optimistic Ascent-Descent for Last-Iterate Convergence in Constrained MDPs
Figure 4 for ReLOAD: Reinforcement Learning with Optimistic Ascent-Descent for Last-Iterate Convergence in Constrained MDPs
Viaarxiv icon

Composing Task Knowledge with Modular Successor Feature Approximators

Add code
Bookmark button
Alert button
Jan 28, 2023
Wilka Carvalho, Angelos Filos, Richard L. Lewis, Honglak lee, Satinder Singh

Figure 1 for Composing Task Knowledge with Modular Successor Feature Approximators
Figure 2 for Composing Task Knowledge with Modular Successor Feature Approximators
Figure 3 for Composing Task Knowledge with Modular Successor Feature Approximators
Figure 4 for Composing Task Knowledge with Modular Successor Feature Approximators
Viaarxiv icon