Steven Kapturowski

Offline Actor-Critic Reinforcement Learning Scales to Large Models

Feb 08, 2024
Jost Tobias Springenberg, Abbas Abdolmaleki, Jingwei Zhang, Oliver Groth, Michael Bloesch, Thomas Lampe, Philemon Brakel, Sarah Bechtle, Steven Kapturowski, Roland Hafner, Nicolas Heess, Martin Riedmiller

Unlocking the Power of Representations in Long-term Novelty-based Exploration

May 02, 2023
Alaa Saade, Steven Kapturowski, Daniele Calandriello, Charles Blundell, Pablo Sprechmann, Leopoldo Sarra, Oliver Groth, Michal Valko, Bilal Piot

Human-level Atari 200x faster

Sep 15, 2022
Steven Kapturowski, Víctor Campos, Ray Jiang, Nemanja Rakićević, Hado van Hasselt, Charles Blundell, Adrià Puigdomènech Badia

Revisiting Peng's Q($λ$) for Modern Reinforcement Learning

Feb 27, 2021
Tadashi Kozuno, Yunhao Tang, Mark Rowland, Rémi Munos, Steven Kapturowski, Will Dabney, Michal Valko, David Abel

Coverage as a Principle for Discovering Transferable Behavior in Reinforcement Learning

Feb 24, 2021
Víctor Campos, Pablo Sprechmann, Steven Hansen, Andre Barreto, Steven Kapturowski, Alex Vitvitskyi, Adrià Puigdomènech Badia, Charles Blundell

Temporal Difference Uncertainties as a Signal for Exploration

Oct 05, 2020
Sebastian Flennerhag, Jane X. Wang, Pablo Sprechmann, Francesco Visin, Alexandre Galashov, Steven Kapturowski, Diana L. Borsa, Nicolas Heess, Andre Barreto, Razvan Pascanu

Agent57: Outperforming the Atari Human Benchmark

Mar 30, 2020
Adrià Puigdomènech Badia, Bilal Piot, Steven Kapturowski, Pablo Sprechmann, Alex Vitvitskyi, Daniel Guo, Charles Blundell

Value-driven Hindsight Modelling

Feb 19, 2020
Arthur Guez, Fabio Viola, Théophane Weber, Lars Buesing, Steven Kapturowski, Doina Precup, David Silver, Nicolas Heess

Never Give Up: Learning Directed Exploration Strategies

Feb 14, 2020
Adrià Puigdomènech Badia, Pablo Sprechmann, Alex Vitvitskyi, Daniel Guo, Bilal Piot, Steven Kapturowski, Olivier Tieleman, Martín Arjovsky, Alexander Pritzel, Andrew Bolt, Charles Blundell

Making Efficient Use of Demonstrations to Solve Hard Exploration Problems

Sep 03, 2019
Tom Le Paine, Caglar Gulcehre, Bobak Shahriari, Misha Denil, Matt Hoffman, Hubert Soyer, Richard Tanburn, Steven Kapturowski, Neil Rabinowitz, Duncan Williams, Gabriel Barth-Maron, Ziyu Wang, Nando de Freitas, Worlds Team
