Picture for Steven Kapturowski

Steven Kapturowski

Transformers need glasses! Information over-squashing in language tasks

Add code
Jun 06, 2024
Viaarxiv icon

Offline Actor-Critic Reinforcement Learning Scales to Large Models

Add code
Feb 08, 2024
Viaarxiv icon

Unlocking the Power of Representations in Long-term Novelty-based Exploration

Add code
May 02, 2023
Figure 1 for Unlocking the Power of Representations in Long-term Novelty-based Exploration
Figure 2 for Unlocking the Power of Representations in Long-term Novelty-based Exploration
Figure 3 for Unlocking the Power of Representations in Long-term Novelty-based Exploration
Figure 4 for Unlocking the Power of Representations in Long-term Novelty-based Exploration
Viaarxiv icon

Human-level Atari 200x faster

Add code
Sep 15, 2022
Figure 1 for Human-level Atari 200x faster
Figure 2 for Human-level Atari 200x faster
Figure 3 for Human-level Atari 200x faster
Figure 4 for Human-level Atari 200x faster
Viaarxiv icon

Revisiting Peng's Q for Modern Reinforcement Learning

Add code
Feb 27, 2021
Figure 1 for Revisiting Peng's Q for Modern Reinforcement Learning
Figure 2 for Revisiting Peng's Q for Modern Reinforcement Learning
Figure 3 for Revisiting Peng's Q for Modern Reinforcement Learning
Figure 4 for Revisiting Peng's Q for Modern Reinforcement Learning
Viaarxiv icon

Coverage as a Principle for Discovering Transferable Behavior in Reinforcement Learning

Add code
Feb 24, 2021
Figure 1 for Coverage as a Principle for Discovering Transferable Behavior in Reinforcement Learning
Figure 2 for Coverage as a Principle for Discovering Transferable Behavior in Reinforcement Learning
Figure 3 for Coverage as a Principle for Discovering Transferable Behavior in Reinforcement Learning
Figure 4 for Coverage as a Principle for Discovering Transferable Behavior in Reinforcement Learning
Viaarxiv icon

Temporal Difference Uncertainties as a Signal for Exploration

Add code
Oct 05, 2020
Figure 1 for Temporal Difference Uncertainties as a Signal for Exploration
Figure 2 for Temporal Difference Uncertainties as a Signal for Exploration
Figure 3 for Temporal Difference Uncertainties as a Signal for Exploration
Figure 4 for Temporal Difference Uncertainties as a Signal for Exploration
Viaarxiv icon

Agent57: Outperforming the Atari Human Benchmark

Add code
Mar 30, 2020
Figure 1 for Agent57: Outperforming the Atari Human Benchmark
Figure 2 for Agent57: Outperforming the Atari Human Benchmark
Figure 3 for Agent57: Outperforming the Atari Human Benchmark
Figure 4 for Agent57: Outperforming the Atari Human Benchmark
Viaarxiv icon

Value-driven Hindsight Modelling

Add code
Feb 19, 2020
Figure 1 for Value-driven Hindsight Modelling
Figure 2 for Value-driven Hindsight Modelling
Figure 3 for Value-driven Hindsight Modelling
Figure 4 for Value-driven Hindsight Modelling
Viaarxiv icon

Never Give Up: Learning Directed Exploration Strategies

Add code
Feb 14, 2020
Figure 1 for Never Give Up: Learning Directed Exploration Strategies
Figure 2 for Never Give Up: Learning Directed Exploration Strategies
Figure 3 for Never Give Up: Learning Directed Exploration Strategies
Figure 4 for Never Give Up: Learning Directed Exploration Strategies
Viaarxiv icon