Adrià Puigdomènech Badia

Gemini: A Family of Highly Capable Multimodal Models

Dec 19, 2023

Human-level Atari 200x faster

Sep 15, 2022

The CLRS Algorithmic Reasoning Benchmark

Jun 04, 2022

Coverage as a Principle for Discovering Transferable Behavior in Reinforcement Learning

Feb 24, 2021

Agent57: Outperforming the Atari Human Benchmark

Mar 30, 2020

Never Give Up: Learning Directed Exploration Strategies

Feb 14, 2020

MEMO: A Deep Network for Flexible Combination of Episodic Memories

Jan 29, 2020

Generalization of Reinforcement Learners with Working and Episodic Memory

Oct 29, 2019

Memory-based Parameter Adaptation

Feb 28, 2018

Asynchronous Methods for Deep Reinforcement Learning

Jun 16, 2016