Picture for Bilal Piot

Bilal Piot

Dima

Acme: A Research Framework for Distributed Reinforcement Learning

Add code
Jun 01, 2020
Figure 1 for Acme: A Research Framework for Distributed Reinforcement Learning
Figure 2 for Acme: A Research Framework for Distributed Reinforcement Learning
Figure 3 for Acme: A Research Framework for Distributed Reinforcement Learning
Figure 4 for Acme: A Research Framework for Distributed Reinforcement Learning
Viaarxiv icon

Bootstrap Latent-Predictive Representations for Multitask Reinforcement Learning

Add code
Apr 30, 2020
Figure 1 for Bootstrap Latent-Predictive Representations for Multitask Reinforcement Learning
Figure 2 for Bootstrap Latent-Predictive Representations for Multitask Reinforcement Learning
Figure 3 for Bootstrap Latent-Predictive Representations for Multitask Reinforcement Learning
Figure 4 for Bootstrap Latent-Predictive Representations for Multitask Reinforcement Learning
Viaarxiv icon

Agent57: Outperforming the Atari Human Benchmark

Add code
Mar 30, 2020
Figure 1 for Agent57: Outperforming the Atari Human Benchmark
Figure 2 for Agent57: Outperforming the Atari Human Benchmark
Figure 3 for Agent57: Outperforming the Atari Human Benchmark
Figure 4 for Agent57: Outperforming the Atari Human Benchmark
Viaarxiv icon

Never Give Up: Learning Directed Exploration Strategies

Add code
Feb 14, 2020
Figure 1 for Never Give Up: Learning Directed Exploration Strategies
Figure 2 for Never Give Up: Learning Directed Exploration Strategies
Figure 3 for Never Give Up: Learning Directed Exploration Strategies
Figure 4 for Never Give Up: Learning Directed Exploration Strategies
Viaarxiv icon

Hindsight Credit Assignment

Add code
Dec 05, 2019
Figure 1 for Hindsight Credit Assignment
Figure 2 for Hindsight Credit Assignment
Figure 3 for Hindsight Credit Assignment
Figure 4 for Hindsight Credit Assignment
Viaarxiv icon

World Discovery Models

Add code
Mar 01, 2019
Figure 1 for World Discovery Models
Figure 2 for World Discovery Models
Figure 3 for World Discovery Models
Figure 4 for World Discovery Models
Viaarxiv icon

Neural Predictive Belief Representations

Add code
Nov 15, 2018
Figure 1 for Neural Predictive Belief Representations
Figure 2 for Neural Predictive Belief Representations
Figure 3 for Neural Predictive Belief Representations
Figure 4 for Neural Predictive Belief Representations
Viaarxiv icon

Leveraging Demonstrations for Deep Reinforcement Learning on Robotics Problems with Sparse Rewards

Add code
Oct 08, 2018
Figure 1 for Leveraging Demonstrations for Deep Reinforcement Learning on Robotics Problems with Sparse Rewards
Figure 2 for Leveraging Demonstrations for Deep Reinforcement Learning on Robotics Problems with Sparse Rewards
Figure 3 for Leveraging Demonstrations for Deep Reinforcement Learning on Robotics Problems with Sparse Rewards
Figure 4 for Leveraging Demonstrations for Deep Reinforcement Learning on Robotics Problems with Sparse Rewards
Viaarxiv icon

Playing the Game of Universal Adversarial Perturbations

Add code
Sep 25, 2018
Figure 1 for Playing the Game of Universal Adversarial Perturbations
Figure 2 for Playing the Game of Universal Adversarial Perturbations
Figure 3 for Playing the Game of Universal Adversarial Perturbations
Figure 4 for Playing the Game of Universal Adversarial Perturbations
Viaarxiv icon

The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning

Add code
Jun 19, 2018
Figure 1 for The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning
Figure 2 for The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning
Figure 3 for The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning
Figure 4 for The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning
Viaarxiv icon