Alert button
Picture for Bilal Piot

Bilal Piot

Alert button

World Discovery Models

Add code
Bookmark button
Alert button
Mar 01, 2019
Mohammad Gheshlaghi Azar, Bilal Piot, Bernardo Avila Pires, Jean-Bastien Grill, Florent Altché, Rémi Munos

Figure 1 for World Discovery Models
Figure 2 for World Discovery Models
Figure 3 for World Discovery Models
Figure 4 for World Discovery Models
Viaarxiv icon

Neural Predictive Belief Representations

Add code
Bookmark button
Alert button
Nov 15, 2018
Zhaohan Daniel Guo, Mohammad Gheshlaghi Azar, Bilal Piot, Bernardo A. Pires, Toby Pohlen, Rémi Munos

Figure 1 for Neural Predictive Belief Representations
Figure 2 for Neural Predictive Belief Representations
Figure 3 for Neural Predictive Belief Representations
Figure 4 for Neural Predictive Belief Representations
Viaarxiv icon

Leveraging Demonstrations for Deep Reinforcement Learning on Robotics Problems with Sparse Rewards

Add code
Bookmark button
Alert button
Oct 08, 2018
Mel Vecerik, Todd Hester, Jonathan Scholz, Fumin Wang, Olivier Pietquin, Bilal Piot, Nicolas Heess, Thomas Rothörl, Thomas Lampe, Martin Riedmiller

Figure 1 for Leveraging Demonstrations for Deep Reinforcement Learning on Robotics Problems with Sparse Rewards
Figure 2 for Leveraging Demonstrations for Deep Reinforcement Learning on Robotics Problems with Sparse Rewards
Figure 3 for Leveraging Demonstrations for Deep Reinforcement Learning on Robotics Problems with Sparse Rewards
Figure 4 for Leveraging Demonstrations for Deep Reinforcement Learning on Robotics Problems with Sparse Rewards
Viaarxiv icon

Playing the Game of Universal Adversarial Perturbations

Add code
Bookmark button
Alert button
Sep 25, 2018
Julien Perolat, Mateusz Malinowski, Bilal Piot, Olivier Pietquin

Figure 1 for Playing the Game of Universal Adversarial Perturbations
Figure 2 for Playing the Game of Universal Adversarial Perturbations
Figure 3 for Playing the Game of Universal Adversarial Perturbations
Figure 4 for Playing the Game of Universal Adversarial Perturbations
Viaarxiv icon

The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 19, 2018
Audrunas Gruslys, Will Dabney, Mohammad Gheshlaghi Azar, Bilal Piot, Marc Bellemare, Remi Munos

Figure 1 for The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning
Figure 2 for The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning
Figure 3 for The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning
Figure 4 for The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning
Viaarxiv icon

Observe and Look Further: Achieving Consistent Performance on Atari

Add code
Bookmark button
Alert button
May 29, 2018
Tobias Pohlen, Bilal Piot, Todd Hester, Mohammad Gheshlaghi Azar, Dan Horgan, David Budden, Gabriel Barth-Maron, Hado van Hasselt, John Quan, Mel Večerík, Matteo Hessel, Rémi Munos, Olivier Pietquin

Figure 1 for Observe and Look Further: Achieving Consistent Performance on Atari
Figure 2 for Observe and Look Further: Achieving Consistent Performance on Atari
Figure 3 for Observe and Look Further: Achieving Consistent Performance on Atari
Figure 4 for Observe and Look Further: Achieving Consistent Performance on Atari
Viaarxiv icon

Noisy Networks for Exploration

Add code
Bookmark button
Alert button
Feb 15, 2018
Meire Fortunato, Mohammad Gheshlaghi Azar, Bilal Piot, Jacob Menick, Ian Osband, Alex Graves, Vlad Mnih, Remi Munos, Demis Hassabis, Olivier Pietquin, Charles Blundell, Shane Legg

Figure 1 for Noisy Networks for Exploration
Figure 2 for Noisy Networks for Exploration
Figure 3 for Noisy Networks for Exploration
Figure 4 for Noisy Networks for Exploration
Viaarxiv icon

Is the Bellman residual a bad proxy?

Add code
Bookmark button
Alert button
Dec 12, 2017
Matthieu Geist, Bilal Piot, Olivier Pietquin

Figure 1 for Is the Bellman residual a bad proxy?
Figure 2 for Is the Bellman residual a bad proxy?
Figure 3 for Is the Bellman residual a bad proxy?
Figure 4 for Is the Bellman residual a bad proxy?
Viaarxiv icon

Deep Q-learning from Demonstrations

Add code
Bookmark button
Alert button
Nov 22, 2017
Todd Hester, Matej Vecerik, Olivier Pietquin, Marc Lanctot, Tom Schaul, Bilal Piot, Dan Horgan, John Quan, Andrew Sendonaris, Gabriel Dulac-Arnold, Ian Osband, John Agapiou, Joel Z. Leibo, Audrunas Gruslys

Figure 1 for Deep Q-learning from Demonstrations
Figure 2 for Deep Q-learning from Demonstrations
Figure 3 for Deep Q-learning from Demonstrations
Viaarxiv icon