Alert button
Picture for Mohammad Gheshlaghi Azar

Mohammad Gheshlaghi Azar

Alert button

Geometric Entropic Exploration

Add code
Bookmark button
Alert button
Jan 07, 2021
Zhaohan Daniel Guo, Mohammad Gheshlaghi Azar, Alaa Saade, Shantanu Thakoor, Bilal Piot, Bernardo Avila Pires, Michal Valko, Thomas Mesnard, Tor Lattimore, Rémi Munos

Figure 1 for Geometric Entropic Exploration
Figure 2 for Geometric Entropic Exploration
Figure 3 for Geometric Entropic Exploration
Figure 4 for Geometric Entropic Exploration
Viaarxiv icon

The Advantage Regret-Matching Actor-Critic

Add code
Bookmark button
Alert button
Aug 27, 2020
Audrūnas Gruslys, Marc Lanctot, Rémi Munos, Finbarr Timbers, Martin Schmid, Julien Perolat, Dustin Morrill, Vinicius Zambaldi, Jean-Baptiste Lespiau, John Schultz, Mohammad Gheshlaghi Azar, Michael Bowling, Karl Tuyls

Figure 1 for The Advantage Regret-Matching Actor-Critic
Figure 2 for The Advantage Regret-Matching Actor-Critic
Figure 3 for The Advantage Regret-Matching Actor-Critic
Figure 4 for The Advantage Regret-Matching Actor-Critic
Viaarxiv icon

Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning

Add code
Bookmark button
Alert button
Jun 13, 2020
Jean-Bastien Grill, Florian Strub, Florent Altché, Corentin Tallec, Pierre H. Richemond, Elena Buchatskaya, Carl Doersch, Bernardo Avila Pires, Zhaohan Daniel Guo, Mohammad Gheshlaghi Azar, Bilal Piot, Koray Kavukcuoglu, Rémi Munos, Michal Valko

Figure 1 for Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning
Figure 2 for Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning
Figure 3 for Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning
Figure 4 for Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning
Viaarxiv icon

Bootstrap Latent-Predictive Representations for Multitask Reinforcement Learning

Add code
Bookmark button
Alert button
Apr 30, 2020
Daniel Guo, Bernardo Avila Pires, Bilal Piot, Jean-bastien Grill, Florent Altché, Rémi Munos, Mohammad Gheshlaghi Azar

Figure 1 for Bootstrap Latent-Predictive Representations for Multitask Reinforcement Learning
Figure 2 for Bootstrap Latent-Predictive Representations for Multitask Reinforcement Learning
Figure 3 for Bootstrap Latent-Predictive Representations for Multitask Reinforcement Learning
Figure 4 for Bootstrap Latent-Predictive Representations for Multitask Reinforcement Learning
Viaarxiv icon

World Discovery Models

Add code
Bookmark button
Alert button
Mar 01, 2019
Mohammad Gheshlaghi Azar, Bilal Piot, Bernardo Avila Pires, Jean-Bastien Grill, Florent Altché, Rémi Munos

Figure 1 for World Discovery Models
Figure 2 for World Discovery Models
Figure 3 for World Discovery Models
Figure 4 for World Discovery Models
Viaarxiv icon

Neural Predictive Belief Representations

Add code
Bookmark button
Alert button
Nov 15, 2018
Zhaohan Daniel Guo, Mohammad Gheshlaghi Azar, Bilal Piot, Bernardo A. Pires, Toby Pohlen, Rémi Munos

Figure 1 for Neural Predictive Belief Representations
Figure 2 for Neural Predictive Belief Representations
Figure 3 for Neural Predictive Belief Representations
Figure 4 for Neural Predictive Belief Representations
Viaarxiv icon

The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 19, 2018
Audrunas Gruslys, Will Dabney, Mohammad Gheshlaghi Azar, Bilal Piot, Marc Bellemare, Remi Munos

Figure 1 for The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning
Figure 2 for The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning
Figure 3 for The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning
Figure 4 for The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning
Viaarxiv icon

Observe and Look Further: Achieving Consistent Performance on Atari

Add code
Bookmark button
Alert button
May 29, 2018
Tobias Pohlen, Bilal Piot, Todd Hester, Mohammad Gheshlaghi Azar, Dan Horgan, David Budden, Gabriel Barth-Maron, Hado van Hasselt, John Quan, Mel Večerík, Matteo Hessel, Rémi Munos, Olivier Pietquin

Figure 1 for Observe and Look Further: Achieving Consistent Performance on Atari
Figure 2 for Observe and Look Further: Achieving Consistent Performance on Atari
Figure 3 for Observe and Look Further: Achieving Consistent Performance on Atari
Figure 4 for Observe and Look Further: Achieving Consistent Performance on Atari
Viaarxiv icon

Noisy Networks for Exploration

Add code
Bookmark button
Alert button
Feb 15, 2018
Meire Fortunato, Mohammad Gheshlaghi Azar, Bilal Piot, Jacob Menick, Ian Osband, Alex Graves, Vlad Mnih, Remi Munos, Demis Hassabis, Olivier Pietquin, Charles Blundell, Shane Legg

Figure 1 for Noisy Networks for Exploration
Figure 2 for Noisy Networks for Exploration
Figure 3 for Noisy Networks for Exploration
Figure 4 for Noisy Networks for Exploration
Viaarxiv icon