Get our free extension to see links to code for papers anywhere online!

 Add to Chrome

 Add to Firefox

CatalyzeX Code Finder - Browser extension linking code for ML papers across the web! | Product Hunt Embed
Mohammad Gheshlaghi Azar

Radboud University

The Advantage Regret-Matching Actor-Critic

Aug 27, 2020
Audrūnas Gruslys, Marc Lanctot, Rémi Munos, Finbarr Timbers, Martin Schmid, Julien Perolat, Dustin Morrill, Vinicius Zambaldi, Jean-Baptiste Lespiau, John Schultz, Mohammad Gheshlaghi Azar, Michael Bowling, Karl Tuyls


  Access Paper or Ask Questions

Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning

Jun 13, 2020
Jean-Bastien Grill, Florian Strub, Florent Altché, Corentin Tallec, Pierre H. Richemond, Elena Buchatskaya, Carl Doersch, Bernardo Avila Pires, Zhaohan Daniel Guo, Mohammad Gheshlaghi Azar, Bilal Piot, Koray Kavukcuoglu, Rémi Munos, Michal Valko


  Access Paper or Ask Questions

Bootstrap Latent-Predictive Representations for Multitask Reinforcement Learning

Apr 30, 2020
Daniel Guo, Bernardo Avila Pires, Bilal Piot, Jean-bastien Grill, Florent Altché, Rémi Munos, Mohammad Gheshlaghi Azar


  Access Paper or Ask Questions

World Discovery Models

Mar 01, 2019
Mohammad Gheshlaghi Azar, Bilal Piot, Bernardo Avila Pires, Jean-Bastien Grill, Florent Altché, Rémi Munos


  Access Paper or Ask Questions

Neural Predictive Belief Representations

Nov 15, 2018
Zhaohan Daniel Guo, Mohammad Gheshlaghi Azar, Bilal Piot, Bernardo A. Pires, Toby Pohlen, Rémi Munos


  Access Paper or Ask Questions

Observe and Look Further: Achieving Consistent Performance on Atari

May 29, 2018
Tobias Pohlen, Bilal Piot, Todd Hester, Mohammad Gheshlaghi Azar, Dan Horgan, David Budden, Gabriel Barth-Maron, Hado van Hasselt, John Quan, Mel Večerík, Matteo Hessel, Rémi Munos, Olivier Pietquin


  Access Paper or Ask Questions

Noisy Networks for Exploration

Feb 15, 2018
Meire Fortunato, Mohammad Gheshlaghi Azar, Bilal Piot, Jacob Menick, Ian Osband, Alex Graves, Vlad Mnih, Remi Munos, Demis Hassabis, Olivier Pietquin, Charles Blundell, Shane Legg

* ICLR 2018 

  Access Paper or Ask Questions

Minimax Regret Bounds for Reinforcement Learning

Jul 01, 2017
Mohammad Gheshlaghi Azar, Ian Osband, Rémi Munos


  Access Paper or Ask Questions

Convex Relaxation Regression: Black-Box Optimization of Smooth Functions by Learning Their Convex Envelopes

Mar 03, 2016
Mohammad Gheshlaghi Azar, Eva Dyer, Konrad Kording

* Proc. of the Conference on Uncertainty in Artificial Intelligence, pg. 22-31, 2016 

  Access Paper or Ask Questions

Online Stochastic Optimization under Correlated Bandit Feedback

May 19, 2014
Mohammad Gheshlaghi Azar, Alessandro Lazaric, Emma Brunskill


  Access Paper or Ask Questions

Sequential Transfer in Multi-armed Bandit with Finite Set of Models

Jul 25, 2013
Mohammad Gheshlaghi Azar, Alessandro Lazaric, Emma Brunskill


  Access Paper or Ask Questions

Regret Bounds for Reinforcement Learning with Policy Advice

Jul 17, 2013
Mohammad Gheshlaghi Azar, Alessandro Lazaric, Emma Brunskill


  Access Paper or Ask Questions

On the Sample Complexity of Reinforcement Learning with a Generative Model

Jun 27, 2012
Mohammad Gheshlaghi Azar, Remi Munos, Bert Kappen

* Appears in Proceedings of the 29th International Conference on Machine Learning (ICML 2012) 

  Access Paper or Ask Questions

Dynamic Policy Programming

Sep 06, 2011
Mohammad Gheshlaghi Azar, Vicenc Gomez, Hilbert J. Kappen

* Submitted to Journal of Machine Learning Research 

  Access Paper or Ask Questions