Navigating the Landscape of Games

May 04, 2020
Shayegan Omidshafiei, Karl Tuyls, Wojciech M. Czarnecki, Francisco C. Santos, Mark Rowland, Jerome Connor, Daniel Hennes, Paul Muller, Julien Perolat, Bart De Vylder, Audrunas Gruslys, Remi Munos

From Poincaré Recurrence to Convergence in Imperfect Information Games: Finding Equilibrium via Regularization

Feb 19, 2020
Julien Perolat, Remi Munos, Jean-Baptiste Lespiau, Shayegan Omidshafiei, Mark Rowland, Pedro Ortega, Neil Burch, Thomas Anthony, David Balduzzi, Bart De Vylder, Georgios Piliouras, Marc Lanctot, Karl Tuyls

* 43 pages 

Hindsight Credit Assignment

Dec 05, 2019
Anna Harutyunyan, Will Dabney, Thomas Mesnard, Mohammad Azar, Bilal Piot, Nicolas Heess, Hado van Hasselt, Greg Wayne, Satinder Singh, Doina Precup, Remi Munos

* NeurIPS 2019 

Multiagent Evaluation under Incomplete Information

Oct 30, 2019
Mark Rowland, Shayegan Omidshafiei, Karl Tuyls, Julien Perolat, Michal Valko, Georgios Piliouras, Remi Munos

A Generalized Training Approach for Multiagent Learning

Sep 27, 2019
Paul Muller, Shayegan Omidshafiei, Mark Rowland, Karl Tuyls, Julien Perolat, Siqi Liu, Daniel Hennes, Luke Marris, Marc Lanctot, Edward Hughes, Zhe Wang, Guy Lever, Nicolas Heess, Thore Graepel, Remi Munos

Neural Replicator Dynamics

Jun 01, 2019
Shayegan Omidshafiei, Daniel Hennes, Dustin Morrill, Remi Munos, Julien Perolat, Marc Lanctot, Audrunas Gruslys, Jean-Baptiste Lespiau, Karl Tuyls

The Termination Critic

Feb 26, 2019
Anna Harutyunyan, Will Dabney, Diana Borsa, Nicolas Heess, Remi Munos, Doina Precup

* AISTATS 2019 

The Uncertainty Bellman Equation and Exploration

Oct 22, 2018
Brendan O'Donoghue, Ian Osband, Remi Munos, Volodymyr Mnih

Actor-Critic Policy Optimization in Partially Observable Multiagent Environments

Oct 21, 2018
Sriram Srinivasan, Marc Lanctot, Vinicius Zambaldi, Julien Perolat, Karl Tuyls, Remi Munos, Michael Bowling

* NIPS 2018 

IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures

Jun 28, 2018
Lasse Espeholt, Hubert Soyer, Remi Munos, Karen Simonyan, Volodymyr Mnih, Tom Ward, Yotam Doron, Vlad Firoiu, Tim Harley, Iain Dunning, Shane Legg, Koray Kavukcuoglu

Maximum a Posteriori Policy Optimisation

Jun 14, 2018
Abbas Abdolmaleki, Jost Tobias Springenberg, Yuval Tassa, Remi Munos, Nicolas Heess, Martin Riedmiller

Low-pass Recurrent Neural Networks - A memory architecture for longer-term correlation discovery

May 13, 2018
Thomas Stepleton, Razvan Pascanu, Will Dabney, Siddhant M. Jayakumar, Hubert Soyer, Remi Munos

A Study on Overfitting in Deep Reinforcement Learning

Apr 20, 2018
Chiyuan Zhang, Oriol Vinyals, Remi Munos, Samy Bengio

Noisy Networks for Exploration

Feb 15, 2018
Meire Fortunato, Mohammad Gheshlaghi Azar, Bilal Piot, Jacob Menick, Ian Osband, Alex Graves, Vlad Mnih, Remi Munos, Demis Hassabis, Olivier Pietquin, Charles Blundell, Shane Legg

* ICLR 2018 

Sample Efficient Actor-Critic with Experience Replay

Jul 10, 2017
Ziyu Wang, Victor Bapst, Nicolas Heess, Volodymyr Mnih, Remi Munos, Koray Kavukcuoglu, Nando de Freitas

* 20 pages. Prepared for ICLR 2017 

Count-Based Exploration with Neural Density Models

Jun 14, 2017
Georg Ostrovski, Marc G. Bellemare, Aaron van den Oord, Remi Munos

Automated Curriculum Learning for Neural Networks

Apr 10, 2017
Alex Graves, Marc G. Bellemare, Jacob Menick, Remi Munos, Koray Kavukcuoglu

Combining policy gradient and Q-learning

Apr 07, 2017
Brendan O'Donoghue, Remi Munos, Koray Kavukcuoglu, Volodymyr Mnih

Learning to reinforcement learn

Jan 23, 2017
Jane X Wang, Zeb Kurth-Nelson, Dhruva Tirumala, Hubert Soyer, Joel Z Leibo, Remi Munos, Charles Blundell, Dharshan Kumaran, Matt Botvinick

* 17 pages, 7 figures, 1 table 

Unifying Count-Based Exploration and Intrinsic Motivation

Nov 07, 2016
Marc G. Bellemare, Sriram Srinivasan, Georg Ostrovski, Tom Schaul, David Saxton, Remi Munos

Q($\lambda$) with Off-Policy Corrections

Aug 11, 2016
Anna Harutyunyan, Marc G. Bellemare, Tom Stepleton, Remi Munos

Memory-Efficient Backpropagation Through Time

Jun 10, 2016
Audrūnas Gruslys, Remi Munos, Ivo Danihelka, Marc Lanctot, Alex Graves

Generalized Emphatic Temporal Difference Learning: Bias-Variance Analysis

Nov 27, 2015
Assaf Hallak, Aviv Tamar, Remi Munos, Shie Mannor

* arXiv admin note: text overlap with arXiv:1508.03411 

Bounded Regret for Finite-Armed Structured Bandits

Nov 11, 2014
Tor Lattimore, Remi Munos

* 16 pages 

Active Regression by Stratification

Oct 22, 2014
Sivan Sabato, Remi Munos

* Neural Information Processing Systems, 2014 

On Minimax Optimal Offline Policy Evaluation

Sep 12, 2014
Lihong Li, Remi Munos, Csaba Szepesvari

Bandit Algorithms for Tree Search

Aug 09, 2014
Pierre-Arnaud Coquelin, Remi Munos

* Appears in Proceedings of the Twenty-Third Conference on Uncertainty in Artificial Intelligence (UAI2007) 

Relative Upper Confidence Bound for the K-Armed Dueling Bandit Problem

Dec 17, 2013
Masrour Zoghi, Shimon Whiteson, Remi Munos, Maarten de Rijke

* 13 pages, 6 figures 

Finite-Time Analysis of Kernelised Contextual Bandits

Sep 26, 2013
Michal Valko, Nathaniel Korda, Remi Munos, Ilias Flaounas, Nello Cristianini

* Appears in Proceedings of the Twenty-Ninth Conference on Uncertainty in Artificial Intelligence (UAI2013) 
