Get our free extension to see links to code for papers anywhere online!

 Add to Chrome

 Add to Firefox

CatalyzeX Code Finder - Browser extension linking code for ML papers across the web! | Product Hunt Embed
Game Plan: What AI can do for Football, and What Football can do for AI

Nov 18, 2020
Karl Tuyls, Shayegan Omidshafiei, Paul Muller, Zhe Wang, Jerome Connor, Daniel Hennes, Ian Graham, William Spearman, Tim Waskett, Dafydd Steele, Pauline Luc, Adria Recasens, Alexandre Galashov, Gregory Thornton, Romuald Elie, Pablo Sprechmann, Pol Moreno, Kris Cao, Marta Garnelo, Praneet Dutta, Michal Valko, Nicolas Heess, Alex Bridgland, Julien Perolat, Bart De Vylder, Ali Eslami, Mark Rowland, Andrew Jaegle, Remi Munos, Trevor Back, Razia Ahamed, Simon Bouton, Nathalie Beauguerlange, Jackson Broshear, Thore Graepel, Demis Hassabis


  Access Paper or Ask Questions

Navigating the Landscape of Games

May 04, 2020
Shayegan Omidshafiei, Karl Tuyls, Wojciech M. Czarnecki, Francisco C. Santos, Mark Rowland, Jerome Connor, Daniel Hennes, Paul Muller, Julien Perolat, Bart De Vylder, Audrunas Gruslys, Remi Munos


  Access Paper or Ask Questions

From Poincar茅 Recurrence to Convergence in Imperfect Information Games: Finding Equilibrium via Regularization

Feb 19, 2020
Julien Perolat, Remi Munos, Jean-Baptiste Lespiau, Shayegan Omidshafiei, Mark Rowland, Pedro Ortega, Neil Burch, Thomas Anthony, David Balduzzi, Bart De Vylder, Georgios Piliouras, Marc Lanctot, Karl Tuyls

* 43 pages 

  Access Paper or Ask Questions

Hindsight Credit Assignment

Dec 05, 2019
Anna Harutyunyan, Will Dabney, Thomas Mesnard, Mohammad Azar, Bilal Piot, Nicolas Heess, Hado van Hasselt, Greg Wayne, Satinder Singh, Doina Precup, Remi Munos

* NeurIPS 2019 

  Access Paper or Ask Questions

Multiagent Evaluation under Incomplete Information

Oct 30, 2019
Mark Rowland, Shayegan Omidshafiei, Karl Tuyls, Julien Perolat, Michal Valko, Georgios Piliouras, Remi Munos


  Access Paper or Ask Questions

A Generalized Training Approach for Multiagent Learning

Sep 27, 2019
Paul Muller, Shayegan Omidshafiei, Mark Rowland, Karl Tuyls, Julien Perolat, Siqi Liu, Daniel Hennes, Luke Marris, Marc Lanctot, Edward Hughes, Zhe Wang, Guy Lever, Nicolas Heess, Thore Graepel, Remi Munos


  Access Paper or Ask Questions

Neural Replicator Dynamics

Jun 01, 2019
Shayegan Omidshafiei, Daniel Hennes, Dustin Morrill, Remi Munos, Julien Perolat, Marc Lanctot, Audrunas Gruslys, Jean-Baptiste Lespiau, Karl Tuyls


  Access Paper or Ask Questions

The Termination Critic

Feb 26, 2019
Anna Harutyunyan, Will Dabney, Diana Borsa, Nicolas Heess, Remi Munos, Doina Precup

* AISTATS 2019 

  Access Paper or Ask Questions

The Uncertainty Bellman Equation and Exploration

Oct 22, 2018
Brendan O'Donoghue, Ian Osband, Remi Munos, Volodymyr Mnih


  Access Paper or Ask Questions

Actor-Critic Policy Optimization in Partially Observable Multiagent Environments

Oct 21, 2018
Sriram Srinivasan, Marc Lanctot, Vinicius Zambaldi, Julien Perolat, Karl Tuyls, Remi Munos, Michael Bowling

* NIPS 2018 

  Access Paper or Ask Questions

IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures

Jun 28, 2018
Lasse Espeholt, Hubert Soyer, Remi Munos, Karen Simonyan, Volodymir Mnih, Tom Ward, Yotam Doron, Vlad Firoiu, Tim Harley, Iain Dunning, Shane Legg, Koray Kavukcuoglu


  Access Paper or Ask Questions

Maximum a Posteriori Policy Optimisation

Jun 14, 2018
Abbas Abdolmaleki, Jost Tobias Springenberg, Yuval Tassa, Remi Munos, Nicolas Heess, Martin Riedmiller


  Access Paper or Ask Questions

Low-pass Recurrent Neural Networks - A memory architecture for longer-term correlation discovery

May 13, 2018
Thomas Stepleton, Razvan Pascanu, Will Dabney, Siddhant M. Jayakumar, Hubert Soyer, Remi Munos


  Access Paper or Ask Questions

A Study on Overfitting in Deep Reinforcement Learning

Apr 20, 2018
Chiyuan Zhang, Oriol Vinyals, Remi Munos, Samy Bengio


  Access Paper or Ask Questions

Noisy Networks for Exploration

Feb 15, 2018
Meire Fortunato, Mohammad Gheshlaghi Azar, Bilal Piot, Jacob Menick, Ian Osband, Alex Graves, Vlad Mnih, Remi Munos, Demis Hassabis, Olivier Pietquin, Charles Blundell, Shane Legg

* ICLR 2018 

  Access Paper or Ask Questions

Sample Efficient Actor-Critic with Experience Replay

Jul 10, 2017
Ziyu Wang, Victor Bapst, Nicolas Heess, Volodymyr Mnih, Remi Munos, Koray Kavukcuoglu, Nando de Freitas

* 20 pages. Prepared for ICLR 2017 

  Access Paper or Ask Questions

Count-Based Exploration with Neural Density Models

Jun 14, 2017
Georg Ostrovski, Marc G. Bellemare, Aaron van den Oord, Remi Munos


  Access Paper or Ask Questions

Automated Curriculum Learning for Neural Networks

Apr 10, 2017
Alex Graves, Marc G. Bellemare, Jacob Menick, Remi Munos, Koray Kavukcuoglu


  Access Paper or Ask Questions

Combining policy gradient and Q-learning

Apr 07, 2017
Brendan O'Donoghue, Remi Munos, Koray Kavukcuoglu, Volodymyr Mnih


  Access Paper or Ask Questions

Learning to reinforcement learn

Jan 23, 2017
Jane X Wang, Zeb Kurth-Nelson, Dhruva Tirumala, Hubert Soyer, Joel Z Leibo, Remi Munos, Charles Blundell, Dharshan Kumaran, Matt Botvinick

* 17 pages, 7 figures, 1 table 

  Access Paper or Ask Questions

Unifying Count-Based Exploration and Intrinsic Motivation

Nov 07, 2016
Marc G. Bellemare, Sriram Srinivasan, Georg Ostrovski, Tom Schaul, David Saxton, Remi Munos


  Access Paper or Ask Questions

Q($位$) with Off-Policy Corrections

Aug 11, 2016
Anna Harutyunyan, Marc G. Bellemare, Tom Stepleton, Remi Munos


  Access Paper or Ask Questions

Memory-Efficient Backpropagation Through Time

Jun 10, 2016
Audr奴nas Gruslys, Remi Munos, Ivo Danihelka, Marc Lanctot, Alex Graves


  Access Paper or Ask Questions

Generalized Emphatic Temporal Difference Learning: Bias-Variance Analysis

Nov 27, 2015
Assaf Hallak, Aviv Tamar, Remi Munos, Shie Mannor

* arXiv admin note: text overlap with arXiv:1508.03411 

  Access Paper or Ask Questions

Bounded Regret for Finite-Armed Structured Bandits

Nov 11, 2014
Tor Lattimore, Remi Munos

* 16 pages 

  Access Paper or Ask Questions

Active Regression by Stratification

Oct 22, 2014
Sivan Sabato, Remi Munos

* Neural Information Processing Systems, 2014 

  Access Paper or Ask Questions

On Minimax Optimal Offline Policy Evaluation

Sep 12, 2014
Lihong Li, Remi Munos, Csaba Szepesvari


  Access Paper or Ask Questions

Bandit Algorithms for Tree Search

Aug 09, 2014
Pierre-Arnuad Coquelin, Remi Munos

* Appears in Proceedings of the Twenty-Third Conference on Uncertainty in Artificial Intelligence (UAI2007) 

  Access Paper or Ask Questions

Relative Upper Confidence Bound for the K-Armed Dueling Bandit Problem

Dec 17, 2013
Masrour Zoghi, Shimon Whiteson, Remi Munos, Maarten de Rijke

* 13 pages, 6 figures 

  Access Paper or Ask Questions