Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning


Jun 30, 2022
Julien Perolat , Bart de Vylder , Daniel Hennes , Eugene Tarassov , Florian Strub , Vincent de Boer , Paul Muller , Jerome T. Connor , Neil Burch , Thomas Anthony , Stephen McAleer , Romuald Elie , Sarah H. Cen , Zhe Wang , Audrunas Gruslys , Aleksandra Malysheva , Mina Khan , Sherjil Ozair , Finbarr Timbers , Toby Pohlen , Tom Eccles , Mark Rowland , Marc Lanctot , Jean-Baptiste Lespiau , Bilal Piot , Shayegan Omidshafiei , Edward Lockhart , Laurent Sifre , Nathalie Beauguerlange , Remi Munos , David Silver , Satinder Singh , Demis Hassabis , Karl Tuyls



Improving language models by retrieving from trillions of tokens


Jan 11, 2022
Sebastian Borgeaud , Arthur Mensch , Jordan Hoffmann , Trevor Cai , Eliza Rutherford , Katie Millican , George van den Driessche , Jean-Baptiste Lespiau , Bogdan Damoc , Aidan Clark , Diego de Las Casas , Aurelia Guy , Jacob Menick , Roman Ring , Tom Hennigan , Saffron Huang , Loren Maggiore , Chris Jones , Albin Cassirer , Andy Brock , Michela Paganini , Geoffrey Irving , Oriol Vinyals , Simon Osindero , Karen Simonyan , Jack W. Rae , Erich Elsen , Laurent Sifre

* Add missing references. Fix some typos 


Scaling Language Models: Methods, Analysis & Insights from Training Gopher


Dec 08, 2021
Jack W. Rae , Sebastian Borgeaud , Trevor Cai , Katie Millican , Jordan Hoffmann , Francis Song , John Aslanides , Sarah Henderson , Roman Ring , Susannah Young , Eliza Rutherford , Tom Hennigan , Jacob Menick , Albin Cassirer , Richard Powell , George van den Driessche , Lisa Anne Hendricks , Maribeth Rauh , Po-Sen Huang , Amelia Glaese , Johannes Welbl , Sumanth Dathathri , Saffron Huang , Jonathan Uesato , John Mellor , Irina Higgins , Antonia Creswell , Nat McAleese , Amy Wu , Erich Elsen , Siddhant Jayakumar , Elena Buchatskaya , David Budden , Esme Sutherland , Karen Simonyan , Michela Paganini , Laurent Sifre , Lena Martens , Xiang Lorraine Li , Adhiguna Kuncoro , Aida Nematzadeh , Elena Gribovskaya , Domenic Donato , Angeliki Lazaridou , Arthur Mensch , Jean-Baptiste Lespiau , Maria Tsimpoukelli , Nikolai Grigorev , Doug Fritz , Thibault Sottiaux , Mantas Pajarskas , Toby Pohlen , Zhitao Gong , Daniel Toyama , Cyprien de Masson d'Autume , Yujia Li , Tayfun Terzi , Vladimir Mikulik , Igor Babuschkin , Aidan Clark , Diego de Las Casas , Aurelia Guy , Chris Jones , James Bradbury , Matthew Johnson , Blake Hechtman , Laura Weidinger , Iason Gabriel , William Isaac , Ed Lockhart , Simon Osindero , Laura Rimell , Chris Dyer , Oriol Vinyals , Kareem Ayoub , Jeff Stanway , Lorrayne Bennett , Demis Hassabis , Koray Kavukcuoglu , Geoffrey Irving

* 118 pages 


Machine Translation Decoding beyond Beam Search


Apr 12, 2021
Rémi Leblond , Jean-Baptiste Alayrac , Laurent Sifre , Miruna Pislar , Jean-Baptiste Lespiau , Ioannis Antonoglou , Karen Simonyan , Oriol Vinyals

* 23 pages 


The Advantage Regret-Matching Actor-Critic


Aug 27, 2020
Audrūnas Gruslys , Marc Lanctot , Rémi Munos , Finbarr Timbers , Martin Schmid , Julien Perolat , Dustin Morrill , Vinicius Zambaldi , Jean-Baptiste Lespiau , John Schultz , Mohammad Gheshlaghi Azar , Michael Bowling , Karl Tuyls



From Poincaré Recurrence to Convergence in Imperfect Information Games: Finding Equilibrium via Regularization


Feb 19, 2020
Julien Perolat , Remi Munos , Jean-Baptiste Lespiau , Shayegan Omidshafiei , Mark Rowland , Pedro Ortega , Neil Burch , Thomas Anthony , David Balduzzi , Bart De Vylder , Georgios Piliouras , Marc Lanctot , Karl Tuyls

* 43 pages 


OpenSpiel: A Framework for Reinforcement Learning in Games


Oct 10, 2019
Marc Lanctot , Edward Lockhart , Jean-Baptiste Lespiau , Vinicius Zambaldi , Satyaki Upadhyay , Julien Pérolat , Sriram Srinivasan , Finbarr Timbers , Karl Tuyls , Shayegan Omidshafiei , Daniel Hennes , Dustin Morrill , Paul Muller , Timo Ewalds , Ryan Faulkner , János Kramár , Bart De Vylder , Brennan Saeta , James Bradbury , David Ding , Sebastian Borgeaud , Matthew Lai , Julian Schrittwieser , Thomas Anthony , Edward Hughes , Ivo Danihelka , Jonah Ryan-Davis

