Large-Scale Retrieval for Reinforcement Learning



Peter C. Humphreys, Arthur Guez, Olivier Tieleman, Laurent Sifre, Théophane Weber, Timothy Lillicrap

* Preprint, 16 pages 

Training Compute-Optimal Large Language Models



Jordan Hoffmann, Sebastian Borgeaud, Arthur Mensch, Elena Buchatskaya, Trevor Cai, Eliza Rutherford, Diego de Las Casas, Lisa Anne Hendricks, Johannes Welbl, Aidan Clark, Tom Hennigan, Eric Noland, Katie Millican, George van den Driessche, Bogdan Damoc, Aurelia Guy, Simon Osindero, Karen Simonyan, Erich Elsen, Jack W. Rae, Oriol Vinyals, Laurent Sifre


Unified Scaling Laws for Routed Language Models



Aidan Clark, Diego de las Casas, Aurelia Guy, Arthur Mensch, Michela Paganini, Jordan Hoffmann, Bogdan Damoc, Blake Hechtman, Trevor Cai, Sebastian Borgeaud, George van den Driessche, Eliza Rutherford, Tom Hennigan, Matthew Johnson, Katie Millican, Albin Cassirer, Chris Jones, Elena Buchatskaya, David Budden, Laurent Sifre, Simon Osindero, Oriol Vinyals, Jack Rae, Erich Elsen, Koray Kavukcuoglu, Karen Simonyan

* Fixing typos and affiliation clarity 

Improving language models by retrieving from trillions of tokens



Sebastian Borgeaud, Arthur Mensch, Jordan Hoffmann, Trevor Cai, Eliza Rutherford, Katie Millican, George van den Driessche, Jean-Baptiste Lespiau, Bogdan Damoc, Aidan Clark, Diego de Las Casas, Aurelia Guy, Jacob Menick, Roman Ring, Tom Hennigan, Saffron Huang, Loren Maggiore, Chris Jones, Albin Cassirer, Andy Brock, Michela Paganini, Geoffrey Irving, Oriol Vinyals, Simon Osindero, Karen Simonyan, Jack W. Rae, Erich Elsen, Laurent Sifre

* Add missing references. Fix some typos 

Scaling Language Models: Methods, Analysis & Insights from Training Gopher



Jack W. Rae, Sebastian Borgeaud, Trevor Cai, Katie Millican, Jordan Hoffmann, Francis Song, John Aslanides, Sarah Henderson, Roman Ring, Susannah Young, Eliza Rutherford, Tom Hennigan, Jacob Menick, Albin Cassirer, Richard Powell, George van den Driessche, Lisa Anne Hendricks, Maribeth Rauh, Po-Sen Huang, Amelia Glaese, Johannes Welbl, Sumanth Dathathri, Saffron Huang, Jonathan Uesato, John Mellor, Irina Higgins, Antonia Creswell, Nat McAleese, Amy Wu, Erich Elsen, Siddhant Jayakumar, Elena Buchatskaya, David Budden, Esme Sutherland, Karen Simonyan, Michela Paganini, Laurent Sifre, Lena Martens, Xiang Lorraine Li, Adhiguna Kuncoro, Aida Nematzadeh, Elena Gribovskaya, Domenic Donato, Angeliki Lazaridou, Arthur Mensch, Jean-Baptiste Lespiau, Maria Tsimpoukelli, Nikolai Grigorev, Doug Fritz, Thibault Sottiaux, Mantas Pajarskas, Toby Pohlen, Zhitao Gong, Daniel Toyama, Cyprien de Masson d'Autume, Yujia Li, Tayfun Terzi, Vladimir Mikulik, Igor Babuschkin, Aidan Clark, Diego de Las Casas, Aurelia Guy, Chris Jones, James Bradbury, Matthew Johnson, Blake Hechtman, Laura Weidinger, Iason Gabriel, William Isaac, Ed Lockhart, Simon Osindero, Laura Rimell, Chris Dyer, Oriol Vinyals, Kareem Ayoub, Jeff Stanway, Lorrayne Bennett, Demis Hassabis, Koray Kavukcuoglu, Geoffrey Irving

* 118 pages 

Muesli: Combining Improvements in Policy Optimization



Matteo Hessel, Ivo Danihelka, Fabio Viola, Arthur Guez, Simon Schmitt, Laurent Sifre, Theophane Weber, David Silver, Hado van Hasselt


Machine Translation Decoding beyond Beam Search



Rémi Leblond, Jean-Baptiste Alayrac, Laurent Sifre, Miruna Pislar, Jean-Baptiste Lespiau, Ioannis Antonoglou, Karen Simonyan, Oriol Vinyals

* 23 pages 

Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model



Julian Schrittwieser, Ioannis Antonoglou, Thomas Hubert, Karen Simonyan, Laurent Sifre, Simon Schmitt, Arthur Guez, Edward Lockhart, Demis Hassabis, Thore Graepel, Timothy Lillicrap, David Silver


Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm



David Silver, Thomas Hubert, Julian Schrittwieser, Ioannis Antonoglou, Matthew Lai, Arthur Guez, Marc Lanctot, Laurent Sifre, Dharshan Kumaran, Thore Graepel, Timothy Lillicrap, Karen Simonyan, Demis Hassabis

