Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

Teaching language models to support answers with verified quotes



Jacob Menick , Maja Trebacz , Vladimir Mikulik , John Aslanides , Francis Song , Martin Chadwick , Mia Glaese , Susannah Young , Lucy Campbell-Gillingham , Geoffrey Irving , Nat McAleese


   Access Paper or Ask Questions

Red Teaming Language Models with Language Models



Ethan Perez , Saffron Huang , Francis Song , Trevor Cai , Roman Ring , John Aslanides , Amelia Glaese , Nat McAleese , Geoffrey Irving


   Access Paper or Ask Questions

Scaling Language Models: Methods, Analysis & Insights from Training Gopher



Jack W. Rae , Sebastian Borgeaud , Trevor Cai , Katie Millican , Jordan Hoffmann , Francis Song , John Aslanides , Sarah Henderson , Roman Ring , Susannah Young , Eliza Rutherford , Tom Hennigan , Jacob Menick , Albin Cassirer , Richard Powell , George van den Driessche , Lisa Anne Hendricks , Maribeth Rauh , Po-Sen Huang , Amelia Glaese , Johannes Welbl , Sumanth Dathathri , Saffron Huang , Jonathan Uesato , John Mellor , Irina Higgins , Antonia Creswell , Nat McAleese , Amy Wu , Erich Elsen , Siddhant Jayakumar , Elena Buchatskaya , David Budden , Esme Sutherland , Karen Simonyan , Michela Paganini , Laurent Sifre , Lena Martens , Xiang Lorraine Li , Adhiguna Kuncoro , Aida Nematzadeh , Elena Gribovskaya , Domenic Donato , Angeliki Lazaridou , Arthur Mensch , Jean-Baptiste Lespiau , Maria Tsimpoukelli , Nikolai Grigorev , Doug Fritz , Thibault Sottiaux , Mantas Pajarskas , Toby Pohlen , Zhitao Gong , Daniel Toyama , Cyprien de Masson d'Autume , Yujia Li , Tayfun Terzi , Vladimir Mikulik , Igor Babuschkin , Aidan Clark , Diego de Las Casas , Aurelia Guy , Chris Jones , James Bradbury , Matthew Johnson , Blake Hechtman , Laura Weidinger , Iason Gabriel , William Isaac , Ed Lockhart , Simon Osindero , Laura Rimell , Chris Dyer , Oriol Vinyals , Kareem Ayoub , Jeff Stanway , Lorrayne Bennett , Demis Hassabis , Koray Kavukcuoglu , Geoffrey Irving

* 118 pages 

   Access Paper or Ask Questions

Acme: A Research Framework for Distributed Reinforcement Learning



Matt Hoffman , Bobak Shahriari , John Aslanides , Gabriel Barth-Maron , Feryal Behbahani , Tamara Norman , Abbas Abdolmaleki , Albin Cassirer , Fan Yang , Kate Baumli , Sarah Henderson , Alex Novikov , Sergio G贸mez Colmenarejo , Serkan Cabi , Caglar Gulcehre , Tom Le Paine , Andrew Cowie , Ziyu Wang , Bilal Piot , Nando de Freitas


   Access Paper or Ask Questions

Divide-and-Conquer Monte Carlo Tree Search For Goal-Directed Planning



Giambattista Parascandolo , Lars Buesing , Josh Merel , Leonard Hasenclever , John Aslanides , Jessica B. Hamrick , Nicolas Heess , Alexander Neitz , Theophane Weber


   Access Paper or Ask Questions

Behaviour Suite for Reinforcement Learning



Ian Osband , Yotam Doron , Matteo Hessel , John Aslanides , Eren Sezener , Andre Saraiva , Katrina McKinney , Tor Lattimore , Csaba Szepezvari , Satinder Singh , Benjamin Van Roy , Richard Sutton , David Silver , Hado Van Hasselt


   Access Paper or Ask Questions

When to use parametric models in reinforcement learning?



Hado van Hasselt , Matteo Hessel , John Aslanides


   Access Paper or Ask Questions

TF-Replicator: Distributed Machine Learning for Researchers



Peter Buchlovsky , David Budden , Dominik Grewe , Chris Jones , John Aslanides , Frederic Besse , Andy Brock , Aidan Clark , Sergio G贸mez Colmenarejo , Aedan Pope , Fabio Viola , Dan Belov


   Access Paper or Ask Questions

Randomized Prior Functions for Deep Reinforcement Learning



Ian Osband , John Aslanides , Albin Cassirer


   Access Paper or Ask Questions

1
2
>>