Alert button
Picture for David Budden

David Budden

Alert button

RecurrentGemma: Moving Past Transformers for Efficient Open Language Models

Add code
Bookmark button
Alert button
Apr 11, 2024
Aleksandar Botev, Soham De, Samuel L Smith, Anushan Fernando, George-Cristian Muraru, Ruba Haroun, Leonard Berrada, Razvan Pascanu, Pier Giuseppe Sessa, Robert Dadashi, Léonard Hussenot, Johan Ferret, Sertan Girgin, Olivier Bachem, Alek Andreev, Kathleen Kenealy, Thomas Mesnard, Cassidy Hardin, Surya Bhupatiraju, Shreya Pathak, Laurent Sifre, Morgane Rivière, Mihir Sanjay Kale, Juliette Love, Pouya Tafti, Armand Joulin, Noah Fiedel, Evan Senter, Yutian Chen, Srivatsan Srinivasan, Guillaume Desjardins, David Budden, Arnaud Doucet, Sharad Vikram, Adam Paszke, Trevor Gale, Sebastian Borgeaud, Charlie Chen, Andy Brock, Antonia Paterson, Jenny Brennan, Meg Risdal, Raj Gundluru, Nesh Devanathan, Paul Mooney, Nilay Chauhan, Phil Culliton, Luiz GUStavo Martins, Elisa Bandy, David Huntsperger, Glenn Cameron, Arthur Zucker, Tris Warkentin, Ludovic Peran, Minh Giang, Zoubin Ghahramani, Clément Farabet, Koray Kavukcuoglu, Demis Hassabis, Raia Hadsell, Yee Whye Teh, Nando de Frietas

Viaarxiv icon

Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models

Add code
Bookmark button
Alert button
Feb 29, 2024
Soham De, Samuel L. Smith, Anushan Fernando, Aleksandar Botev, George Cristian-Muraru, Albert Gu, Ruba Haroun, Leonard Berrada, Yutian Chen, Srivatsan Srinivasan, Guillaume Desjardins, Arnaud Doucet, David Budden, Yee Whye Teh, Razvan Pascanu, Nando De Freitas, Caglar Gulcehre

Viaarxiv icon

The CLRS Algorithmic Reasoning Benchmark

Add code
Bookmark button
Alert button
Jun 04, 2022
Petar Veličković, Adrià Puigdomènech Badia, David Budden, Razvan Pascanu, Andrea Banino, Misha Dashevskiy, Raia Hadsell, Charles Blundell

Figure 1 for The CLRS Algorithmic Reasoning Benchmark
Figure 2 for The CLRS Algorithmic Reasoning Benchmark
Figure 3 for The CLRS Algorithmic Reasoning Benchmark
Figure 4 for The CLRS Algorithmic Reasoning Benchmark
Viaarxiv icon

Unified Scaling Laws for Routed Language Models

Add code
Bookmark button
Alert button
Feb 09, 2022
Aidan Clark, Diego de las Casas, Aurelia Guy, Arthur Mensch, Michela Paganini, Jordan Hoffmann, Bogdan Damoc, Blake Hechtman, Trevor Cai, Sebastian Borgeaud, George van den Driessche, Eliza Rutherford, Tom Hennigan, Matthew Johnson, Katie Millican, Albin Cassirer, Chris Jones, Elena Buchatskaya, David Budden, Laurent Sifre, Simon Osindero, Oriol Vinyals, Jack Rae, Erich Elsen, Koray Kavukcuoglu, Karen Simonyan

Figure 1 for Unified Scaling Laws for Routed Language Models
Figure 2 for Unified Scaling Laws for Routed Language Models
Figure 3 for Unified Scaling Laws for Routed Language Models
Figure 4 for Unified Scaling Laws for Routed Language Models
Viaarxiv icon

Scaling Language Models: Methods, Analysis & Insights from Training Gopher

Add code
Bookmark button
Alert button
Dec 08, 2021
Jack W. Rae, Sebastian Borgeaud, Trevor Cai, Katie Millican, Jordan Hoffmann, Francis Song, John Aslanides, Sarah Henderson, Roman Ring, Susannah Young, Eliza Rutherford, Tom Hennigan, Jacob Menick, Albin Cassirer, Richard Powell, George van den Driessche, Lisa Anne Hendricks, Maribeth Rauh, Po-Sen Huang, Amelia Glaese, Johannes Welbl, Sumanth Dathathri, Saffron Huang, Jonathan Uesato, John Mellor, Irina Higgins, Antonia Creswell, Nat McAleese, Amy Wu, Erich Elsen, Siddhant Jayakumar, Elena Buchatskaya, David Budden, Esme Sutherland, Karen Simonyan, Michela Paganini, Laurent Sifre, Lena Martens, Xiang Lorraine Li, Adhiguna Kuncoro, Aida Nematzadeh, Elena Gribovskaya, Domenic Donato, Angeliki Lazaridou, Arthur Mensch, Jean-Baptiste Lespiau, Maria Tsimpoukelli, Nikolai Grigorev, Doug Fritz, Thibault Sottiaux, Mantas Pajarskas, Toby Pohlen, Zhitao Gong, Daniel Toyama, Cyprien de Masson d'Autume, Yujia Li, Tayfun Terzi, Vladimir Mikulik, Igor Babuschkin, Aidan Clark, Diego de Las Casas, Aurelia Guy, Chris Jones, James Bradbury, Matthew Johnson, Blake Hechtman, Laura Weidinger, Iason Gabriel, William Isaac, Ed Lockhart, Simon Osindero, Laura Rimell, Chris Dyer, Oriol Vinyals, Kareem Ayoub, Jeff Stanway, Lorrayne Bennett, Demis Hassabis, Koray Kavukcuoglu, Geoffrey Irving

Figure 1 for Scaling Language Models: Methods, Analysis & Insights from Training Gopher
Figure 2 for Scaling Language Models: Methods, Analysis & Insights from Training Gopher
Figure 3 for Scaling Language Models: Methods, Analysis & Insights from Training Gopher
Figure 4 for Scaling Language Models: Methods, Analysis & Insights from Training Gopher
Viaarxiv icon

Large-scale graph representation learning with very deep GNNs and self-supervision

Add code
Bookmark button
Alert button
Jul 20, 2021
Ravichandra Addanki, Peter W. Battaglia, David Budden, Andreea Deac, Jonathan Godwin, Thomas Keck, Wai Lok Sibon Li, Alvaro Sanchez-Gonzalez, Jacklynn Stott, Shantanu Thakoor, Petar Veličković

Figure 1 for Large-scale graph representation learning with very deep GNNs and self-supervision
Figure 2 for Large-scale graph representation learning with very deep GNNs and self-supervision
Figure 3 for Large-scale graph representation learning with very deep GNNs and self-supervision
Viaarxiv icon

A Combinatorial Perspective on Transfer Learning

Add code
Bookmark button
Alert button
Oct 23, 2020
Jianan Wang, Eren Sezener, David Budden, Marcus Hutter, Joel Veness

Figure 1 for A Combinatorial Perspective on Transfer Learning
Figure 2 for A Combinatorial Perspective on Transfer Learning
Figure 3 for A Combinatorial Perspective on Transfer Learning
Figure 4 for A Combinatorial Perspective on Transfer Learning
Viaarxiv icon

Gaussian Gated Linear Networks

Add code
Bookmark button
Alert button
Jun 10, 2020
David Budden, Adam Marblestone, Eren Sezener, Tor Lattimore, Greg Wayne, Joel Veness

Figure 1 for Gaussian Gated Linear Networks
Figure 2 for Gaussian Gated Linear Networks
Figure 3 for Gaussian Gated Linear Networks
Figure 4 for Gaussian Gated Linear Networks
Viaarxiv icon

Online Learning in Contextual Bandits using Gated Linear Networks

Add code
Bookmark button
Alert button
Feb 21, 2020
Eren Sezener, Marcus Hutter, David Budden, Jianan Wang, Joel Veness

Figure 1 for Online Learning in Contextual Bandits using Gated Linear Networks
Figure 2 for Online Learning in Contextual Bandits using Gated Linear Networks
Figure 3 for Online Learning in Contextual Bandits using Gated Linear Networks
Figure 4 for Online Learning in Contextual Bandits using Gated Linear Networks
Viaarxiv icon