Alert button
Picture for Soham De

Soham De

Alert button

RecurrentGemma: Moving Past Transformers for Efficient Open Language Models

Add code
Bookmark button
Alert button
Apr 11, 2024
Aleksandar Botev, Soham De, Samuel L Smith, Anushan Fernando, George-Cristian Muraru, Ruba Haroun, Leonard Berrada, Razvan Pascanu, Pier Giuseppe Sessa, Robert Dadashi, Léonard Hussenot, Johan Ferret, Sertan Girgin, Olivier Bachem, Alek Andreev, Kathleen Kenealy, Thomas Mesnard, Cassidy Hardin, Surya Bhupatiraju, Shreya Pathak, Laurent Sifre, Morgane Rivière, Mihir Sanjay Kale, Juliette Love, Pouya Tafti, Armand Joulin, Noah Fiedel, Evan Senter, Yutian Chen, Srivatsan Srinivasan, Guillaume Desjardins, David Budden, Arnaud Doucet, Sharad Vikram, Adam Paszke, Trevor Gale, Sebastian Borgeaud, Charlie Chen, Andy Brock, Antonia Paterson, Jenny Brennan, Meg Risdal, Raj Gundluru, Nesh Devanathan, Paul Mooney, Nilay Chauhan, Phil Culliton, Luiz GUStavo Martins, Elisa Bandy, David Huntsperger, Glenn Cameron, Arthur Zucker, Tris Warkentin, Ludovic Peran, Minh Giang, Zoubin Ghahramani, Clément Farabet, Koray Kavukcuoglu, Demis Hassabis, Raia Hadsell, Yee Whye Teh, Nando de Frietas

Viaarxiv icon

Gemma: Open Models Based on Gemini Research and Technology

Add code
Bookmark button
Alert button
Mar 13, 2024
Gemma Team, Thomas Mesnard, Cassidy Hardin, Robert Dadashi, Surya Bhupatiraju, Shreya Pathak, Laurent Sifre, Morgane Rivière, Mihir Sanjay Kale, Juliette Love, Pouya Tafti, Léonard Hussenot, Aakanksha Chowdhery, Adam Roberts, Aditya Barua, Alex Botev, Alex Castro-Ros, Ambrose Slone, Amélie Héliou, Andrea Tacchetti, Anna Bulanova, Antonia Paterson, Beth Tsai, Bobak Shahriari, Charline Le Lan, Christopher A. Choquette-Choo, Clément Crepy, Daniel Cer, Daphne Ippolito, David Reid, Elena Buchatskaya, Eric Ni, Eric Noland, Geng Yan, George Tucker, George-Christian Muraru, Grigory Rozhdestvenskiy, Henryk Michalewski, Ian Tenney, Ivan Grishchenko, Jacob Austin, James Keeling, Jane Labanowski, Jean-Baptiste Lespiau, Jeff Stanway, Jenny Brennan, Jeremy Chen, Johan Ferret, Justin Chiu, Justin Mao-Jones, Katherine Lee, Kathy Yu, Katie Millican, Lars Lowe Sjoesund, Lisa Lee, Lucas Dixon, Machel Reid, Maciej Mikuła, Mateo Wirth, Michael Sharman, Nikolai Chinaev, Nithum Thain, Olivier Bachem, Oscar Chang, Oscar Wahltinez, Paige Bailey, Paul Michel, Petko Yotov, Pier Giuseppe Sessa, Rahma Chaabouni, Ramona Comanescu, Reena Jana, Rohan Anil, Ross McIlroy, Ruibo Liu, Ryan Mullins, Samuel L Smith, Sebastian Borgeaud, Sertan Girgin, Sholto Douglas, Shree Pandya, Siamak Shakeri, Soham De, Ted Klimenko, Tom Hennigan, Vlad Feinberg, Wojciech Stokowiec, Yu-hui Chen, Zafarali Ahmed, Zhitao Gong, Tris Warkentin, Ludovic Peran, Minh Giang, Clément Farabet, Oriol Vinyals, Jeff Dean, Koray Kavukcuoglu, Demis Hassabis, Zoubin Ghahramani, Douglas Eck, Joelle Barral, Fernando Pereira, Eli Collins, Armand Joulin, Noah Fiedel, Evan Senter, Alek Andreev, Kathleen Kenealy

Figure 1 for Gemma: Open Models Based on Gemini Research and Technology
Figure 2 for Gemma: Open Models Based on Gemini Research and Technology
Figure 3 for Gemma: Open Models Based on Gemini Research and Technology
Figure 4 for Gemma: Open Models Based on Gemini Research and Technology
Viaarxiv icon

Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models

Add code
Bookmark button
Alert button
Feb 29, 2024
Soham De, Samuel L. Smith, Anushan Fernando, Aleksandar Botev, George Cristian-Muraru, Albert Gu, Ruba Haroun, Leonard Berrada, Yutian Chen, Srivatsan Srinivasan, Guillaume Desjardins, Arnaud Doucet, David Budden, Yee Whye Teh, Razvan Pascanu, Nando De Freitas, Caglar Gulcehre

Viaarxiv icon

ConvNets Match Vision Transformers at Scale

Add code
Bookmark button
Alert button
Oct 25, 2023
Samuel L. Smith, Andrew Brock, Leonard Berrada, Soham De

Viaarxiv icon

Unlocking Accuracy and Fairness in Differentially Private Image Classification

Add code
Bookmark button
Alert button
Aug 21, 2023
Leonard Berrada, Soham De, Judy Hanwen Shen, Jamie Hayes, Robert Stanforth, David Stutz, Pushmeet Kohli, Samuel L. Smith, Borja Balle

Figure 1 for Unlocking Accuracy and Fairness in Differentially Private Image Classification
Figure 2 for Unlocking Accuracy and Fairness in Differentially Private Image Classification
Figure 3 for Unlocking Accuracy and Fairness in Differentially Private Image Classification
Figure 4 for Unlocking Accuracy and Fairness in Differentially Private Image Classification
Viaarxiv icon

On the Universality of Linear Recurrences Followed by Nonlinear Projections

Add code
Bookmark button
Alert button
Jul 21, 2023
Antonio Orvieto, Soham De, Caglar Gulcehre, Razvan Pascanu, Samuel L. Smith

Figure 1 for On the Universality of Linear Recurrences Followed by Nonlinear Projections
Figure 2 for On the Universality of Linear Recurrences Followed by Nonlinear Projections
Figure 3 for On the Universality of Linear Recurrences Followed by Nonlinear Projections
Figure 4 for On the Universality of Linear Recurrences Followed by Nonlinear Projections
Viaarxiv icon

Resurrecting Recurrent Neural Networks for Long Sequences

Add code
Bookmark button
Alert button
Mar 11, 2023
Antonio Orvieto, Samuel L Smith, Albert Gu, Anushan Fernando, Caglar Gulcehre, Razvan Pascanu, Soham De

Figure 1 for Resurrecting Recurrent Neural Networks for Long Sequences
Figure 2 for Resurrecting Recurrent Neural Networks for Long Sequences
Figure 3 for Resurrecting Recurrent Neural Networks for Long Sequences
Figure 4 for Resurrecting Recurrent Neural Networks for Long Sequences
Viaarxiv icon

Differentially Private Diffusion Models Generate Useful Synthetic Images

Add code
Bookmark button
Alert button
Feb 27, 2023
Sahra Ghalebikesabi, Leonard Berrada, Sven Gowal, Ira Ktena, Robert Stanforth, Jamie Hayes, Soham De, Samuel L. Smith, Olivia Wiles, Borja Balle

Figure 1 for Differentially Private Diffusion Models Generate Useful Synthetic Images
Figure 2 for Differentially Private Diffusion Models Generate Useful Synthetic Images
Figure 3 for Differentially Private Diffusion Models Generate Useful Synthetic Images
Figure 4 for Differentially Private Diffusion Models Generate Useful Synthetic Images
Viaarxiv icon

Unlocking High-Accuracy Differentially Private Image Classification through Scale

Add code
Bookmark button
Alert button
Apr 28, 2022
Soham De, Leonard Berrada, Jamie Hayes, Samuel L. Smith, Borja Balle

Figure 1 for Unlocking High-Accuracy Differentially Private Image Classification through Scale
Figure 2 for Unlocking High-Accuracy Differentially Private Image Classification through Scale
Figure 3 for Unlocking High-Accuracy Differentially Private Image Classification through Scale
Figure 4 for Unlocking High-Accuracy Differentially Private Image Classification through Scale
Viaarxiv icon

Regularising for invariance to data augmentation improves supervised learning

Add code
Bookmark button
Alert button
Mar 07, 2022
Aleksander Botev, Matthias Bauer, Soham De

Figure 1 for Regularising for invariance to data augmentation improves supervised learning
Figure 2 for Regularising for invariance to data augmentation improves supervised learning
Figure 3 for Regularising for invariance to data augmentation improves supervised learning
Figure 4 for Regularising for invariance to data augmentation improves supervised learning
Viaarxiv icon