Soham De

RecurrentGemma: Moving Past Transformers for Efficient Open Language Models

Apr 11, 2024
Via arXiv

Gemma: Open Models Based on Gemini Research and Technology

Mar 13, 2024
Via arXiv

Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models

Feb 29, 2024
Via arXiv

ConvNets Match Vision Transformers at Scale

Oct 25, 2023
Via arXiv

Unlocking Accuracy and Fairness in Differentially Private Image Classification

Aug 21, 2023
Via arXiv

On the Universality of Linear Recurrences Followed by Nonlinear Projections

Jul 21, 2023
Via arXiv

Resurrecting Recurrent Neural Networks for Long Sequences

Mar 11, 2023
Via arXiv

Differentially Private Diffusion Models Generate Useful Synthetic Images

Feb 27, 2023
Via arXiv

Unlocking High-Accuracy Differentially Private Image Classification through Scale

Apr 28, 2022
Via arXiv

Regularising for invariance to data augmentation improves supervised learning

Mar 07, 2022
Via arXiv