Picture for Anushan Fernando

Anushan Fernando

RecurrentGemma: Moving Past Transformers for Efficient Open Language Models

Add code
Apr 11, 2024
Figure 1 for RecurrentGemma: Moving Past Transformers for Efficient Open Language Models
Figure 2 for RecurrentGemma: Moving Past Transformers for Efficient Open Language Models
Figure 3 for RecurrentGemma: Moving Past Transformers for Efficient Open Language Models
Figure 4 for RecurrentGemma: Moving Past Transformers for Efficient Open Language Models
Viaarxiv icon

Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models

Add code
Feb 29, 2024
Viaarxiv icon

Resurrecting Recurrent Neural Networks for Long Sequences

Add code
Mar 11, 2023
Figure 1 for Resurrecting Recurrent Neural Networks for Long Sequences
Figure 2 for Resurrecting Recurrent Neural Networks for Long Sequences
Figure 3 for Resurrecting Recurrent Neural Networks for Long Sequences
Figure 4 for Resurrecting Recurrent Neural Networks for Long Sequences
Viaarxiv icon