Sebastian Borgeaud

RecurrentGemma: Moving Past Transformers for Efficient Open Language Models

Apr 11, 2024
Via arXiv

Gemma: Open Models Based on Gemini Research and Technology

Mar 13, 2024
Via arXiv

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Mar 08, 2024
Via arXiv

Gemini: A Family of Highly Capable Multimodal Models

Dec 19, 2023
Via arXiv

Accelerating Large Language Model Decoding with Speculative Sampling

Feb 02, 2023
Via arXiv

Emergent Abilities of Large Language Models

Jun 15, 2022
Via arXiv

Flamingo: a Visual Language Model for Few-Shot Learning

Apr 29, 2022
Via arXiv

Training Compute-Optimal Large Language Models

Mar 29, 2022
Via arXiv

General-purpose, long-context autoregressive modeling with Perceiver AR

Feb 15, 2022
Via arXiv

Unified Scaling Laws for Routed Language Models

Feb 09, 2022
Via arXiv