Picture for Michela Paganini

Michela Paganini

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Add code
Mar 08, 2024
Viaarxiv icon

Gemini: A Family of Highly Capable Multimodal Models

Add code
Dec 19, 2023
Viaarxiv icon

Towards Compute-Optimal Transfer Learning

Add code
Apr 25, 2023
Figure 1 for Towards Compute-Optimal Transfer Learning
Figure 2 for Towards Compute-Optimal Transfer Learning
Figure 3 for Towards Compute-Optimal Transfer Learning
Figure 4 for Towards Compute-Optimal Transfer Learning
Viaarxiv icon

Neural Algorithmic Reasoning with Causal Regularisation

Add code
Feb 20, 2023
Figure 1 for Neural Algorithmic Reasoning with Causal Regularisation
Figure 2 for Neural Algorithmic Reasoning with Causal Regularisation
Figure 3 for Neural Algorithmic Reasoning with Causal Regularisation
Figure 4 for Neural Algorithmic Reasoning with Causal Regularisation
Viaarxiv icon

Unified Scaling Laws for Routed Language Models

Add code
Feb 09, 2022
Figure 1 for Unified Scaling Laws for Routed Language Models
Figure 2 for Unified Scaling Laws for Routed Language Models
Figure 3 for Unified Scaling Laws for Routed Language Models
Figure 4 for Unified Scaling Laws for Routed Language Models
Viaarxiv icon

Improving language models by retrieving from trillions of tokens

Add code
Jan 11, 2022
Figure 1 for Improving language models by retrieving from trillions of tokens
Figure 2 for Improving language models by retrieving from trillions of tokens
Figure 3 for Improving language models by retrieving from trillions of tokens
Figure 4 for Improving language models by retrieving from trillions of tokens
Viaarxiv icon

Scaling Language Models: Methods, Analysis & Insights from Training Gopher

Add code
Dec 08, 2021
Figure 1 for Scaling Language Models: Methods, Analysis & Insights from Training Gopher
Figure 2 for Scaling Language Models: Methods, Analysis & Insights from Training Gopher
Figure 3 for Scaling Language Models: Methods, Analysis & Insights from Training Gopher
Figure 4 for Scaling Language Models: Methods, Analysis & Insights from Training Gopher
Viaarxiv icon

Prune Responsibly

Add code
Sep 10, 2020
Figure 1 for Prune Responsibly
Figure 2 for Prune Responsibly
Figure 3 for Prune Responsibly
Figure 4 for Prune Responsibly
Viaarxiv icon

Bespoke vs. Prêt-à-Porter Lottery Tickets: Exploiting Mask Similarity for Trainable Sub-Network Finding

Add code
Jul 06, 2020
Figure 1 for Bespoke vs. Prêt-à-Porter Lottery Tickets: Exploiting Mask Similarity for Trainable Sub-Network Finding
Figure 2 for Bespoke vs. Prêt-à-Porter Lottery Tickets: Exploiting Mask Similarity for Trainable Sub-Network Finding
Figure 3 for Bespoke vs. Prêt-à-Porter Lottery Tickets: Exploiting Mask Similarity for Trainable Sub-Network Finding
Figure 4 for Bespoke vs. Prêt-à-Porter Lottery Tickets: Exploiting Mask Similarity for Trainable Sub-Network Finding
Viaarxiv icon

dagger: A Python Framework for Reproducible Machine Learning Experiment Orchestration

Add code
Jun 12, 2020
Figure 1 for dagger: A Python Framework for Reproducible Machine Learning Experiment Orchestration
Viaarxiv icon