Picture for Ethan Dyer

Ethan Dyer

Shammie

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Add code
Mar 08, 2024
Viaarxiv icon

Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models

Add code
Dec 22, 2023
Figure 1 for Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models
Figure 2 for Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models
Figure 3 for Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models
Figure 4 for Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models
Viaarxiv icon

Gemini: A Family of Highly Capable Multimodal Models

Add code
Dec 19, 2023
Viaarxiv icon

PaLM 2 Technical Report

Add code
May 17, 2023
Figure 1 for PaLM 2 Technical Report
Figure 2 for PaLM 2 Technical Report
Figure 3 for PaLM 2 Technical Report
Figure 4 for PaLM 2 Technical Report
Viaarxiv icon

Exploring Length Generalization in Large Language Models

Add code
Jul 11, 2022
Figure 1 for Exploring Length Generalization in Large Language Models
Figure 2 for Exploring Length Generalization in Large Language Models
Figure 3 for Exploring Length Generalization in Large Language Models
Figure 4 for Exploring Length Generalization in Large Language Models
Viaarxiv icon

Solving Quantitative Reasoning Problems with Language Models

Add code
Jul 01, 2022
Figure 1 for Solving Quantitative Reasoning Problems with Language Models
Figure 2 for Solving Quantitative Reasoning Problems with Language Models
Figure 3 for Solving Quantitative Reasoning Problems with Language Models
Figure 4 for Solving Quantitative Reasoning Problems with Language Models
Viaarxiv icon

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

Add code
Jun 10, 2022
Viaarxiv icon

Block-Recurrent Transformers

Add code
Mar 11, 2022
Figure 1 for Block-Recurrent Transformers
Figure 2 for Block-Recurrent Transformers
Figure 3 for Block-Recurrent Transformers
Figure 4 for Block-Recurrent Transformers
Viaarxiv icon

Explaining Neural Scaling Laws

Add code
Feb 12, 2021
Figure 1 for Explaining Neural Scaling Laws
Figure 2 for Explaining Neural Scaling Laws
Figure 3 for Explaining Neural Scaling Laws
Figure 4 for Explaining Neural Scaling Laws
Viaarxiv icon

When Do Curricula Work?

Add code
Dec 05, 2020
Figure 1 for When Do Curricula Work?
Figure 2 for When Do Curricula Work?
Figure 3 for When Do Curricula Work?
Figure 4 for When Do Curricula Work?
Viaarxiv icon