Picture for Leo Gao

Leo Gao

Shammie

Scaling and evaluating sparse autoencoders

Add code
Jun 06, 2024
Viaarxiv icon

Lessons from the Trenches on Reproducible Evaluation of Language Models

Add code
May 23, 2024
Figure 1 for Lessons from the Trenches on Reproducible Evaluation of Language Models
Figure 2 for Lessons from the Trenches on Reproducible Evaluation of Language Models
Figure 3 for Lessons from the Trenches on Reproducible Evaluation of Language Models
Figure 4 for Lessons from the Trenches on Reproducible Evaluation of Language Models
Viaarxiv icon

Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision

Add code
Dec 14, 2023
Viaarxiv icon

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Add code
Nov 09, 2022
Viaarxiv icon

Scaling Laws for Reward Model Overoptimization

Add code
Oct 19, 2022
Figure 1 for Scaling Laws for Reward Model Overoptimization
Figure 2 for Scaling Laws for Reward Model Overoptimization
Figure 3 for Scaling Laws for Reward Model Overoptimization
Figure 4 for Scaling Laws for Reward Model Overoptimization
Viaarxiv icon

EleutherAI: Going Beyond "Open Science" to "Science in the Open"

Add code
Oct 12, 2022
Viaarxiv icon

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

Add code
Jun 10, 2022
Viaarxiv icon

GPT-NeoX-20B: An Open-Source Autoregressive Language Model

Add code
Apr 14, 2022
Figure 1 for GPT-NeoX-20B: An Open-Source Autoregressive Language Model
Figure 2 for GPT-NeoX-20B: An Open-Source Autoregressive Language Model
Figure 3 for GPT-NeoX-20B: An Open-Source Autoregressive Language Model
Figure 4 for GPT-NeoX-20B: An Open-Source Autoregressive Language Model
Viaarxiv icon

Datasheet for the Pile

Add code
Jan 13, 2022
Viaarxiv icon

Multitask Prompted Training Enables Zero-Shot Task Generalization

Add code
Oct 15, 2021
Figure 1 for Multitask Prompted Training Enables Zero-Shot Task Generalization
Figure 2 for Multitask Prompted Training Enables Zero-Shot Task Generalization
Figure 3 for Multitask Prompted Training Enables Zero-Shot Task Generalization
Figure 4 for Multitask Prompted Training Enables Zero-Shot Task Generalization
Viaarxiv icon