Picture for Jesse Dodge

Jesse Dodge

OLMES: A Standard for Language Model Evaluations

Add code
Jun 12, 2024
Figure 1 for OLMES: A Standard for Language Model Evaluations
Figure 2 for OLMES: A Standard for Language Model Evaluations
Figure 3 for OLMES: A Standard for Language Model Evaluations
Figure 4 for OLMES: A Standard for Language Model Evaluations
Viaarxiv icon

OLMo: Accelerating the Science of Language Models

Add code
Feb 07, 2024
Figure 1 for OLMo: Accelerating the Science of Language Models
Figure 2 for OLMo: Accelerating the Science of Language Models
Figure 3 for OLMo: Accelerating the Science of Language Models
Figure 4 for OLMo: Accelerating the Science of Language Models
Viaarxiv icon

Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research

Add code
Jan 31, 2024
Figure 1 for Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research
Figure 2 for Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research
Figure 3 for Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research
Figure 4 for Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research
Viaarxiv icon

AboutMe: Using Self-Descriptions in Webpages to Document the Effects of English Pretraining Data Filters

Add code
Jan 16, 2024
Viaarxiv icon

Paloma: A Benchmark for Evaluating Language Model Fit

Add code
Dec 16, 2023
Figure 1 for Paloma: A Benchmark for Evaluating Language Model Fit
Figure 2 for Paloma: A Benchmark for Evaluating Language Model Fit
Figure 3 for Paloma: A Benchmark for Evaluating Language Model Fit
Figure 4 for Paloma: A Benchmark for Evaluating Language Model Fit
Viaarxiv icon

Catwalk: A Unified Language Model Evaluation Framework for Many Datasets

Add code
Dec 15, 2023
Viaarxiv icon

What's In My Big Data?

Add code
Oct 31, 2023
Figure 1 for What's In My Big Data?
Figure 2 for What's In My Big Data?
Figure 3 for What's In My Big Data?
Figure 4 for What's In My Big Data?
Viaarxiv icon

Language Models Hallucinate, but May Excel at Fact Verification

Add code
Oct 23, 2023
Viaarxiv icon

The Rise of Open Science: Tracking the Evolution and Perceived Value of Data and Methods Link-Sharing Practices

Add code
Oct 04, 2023
Figure 1 for The Rise of Open Science: Tracking the Evolution and Perceived Value of Data and Methods Link-Sharing Practices
Figure 2 for The Rise of Open Science: Tracking the Evolution and Perceived Value of Data and Methods Link-Sharing Practices
Figure 3 for The Rise of Open Science: Tracking the Evolution and Perceived Value of Data and Methods Link-Sharing Practices
Figure 4 for The Rise of Open Science: Tracking the Evolution and Perceived Value of Data and Methods Link-Sharing Practices
Viaarxiv icon

Efficiency Pentathlon: A Standardized Arena for Efficiency Evaluation

Add code
Jul 19, 2023
Figure 1 for Efficiency Pentathlon: A Standardized Arena for Efficiency Evaluation
Figure 2 for Efficiency Pentathlon: A Standardized Arena for Efficiency Evaluation
Figure 3 for Efficiency Pentathlon: A Standardized Arena for Efficiency Evaluation
Figure 4 for Efficiency Pentathlon: A Standardized Arena for Efficiency Evaluation
Viaarxiv icon