Picture for Ian Magnusson

Ian Magnusson

Signal and Noise: A Framework for Reducing Uncertainty in Language Model Evaluation

Add code
Aug 18, 2025
Viaarxiv icon

DataDecide: How to Predict Best Pretraining Data with Small Experiments

Add code
Apr 15, 2025
Figure 1 for DataDecide: How to Predict Best Pretraining Data with Small Experiments
Figure 2 for DataDecide: How to Predict Best Pretraining Data with Small Experiments
Figure 3 for DataDecide: How to Predict Best Pretraining Data with Small Experiments
Figure 4 for DataDecide: How to Predict Best Pretraining Data with Small Experiments
Viaarxiv icon

Scalable Data Ablation Approximations for Language Models through Modular Training and Merging

Add code
Oct 21, 2024
Figure 1 for Scalable Data Ablation Approximations for Language Models through Modular Training and Merging
Figure 2 for Scalable Data Ablation Approximations for Language Models through Modular Training and Merging
Figure 3 for Scalable Data Ablation Approximations for Language Models through Modular Training and Merging
Figure 4 for Scalable Data Ablation Approximations for Language Models through Modular Training and Merging
Viaarxiv icon

OLMo: Accelerating the Science of Language Models

Add code
Feb 07, 2024
Figure 1 for OLMo: Accelerating the Science of Language Models
Figure 2 for OLMo: Accelerating the Science of Language Models
Figure 3 for OLMo: Accelerating the Science of Language Models
Figure 4 for OLMo: Accelerating the Science of Language Models
Viaarxiv icon

Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research

Add code
Jan 31, 2024
Figure 1 for Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research
Figure 2 for Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research
Figure 3 for Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research
Figure 4 for Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research
Viaarxiv icon

Paloma: A Benchmark for Evaluating Language Model Fit

Add code
Dec 16, 2023
Viaarxiv icon

Catwalk: A Unified Language Model Evaluation Framework for Many Datasets

Add code
Dec 15, 2023
Viaarxiv icon

What's In My Big Data?

Add code
Oct 31, 2023
Figure 1 for What's In My Big Data?
Figure 2 for What's In My Big Data?
Figure 3 for What's In My Big Data?
Figure 4 for What's In My Big Data?
Viaarxiv icon

Reproducibility in NLP: What Have We Learned from the Checklist?

Add code
Jun 16, 2023
Figure 1 for Reproducibility in NLP: What Have We Learned from the Checklist?
Figure 2 for Reproducibility in NLP: What Have We Learned from the Checklist?
Figure 3 for Reproducibility in NLP: What Have We Learned from the Checklist?
Figure 4 for Reproducibility in NLP: What Have We Learned from the Checklist?
Viaarxiv icon

Just-DREAM-about-it: Figurative Language Understanding with DREAM-FLUTE

Add code
Oct 28, 2022
Viaarxiv icon