Noah A. Smith

Paul G. Allen School of Computer Science & Engineering, University of Washington; Allen Institute for Artificial Intelligence

OLMo: Accelerating the Science of Language Models

Feb 07, 2024

Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research

Jan 31, 2024

Breaking the Curse of Multilinguality with Cross-lingual Expert Language Models

Jan 19, 2024

Tuning Language Models by Proxy

Jan 16, 2024

Time is Encoded in the Weights of Finetuned Language Models

Dec 30, 2023

Paloma: A Benchmark for Evaluating Language Model Fit

Dec 16, 2023

Language Models: A Guide for the Perplexed

Nov 29, 2023

Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2

Nov 20, 2023

Measuring and Improving Attentiveness to Partial Inputs with Counterfactuals

Nov 16, 2023

ACID: Abstractive, Content-Based IDs for Document Retrieval with Language Models

Nov 14, 2023