Luke Zettlemoyer

University of Washington

OLMo: Accelerating the Science of Language Models

Feb 07, 2024
Via arXiv

Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research

Jan 31, 2024
Via arXiv

Infini-gram: Scaling Unbounded n-gram Language Models to a Trillion Tokens

Jan 30, 2024
Via arXiv

Breaking the Curse of Multilinguality with Cross-lingual Expert Language Models

Jan 19, 2024
Via arXiv

PathFinder: Guided Search over Multi-Step Reasoning Paths

Dec 12, 2023
Via arXiv

Detecting Pretraining Data from Large Language Models

Nov 03, 2023
Via arXiv

In-Context Pretraining: Language Modeling Beyond Document Boundaries

Oct 20, 2023
Via arXiv

Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging

Oct 17, 2023
Via arXiv

RA-DIT: Retrieval-Augmented Dual Instruction Tuning

Oct 08, 2023
Via arXiv

Demystifying CLIP Data

Oct 02, 2023
Via arXiv