Margaret Li

Artificial Hivemind: The Open-Ended Homogeneity of Language Models (and Beyond)

Oct 27, 2025

FlexOlmo: Open Language Models for Flexible Data Use

Jul 09, 2025

Precise Information Control in Long-Form Text Generation

Jun 06, 2025

(Mis)Fitting: A Survey of Scaling Laws

Feb 26, 2025

Byte Latent Transformer: Patches Scale Better Than Tokens

Dec 13, 2024

Predicting vs. Acting: A Trade-off Between World Modeling & Agent Modeling

Jul 02, 2024

Breaking the Curse of Multilinguality with Cross-lingual Expert Language Models

Jan 19, 2024

In-Context Pretraining: Language Modeling Beyond Document Boundaries

Oct 20, 2023

Scaling Expert Language Models with Unsupervised Domain Discovery

Mar 24, 2023

Branch-Train-Merge: Embarrassingly Parallel Training of Expert Language Models

Aug 05, 2022