Picture for Suchin Gururangan

Suchin Gururangan

Language models scale reliably with over-training and on downstream tasks

Add code
Mar 13, 2024
Figure 1 for Language models scale reliably with over-training and on downstream tasks
Figure 2 for Language models scale reliably with over-training and on downstream tasks
Figure 3 for Language models scale reliably with over-training and on downstream tasks
Figure 4 for Language models scale reliably with over-training and on downstream tasks
Viaarxiv icon

LESS: Selecting Influential Data for Targeted Instruction Tuning

Add code
Feb 20, 2024
Viaarxiv icon

Breaking the Curse of Multilinguality with Cross-lingual Expert Language Models

Add code
Jan 19, 2024
Viaarxiv icon

AboutMe: Using Self-Descriptions in Webpages to Document the Effects of English Pretraining Data Filters

Add code
Jan 16, 2024
Viaarxiv icon

Time is Encoded in the Weights of Finetuned Language Models

Add code
Dec 30, 2023
Viaarxiv icon

SILO Language Models: Isolating Legal Risk In a Nonparametric Datastore

Add code
Aug 08, 2023
Figure 1 for SILO Language Models: Isolating Legal Risk In a Nonparametric Datastore
Figure 2 for SILO Language Models: Isolating Legal Risk In a Nonparametric Datastore
Figure 3 for SILO Language Models: Isolating Legal Risk In a Nonparametric Datastore
Figure 4 for SILO Language Models: Isolating Legal Risk In a Nonparametric Datastore
Viaarxiv icon

Information Flow Control in Machine Learning through Modular Model Architecture

Add code
Jun 05, 2023
Figure 1 for Information Flow Control in Machine Learning through Modular Model Architecture
Figure 2 for Information Flow Control in Machine Learning through Modular Model Architecture
Figure 3 for Information Flow Control in Machine Learning through Modular Model Architecture
Figure 4 for Information Flow Control in Machine Learning through Modular Model Architecture
Viaarxiv icon

Scaling Expert Language Models with Unsupervised Domain Discovery

Add code
Mar 24, 2023
Figure 1 for Scaling Expert Language Models with Unsupervised Domain Discovery
Figure 2 for Scaling Expert Language Models with Unsupervised Domain Discovery
Figure 3 for Scaling Expert Language Models with Unsupervised Domain Discovery
Figure 4 for Scaling Expert Language Models with Unsupervised Domain Discovery
Viaarxiv icon

Editing Models with Task Arithmetic

Add code
Dec 08, 2022
Figure 1 for Editing Models with Task Arithmetic
Figure 2 for Editing Models with Task Arithmetic
Figure 3 for Editing Models with Task Arithmetic
Figure 4 for Editing Models with Task Arithmetic
Viaarxiv icon

M2D2: A Massively Multi-domain Language Modeling Dataset

Add code
Oct 13, 2022
Figure 1 for M2D2: A Massively Multi-domain Language Modeling Dataset
Figure 2 for M2D2: A Massively Multi-domain Language Modeling Dataset
Figure 3 for M2D2: A Massively Multi-domain Language Modeling Dataset
Figure 4 for M2D2: A Massively Multi-domain Language Modeling Dataset
Viaarxiv icon