Alert button
Picture for Suchin Gururangan

Suchin Gururangan

Alert button

Language models scale reliably with over-training and on downstream tasks

Add code
Bookmark button
Alert button
Mar 13, 2024
Samir Yitzhak Gadre, Georgios Smyrnis, Vaishaal Shankar, Suchin Gururangan, Mitchell Wortsman, Rulin Shao, Jean Mercat, Alex Fang, Jeffrey Li, Sedrick Keh, Rui Xin, Marianna Nezhurina, Igor Vasiljevic, Jenia Jitsev, Alexandros G. Dimakis, Gabriel Ilharco, Shuran Song, Thomas Kollar, Yair Carmon, Achal Dave, Reinhard Heckel, Niklas Muennighoff, Ludwig Schmidt

Figure 1 for Language models scale reliably with over-training and on downstream tasks
Figure 2 for Language models scale reliably with over-training and on downstream tasks
Figure 3 for Language models scale reliably with over-training and on downstream tasks
Figure 4 for Language models scale reliably with over-training and on downstream tasks
Viaarxiv icon

LESS: Selecting Influential Data for Targeted Instruction Tuning

Add code
Bookmark button
Alert button
Feb 20, 2024
Mengzhou Xia, Sadhika Malladi, Suchin Gururangan, Sanjeev Arora, Danqi Chen

Viaarxiv icon

Breaking the Curse of Multilinguality with Cross-lingual Expert Language Models

Add code
Bookmark button
Alert button
Jan 19, 2024
Terra Blevins, Tomasz Limisiewicz, Suchin Gururangan, Margaret Li, Hila Gonen, Noah A. Smith, Luke Zettlemoyer

Viaarxiv icon

AboutMe: Using Self-Descriptions in Webpages to Document the Effects of English Pretraining Data Filters

Add code
Bookmark button
Alert button
Jan 16, 2024
Li Lucy, Suchin Gururangan, Luca Soldaini, Emma Strubell, David Bamman, Lauren Klein, Jesse Dodge

Viaarxiv icon

Time is Encoded in the Weights of Finetuned Language Models

Add code
Bookmark button
Alert button
Dec 30, 2023
Kai Nylund, Suchin Gururangan, Noah A. Smith

Viaarxiv icon

SILO Language Models: Isolating Legal Risk In a Nonparametric Datastore

Add code
Bookmark button
Alert button
Aug 08, 2023
Sewon Min, Suchin Gururangan, Eric Wallace, Hannaneh Hajishirzi, Noah A. Smith, Luke Zettlemoyer

Figure 1 for SILO Language Models: Isolating Legal Risk In a Nonparametric Datastore
Figure 2 for SILO Language Models: Isolating Legal Risk In a Nonparametric Datastore
Figure 3 for SILO Language Models: Isolating Legal Risk In a Nonparametric Datastore
Figure 4 for SILO Language Models: Isolating Legal Risk In a Nonparametric Datastore
Viaarxiv icon

Information Flow Control in Machine Learning through Modular Model Architecture

Add code
Bookmark button
Alert button
Jun 05, 2023
Trishita Tiwari, Suchin Gururangan, Chuan Guo, Weizhe Hua, Sanjay Kariyappa, Udit Gupta, Wenjie Xiong, Kiwan Maeng, Hsien-Hsin S. Lee, G. Edward Suh

Figure 1 for Information Flow Control in Machine Learning through Modular Model Architecture
Figure 2 for Information Flow Control in Machine Learning through Modular Model Architecture
Figure 3 for Information Flow Control in Machine Learning through Modular Model Architecture
Figure 4 for Information Flow Control in Machine Learning through Modular Model Architecture
Viaarxiv icon

Scaling Expert Language Models with Unsupervised Domain Discovery

Add code
Bookmark button
Alert button
Mar 24, 2023
Suchin Gururangan, Margaret Li, Mike Lewis, Weijia Shi, Tim Althoff, Noah A. Smith, Luke Zettlemoyer

Figure 1 for Scaling Expert Language Models with Unsupervised Domain Discovery
Figure 2 for Scaling Expert Language Models with Unsupervised Domain Discovery
Figure 3 for Scaling Expert Language Models with Unsupervised Domain Discovery
Figure 4 for Scaling Expert Language Models with Unsupervised Domain Discovery
Viaarxiv icon