Emma Strubell

Source-Aware Training Enables Knowledge Attribution in Language Models

Apr 11, 2024
Muhammad Khalifa, David Wadden, Emma Strubell, Honglak Lee, Lu Wang, Iz Beltagy, Hao Peng

Via arXiv

OLMo: Accelerating the Science of Language Models

Feb 07, 2024
Dirk Groeneveld, Iz Beltagy, Pete Walsh, Akshita Bhagia, Rodney Kinney, Oyvind Tafjord, Ananya Harsh Jha, Hamish Ivison, Ian Magnusson, Yizhong Wang, Shane Arora, David Atkinson, Russell Authur, Khyathi Raghavi Chandu, Arman Cohan, Jennifer Dumas, Yanai Elazar, Yuling Gu, Jack Hessel, Tushar Khot, William Merrill, Jacob Morrison, Niklas Muennighoff, Aakanksha Naik, Crystal Nam, Matthew E. Peters, Valentina Pyatkin, Abhilasha Ravichander, Dustin Schwenk, Saurabh Shah, Will Smith, Emma Strubell, Nishant Subramani, Mitchell Wortsman, Pradeep Dasigi, Nathan Lambert, Kyle Richardson, Luke Zettlemoyer, Jesse Dodge, Kyle Lo, Luca Soldaini, Noah A. Smith, Hannaneh Hajishirzi

Via arXiv

Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research

Jan 31, 2024
Luca Soldaini, Rodney Kinney, Akshita Bhagia, Dustin Schwenk, David Atkinson, Russell Authur, Ben Bogin, Khyathi Chandu, Jennifer Dumas, Yanai Elazar, Valentin Hofmann, Ananya Harsh Jha, Sachin Kumar, Li Lucy, Xinxi Lyu, Nathan Lambert, Ian Magnusson, Jacob Morrison, Niklas Muennighoff, Aakanksha Naik, Crystal Nam, Matthew E. Peters, Abhilasha Ravichander, Kyle Richardson, Zejiang Shen, Emma Strubell, Nishant Subramani, Oyvind Tafjord, Pete Walsh, Luke Zettlemoyer, Noah A. Smith, Hannaneh Hajishirzi, Iz Beltagy, Dirk Groeneveld, Jesse Dodge, Kyle Lo

Via arXiv

AboutMe: Using Self-Descriptions in Webpages to Document the Effects of English Pretraining Data Filters

Jan 16, 2024
Li Lucy, Suchin Gururangan, Luca Soldaini, Emma Strubell, David Bamman, Lauren Klein, Jesse Dodge

Via arXiv

Understanding the Effect of Model Compression on Social Bias in Large Language Models

Dec 12, 2023
Gustavo Gonçalves, Emma Strubell

Via arXiv

Power Hungry Processing: Watts Driving the Cost of AI Deployment?

Nov 28, 2023
Alexandra Sasha Luccioni, Yacine Jernite, Emma Strubell

Via arXiv

Energy and Carbon Considerations of Fine-Tuning BERT

Nov 17, 2023
Xiaorong Wang, Clara Na, Emma Strubell, Sorelle Friedler, Sasha Luccioni

Via arXiv

Making Scalable Meta Learning Practical

Oct 23, 2023
Sang Keun Choe, Sanket Vaibhav Mehta, Hwijeen Ahn, Willie Neiswanger, Pengtao Xie, Emma Strubell, Eric Xing

Via arXiv

To Build Our Future, We Must Know Our Past: Contextualizing Paradigm Shifts in Natural Language Processing

Oct 11, 2023
Sireesh Gururaja, Amanda Bertsch, Clara Na, David Gray Widder, Emma Strubell

Via arXiv