Alert button
Picture for Aaron Mueller

Aaron Mueller

Alert button

In-context Learning Generalizes, But Not Always Robustly: The Case of Syntax

Nov 13, 2023
Aaron Mueller, Albert Webson, Jackson Petty, Tal Linzen

Viaarxiv icon

Function Vectors in Large Language Models

Oct 23, 2023
Eric Todd, Millicent L. Li, Arnab Sen Sharma, Aaron Mueller, Byron C. Wallace, David Bau

Viaarxiv icon

Meta-training with Demonstration Retrieval for Efficient Few-shot Learning

Jun 30, 2023
Aaron Mueller, Kanika Narang, Lambert Mathias, Qifan Wang, Hamed Firooz

Figure 1 for Meta-training with Demonstration Retrieval for Efficient Few-shot Learning
Figure 2 for Meta-training with Demonstration Retrieval for Efficient Few-shot Learning
Figure 3 for Meta-training with Demonstration Retrieval for Efficient Few-shot Learning
Figure 4 for Meta-training with Demonstration Retrieval for Efficient Few-shot Learning
Viaarxiv icon

Inverse Scaling: When Bigger Isn't Better

Jun 15, 2023
Ian R. McKenzie, Alexander Lyzhov, Michael Pieler, Alicia Parrish, Aaron Mueller, Ameya Prabhu, Euan McLean, Aaron Kirtland, Alexis Ross, Alisa Liu, Andrew Gritsevskiy, Daniel Wurgaft, Derik Kauffman, Gabriel Recchia, Jiacheng Liu, Joe Cavanagh, Max Weiss, Sicong Huang, The Floating Droid, Tom Tseng, Tomasz Korbak, Xudong Shen, Yuhui Zhang, Zhengping Zhou, Najoung Kim, Samuel R. Bowman, Ethan Perez

Figure 1 for Inverse Scaling: When Bigger Isn't Better
Figure 2 for Inverse Scaling: When Bigger Isn't Better
Figure 3 for Inverse Scaling: When Bigger Isn't Better
Figure 4 for Inverse Scaling: When Bigger Isn't Better
Viaarxiv icon

How to Plant Trees in Language Models: Data and Architectural Effects on the Emergence of Syntactic Inductive Biases

May 31, 2023
Aaron Mueller, Tal Linzen

Figure 1 for How to Plant Trees in Language Models: Data and Architectural Effects on the Emergence of Syntactic Inductive Biases
Figure 2 for How to Plant Trees in Language Models: Data and Architectural Effects on the Emergence of Syntactic Inductive Biases
Figure 3 for How to Plant Trees in Language Models: Data and Architectural Effects on the Emergence of Syntactic Inductive Biases
Figure 4 for How to Plant Trees in Language Models: Data and Architectural Effects on the Emergence of Syntactic Inductive Biases
Viaarxiv icon

Call for Papers -- The BabyLM Challenge: Sample-efficient pretraining on a developmentally plausible corpus

Jan 27, 2023
Alex Warstadt, Leshem Choshen, Aaron Mueller, Adina Williams, Ethan Wilcox, Chengxu Zhuang

Figure 1 for Call for Papers -- The BabyLM Challenge: Sample-efficient pretraining on a developmentally plausible corpus
Figure 2 for Call for Papers -- The BabyLM Challenge: Sample-efficient pretraining on a developmentally plausible corpus
Viaarxiv icon

Language model acceptability judgements are not always robust to context

Dec 18, 2022
Koustuv Sinha, Jon Gauthier, Aaron Mueller, Kanishka Misra, Keren Fuentes, Roger Levy, Adina Williams

Figure 1 for Language model acceptability judgements are not always robust to context
Figure 2 for Language model acceptability judgements are not always robust to context
Figure 3 for Language model acceptability judgements are not always robust to context
Figure 4 for Language model acceptability judgements are not always robust to context
Viaarxiv icon

Causal Analysis of Syntactic Agreement Neurons in Multilingual Language Models

Oct 25, 2022
Aaron Mueller, Yu Xia, Tal Linzen

Figure 1 for Causal Analysis of Syntactic Agreement Neurons in Multilingual Language Models
Figure 2 for Causal Analysis of Syntactic Agreement Neurons in Multilingual Language Models
Figure 3 for Causal Analysis of Syntactic Agreement Neurons in Multilingual Language Models
Figure 4 for Causal Analysis of Syntactic Agreement Neurons in Multilingual Language Models
Viaarxiv icon

What Do NLP Researchers Believe? Results of the NLP Community Metasurvey

Aug 26, 2022
Julian Michael, Ari Holtzman, Alicia Parrish, Aaron Mueller, Alex Wang, Angelica Chen, Divyam Madaan, Nikita Nangia, Richard Yuanzhe Pang, Jason Phang, Samuel R. Bowman

Figure 1 for What Do NLP Researchers Believe? Results of the NLP Community Metasurvey
Figure 2 for What Do NLP Researchers Believe? Results of the NLP Community Metasurvey
Figure 3 for What Do NLP Researchers Believe? Results of the NLP Community Metasurvey
Figure 4 for What Do NLP Researchers Believe? Results of the NLP Community Metasurvey
Viaarxiv icon

Label Semantic Aware Pre-training for Few-shot Text Classification

Apr 14, 2022
Aaron Mueller, Jason Krone, Salvatore Romeo, Saab Mansour, Elman Mansimov, Yi Zhang, Dan Roth

Figure 1 for Label Semantic Aware Pre-training for Few-shot Text Classification
Figure 2 for Label Semantic Aware Pre-training for Few-shot Text Classification
Figure 3 for Label Semantic Aware Pre-training for Few-shot Text Classification
Figure 4 for Label Semantic Aware Pre-training for Few-shot Text Classification
Viaarxiv icon