Alert button
Picture for Aaron Mueller

Aaron Mueller

Alert button

[Call for Papers] The 2nd BabyLM Challenge: Sample-efficient pretraining on a developmentally plausible corpus

Add code
Bookmark button
Alert button
Apr 09, 2024
Leshem Choshen, Ryan Cotterell, Michael Y. Hu, Tal Linzen, Aaron Mueller, Candace Ross, Alex Warstadt, Ethan Wilcox, Adina Williams, Chengxu Zhuang

Viaarxiv icon

Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models

Add code
Bookmark button
Alert button
Mar 31, 2024
Samuel Marks, Can Rager, Eric J. Michaud, Yonatan Belinkov, David Bau, Aaron Mueller

Viaarxiv icon

In-context Learning Generalizes, But Not Always Robustly: The Case of Syntax

Add code
Bookmark button
Alert button
Nov 13, 2023
Aaron Mueller, Albert Webson, Jackson Petty, Tal Linzen

Viaarxiv icon

Function Vectors in Large Language Models

Add code
Bookmark button
Alert button
Oct 23, 2023
Eric Todd, Millicent L. Li, Arnab Sen Sharma, Aaron Mueller, Byron C. Wallace, David Bau

Viaarxiv icon

Meta-training with Demonstration Retrieval for Efficient Few-shot Learning

Add code
Bookmark button
Alert button
Jun 30, 2023
Aaron Mueller, Kanika Narang, Lambert Mathias, Qifan Wang, Hamed Firooz

Figure 1 for Meta-training with Demonstration Retrieval for Efficient Few-shot Learning
Figure 2 for Meta-training with Demonstration Retrieval for Efficient Few-shot Learning
Figure 3 for Meta-training with Demonstration Retrieval for Efficient Few-shot Learning
Figure 4 for Meta-training with Demonstration Retrieval for Efficient Few-shot Learning
Viaarxiv icon

Inverse Scaling: When Bigger Isn't Better

Add code
Bookmark button
Alert button
Jun 15, 2023
Ian R. McKenzie, Alexander Lyzhov, Michael Pieler, Alicia Parrish, Aaron Mueller, Ameya Prabhu, Euan McLean, Aaron Kirtland, Alexis Ross, Alisa Liu, Andrew Gritsevskiy, Daniel Wurgaft, Derik Kauffman, Gabriel Recchia, Jiacheng Liu, Joe Cavanagh, Max Weiss, Sicong Huang, The Floating Droid, Tom Tseng, Tomasz Korbak, Xudong Shen, Yuhui Zhang, Zhengping Zhou, Najoung Kim, Samuel R. Bowman, Ethan Perez

Figure 1 for Inverse Scaling: When Bigger Isn't Better
Figure 2 for Inverse Scaling: When Bigger Isn't Better
Figure 3 for Inverse Scaling: When Bigger Isn't Better
Figure 4 for Inverse Scaling: When Bigger Isn't Better
Viaarxiv icon

How to Plant Trees in Language Models: Data and Architectural Effects on the Emergence of Syntactic Inductive Biases

Add code
Bookmark button
Alert button
May 31, 2023
Aaron Mueller, Tal Linzen

Figure 1 for How to Plant Trees in Language Models: Data and Architectural Effects on the Emergence of Syntactic Inductive Biases
Figure 2 for How to Plant Trees in Language Models: Data and Architectural Effects on the Emergence of Syntactic Inductive Biases
Figure 3 for How to Plant Trees in Language Models: Data and Architectural Effects on the Emergence of Syntactic Inductive Biases
Figure 4 for How to Plant Trees in Language Models: Data and Architectural Effects on the Emergence of Syntactic Inductive Biases
Viaarxiv icon

Call for Papers -- The BabyLM Challenge: Sample-efficient pretraining on a developmentally plausible corpus

Add code
Bookmark button
Alert button
Jan 27, 2023
Alex Warstadt, Leshem Choshen, Aaron Mueller, Adina Williams, Ethan Wilcox, Chengxu Zhuang

Figure 1 for Call for Papers -- The BabyLM Challenge: Sample-efficient pretraining on a developmentally plausible corpus
Figure 2 for Call for Papers -- The BabyLM Challenge: Sample-efficient pretraining on a developmentally plausible corpus
Viaarxiv icon

Language model acceptability judgements are not always robust to context

Add code
Bookmark button
Alert button
Dec 18, 2022
Koustuv Sinha, Jon Gauthier, Aaron Mueller, Kanishka Misra, Keren Fuentes, Roger Levy, Adina Williams

Figure 1 for Language model acceptability judgements are not always robust to context
Figure 2 for Language model acceptability judgements are not always robust to context
Figure 3 for Language model acceptability judgements are not always robust to context
Figure 4 for Language model acceptability judgements are not always robust to context
Viaarxiv icon

Causal Analysis of Syntactic Agreement Neurons in Multilingual Language Models

Add code
Bookmark button
Alert button
Oct 25, 2022
Aaron Mueller, Yu Xia, Tal Linzen

Figure 1 for Causal Analysis of Syntactic Agreement Neurons in Multilingual Language Models
Figure 2 for Causal Analysis of Syntactic Agreement Neurons in Multilingual Language Models
Figure 3 for Causal Analysis of Syntactic Agreement Neurons in Multilingual Language Models
Figure 4 for Causal Analysis of Syntactic Agreement Neurons in Multilingual Language Models
Viaarxiv icon