Alert button
Picture for Aaditya K. Singh

Aaditya K. Singh

Alert button

What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation

Add code
Bookmark button
Alert button
Apr 10, 2024
Aaditya K. Singh, Ted Moskovitz, Felix Hill, Stephanie C. Y. Chan, Andrew M. Saxe

Viaarxiv icon

Tokenization counts: the impact of tokenization on arithmetic in frontier LLMs

Add code
Bookmark button
Alert button
Feb 22, 2024
Aaditya K. Singh, DJ Strouse

Viaarxiv icon

Decoding Data Quality via Synthetic Corruptions: Embedding-guided Pruning of Code Data

Add code
Bookmark button
Alert button
Dec 05, 2023
Yu Yang, Aaditya K. Singh, Mostafa Elhoushi, Anas Mahmoud, Kushal Tirumala, Fabian Gloeckle, Baptiste Rozière, Carole-Jean Wu, Ari S. Morcos, Newsha Ardalani

Viaarxiv icon

The Transient Nature of Emergent In-Context Learning in Transformers

Add code
Bookmark button
Alert button
Nov 15, 2023
Aaditya K. Singh, Stephanie C. Y. Chan, Ted Moskovitz, Erin Grant, Andrew M. Saxe, Felix Hill

Viaarxiv icon

Confronting Reward Model Overoptimization with Constrained RLHF

Add code
Bookmark button
Alert button
Oct 10, 2023
Ted Moskovitz, Aaditya K. Singh, DJ Strouse, Tuomas Sandholm, Ruslan Salakhutdinov, Anca D. Dragan, Stephen McAleer

Viaarxiv icon

Know your audience: specializing grounded language models with the game of Dixit

Add code
Bookmark button
Alert button
Jun 16, 2022
Aaditya K. Singh, David Ding, Andrew Saxe, Felix Hill, Andrew K. Lampinen

Figure 1 for Know your audience: specializing grounded language models with the game of Dixit
Figure 2 for Know your audience: specializing grounded language models with the game of Dixit
Figure 3 for Know your audience: specializing grounded language models with the game of Dixit
Figure 4 for Know your audience: specializing grounded language models with the game of Dixit
Viaarxiv icon