Title: Understanding In-context Learning of Addition via Activation Subspaces

May 08, 2025

Xinyan Hu, Kayo Yin, Michael I. Jordan, Jacob Steinhardt, Lijie Chen

Abstract: To perform in-context learning, language models must extract signals from individual few-shot examples, aggregate these into a learned prediction rule, and then apply this rule to new examples. How is this implemented in the forward pass of modern transformer models? To study this, we consider a structured family of few-shot learning tasks for which the true prediction rule is to add an integer $k$ to the input. We find that Llama-3-8B attains high accuracy on this task for a range of $k$, and localize its few-shot ability to just three attention heads via a novel optimization approach. We further show the extracted signals lie in a six-dimensional subspace, where four of the dimensions track the unit digit and the other two dimensions track overall magnitude. We finally examine how these heads extract information from individual few-shot examples, identifying a self-correction mechanism in which mistakes from earlier examples are suppressed by later examples. Our results demonstrate how tracking low-dimensional subspaces across a forward pass can provide insight into fine-grained computational structures.
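To make the task family concrete, here is a minimal sketch of how an add-$k$ few-shot prompt could be built. The `x->y` line format, the input range, and the helper name `make_add_k_prompt` are illustrative assumptions, not details taken from the paper.

```python
import random


def make_add_k_prompt(k, n_shots, query, low=0, high=99, seed=0):
    """Build a few-shot prompt whose hidden rule is y = x + k.

    Hypothetical helper: the "x->y" format and the [low, high] input
    range are assumptions made for illustration only.
    """
    rng = random.Random(seed)  # local RNG so results are reproducible
    lines = []
    for _ in range(n_shots):
        x = rng.randint(low, high)      # random demonstration input
        lines.append(f"{x}->{x + k}")   # demonstration pair (x, x + k)
    lines.append(f"{query}->")          # query line left for the model to complete
    target = query + k                  # answer implied by the hidden rule
    return "\n".join(lines), target


prompt, target = make_add_k_prompt(k=3, n_shots=4, query=10)
print(prompt)
print("expected completion:", target)
```

The model never sees $k$ directly; it has to infer it from the demonstration pairs and then apply it to the final query line.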

* 16 pages
