Picture for Stephanie C. Y. Chan

Stephanie C. Y. Chan

Representation biases: will we achieve complete understanding by analyzing representations?

Add code
Jul 29, 2025
Figure 1 for Representation biases: will we achieve complete understanding by analyzing representations?
Figure 2 for Representation biases: will we achieve complete understanding by analyzing representations?
Figure 3 for Representation biases: will we achieve complete understanding by analyzing representations?
Figure 4 for Representation biases: will we achieve complete understanding by analyzing representations?
Viaarxiv icon

The emergence of sparse attention: impact of data distribution and benefits of repetition

Add code
May 23, 2025
Viaarxiv icon

Predictability Shapes Adaptation: An Evolutionary Perspective on Modes of Learning in Transformers

Add code
May 14, 2025
Viaarxiv icon

On the generalization of language models from in-context learning and finetuning: a controlled study

Add code
May 01, 2025
Viaarxiv icon

Strategy Coopetition Explains the Emergence and Transience of In-Context Learning

Add code
Mar 07, 2025
Viaarxiv icon

The broader spectrum of in-context learning

Add code
Dec 05, 2024
Viaarxiv icon

Learned feature representations are biased by complexity, learning order, position, and more

Add code
May 09, 2024
Figure 1 for Learned feature representations are biased by complexity, learning order, position, and more
Figure 2 for Learned feature representations are biased by complexity, learning order, position, and more
Figure 3 for Learned feature representations are biased by complexity, learning order, position, and more
Figure 4 for Learned feature representations are biased by complexity, learning order, position, and more
Viaarxiv icon

What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation

Add code
Apr 10, 2024
Figure 1 for What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation
Figure 2 for What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation
Figure 3 for What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation
Figure 4 for What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation
Viaarxiv icon

The Transient Nature of Emergent In-Context Learning in Transformers

Add code
Nov 15, 2023
Figure 1 for The Transient Nature of Emergent In-Context Learning in Transformers
Figure 2 for The Transient Nature of Emergent In-Context Learning in Transformers
Figure 3 for The Transient Nature of Emergent In-Context Learning in Transformers
Figure 4 for The Transient Nature of Emergent In-Context Learning in Transformers
Viaarxiv icon

Transformers generalize differently from information stored in context vs in weights

Add code
Oct 11, 2022
Figure 1 for Transformers generalize differently from information stored in context vs in weights
Figure 2 for Transformers generalize differently from information stored in context vs in weights
Figure 3 for Transformers generalize differently from information stored in context vs in weights
Figure 4 for Transformers generalize differently from information stored in context vs in weights
Viaarxiv icon