Andrew M. Saxe

Distinct Computations Emerge From Compositional Curricula in In-Context Learning

Jun 16, 2025

Make Haste Slowly: A Theory of Emergent Structured Mixed Selectivity in Feature Learning ReLU Networks

Mar 08, 2025

Strategy Coopetition Explains the Emergence and Transience of In-Context Learning

Mar 07, 2025

Nonlinear dynamics of localization in neural receptive fields

Jan 28, 2025

Flexible task abstractions emerge in linear networks with fast and bounded units

Nov 06, 2024

From Lazy to Rich: Exact Learning Dynamics in Deep Linear Networks

Sep 22, 2024

What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation

Apr 10, 2024

When Representations Align: Universality in Representation Learning Dynamics

Feb 14, 2024

The Transient Nature of Emergent In-Context Learning in Transformers

Nov 15, 2023

Meta-Learning Strategies through Value Maximization in Neural Networks

Oct 30, 2023