Picture for Yiqiao Zhong

Yiqiao Zhong

Task Vector Geometry Underlies Dual Modes of Task Inference in Transformers

Add code
May 05, 2026
Viaarxiv icon

How Does Unfaithful Reasoning Emerge from Autoregressive Training? A Study of Synthetic Experiments

Add code
Feb 01, 2026
Viaarxiv icon

Shattered Compositionality: Counterintuitive Learning Dynamics of Transformers for Arithmetic

Add code
Jan 30, 2026
Viaarxiv icon

Unifying Attention Heads and Task Vectors via Hidden State Geometry in In-Context Learning

Add code
May 24, 2025
Viaarxiv icon

Assessing and improving reliability of neighbor embedding methods: a map-continuity perspective

Add code
Oct 22, 2024
Figure 1 for Assessing and improving reliability of neighbor embedding methods: a map-continuity perspective
Figure 2 for Assessing and improving reliability of neighbor embedding methods: a map-continuity perspective
Figure 3 for Assessing and improving reliability of neighbor embedding methods: a map-continuity perspective
Figure 4 for Assessing and improving reliability of neighbor embedding methods: a map-continuity perspective
Viaarxiv icon

How does Multi-Task Training Affect Transformer In-Context Capabilities? Investigations with Function Classes

Add code
Apr 04, 2024
Viaarxiv icon

Uncovering hidden geometry in Transformers via disentangling position and context

Add code
Oct 07, 2023
Figure 1 for Uncovering hidden geometry in Transformers via disentangling position and context
Figure 2 for Uncovering hidden geometry in Transformers via disentangling position and context
Figure 3 for Uncovering hidden geometry in Transformers via disentangling position and context
Figure 4 for Uncovering hidden geometry in Transformers via disentangling position and context
Viaarxiv icon

Unraveling Projection Heads in Contrastive Learning: Insights from Expansion and Shrinkage

Add code
Jun 06, 2023
Figure 1 for Unraveling Projection Heads in Contrastive Learning: Insights from Expansion and Shrinkage
Figure 2 for Unraveling Projection Heads in Contrastive Learning: Insights from Expansion and Shrinkage
Figure 3 for Unraveling Projection Heads in Contrastive Learning: Insights from Expansion and Shrinkage
Figure 4 for Unraveling Projection Heads in Contrastive Learning: Insights from Expansion and Shrinkage
Viaarxiv icon

Tractability from overparametrization: The example of the negative perceptron

Add code
Oct 28, 2021
Figure 1 for Tractability from overparametrization: The example of the negative perceptron
Figure 2 for Tractability from overparametrization: The example of the negative perceptron
Figure 3 for Tractability from overparametrization: The example of the negative perceptron
Figure 4 for Tractability from overparametrization: The example of the negative perceptron
Viaarxiv icon

The Interpolation Phase Transition in Neural Networks: Memorization and Generalization under Lazy Training

Add code
Jul 25, 2020
Figure 1 for The Interpolation Phase Transition in Neural Networks: Memorization and Generalization under Lazy Training
Figure 2 for The Interpolation Phase Transition in Neural Networks: Memorization and Generalization under Lazy Training
Figure 3 for The Interpolation Phase Transition in Neural Networks: Memorization and Generalization under Lazy Training
Figure 4 for The Interpolation Phase Transition in Neural Networks: Memorization and Generalization under Lazy Training
Viaarxiv icon