Picture for Claudio Altafini

Claudio Altafini

Analogies between Transformer Layers and Power Method

Add code
May 25, 2026
Viaarxiv icon

Multistability of Self-Attention Dynamics in Transformers

Add code
Nov 14, 2025
Viaarxiv icon

Gradient Flow Equations for Deep Linear Neural Networks: A Survey from a Network Perspective

Add code
Nov 13, 2025
Figure 1 for Gradient Flow Equations for Deep Linear Neural Networks: A Survey from a Network Perspective
Figure 2 for Gradient Flow Equations for Deep Linear Neural Networks: A Survey from a Network Perspective
Figure 3 for Gradient Flow Equations for Deep Linear Neural Networks: A Survey from a Network Perspective
Figure 4 for Gradient Flow Equations for Deep Linear Neural Networks: A Survey from a Network Perspective
Viaarxiv icon