Picture for Timur Mudarisov

Timur Mudarisov

Geometric Analysis of Token Selection in Multi-Head Attention

Add code
Feb 02, 2026
Viaarxiv icon

Limitations of Normalization in Attention Mechanism

Add code
Aug 25, 2025
Figure 1 for Limitations of Normalization in Attention Mechanism
Figure 2 for Limitations of Normalization in Attention Mechanism
Figure 3 for Limitations of Normalization in Attention Mechanism
Figure 4 for Limitations of Normalization in Attention Mechanism
Viaarxiv icon