Picture for Timur Mudarisov

Timur Mudarisov

Geometric Analysis of Token Selection in Multi-Head Attention

Add code
Feb 02, 2026
Viaarxiv icon

Limitations of Normalization in Attention Mechanism

Add code
Aug 25, 2025
Viaarxiv icon