Picture for Hila Ofek

Hila Ofek

A Mechanistic Account of Attention Sinks in GPT-2: One Circuit, Broader Implications for Mitigation

Add code
Apr 16, 2026
Viaarxiv icon