Picture for Andrei Kanavalau

Andrei Kanavalau

Gated Removal of Normalization in Transformers Enables Stable Training and Efficient Inference

Add code
Feb 11, 2026
Viaarxiv icon