Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Hagay Michaeli

Alias-Free ViT: Fractional Shift Invariance via Linear Attention

Oct 26, 2025

Hagay Michaeli, Daniel Soudry

Abstract:Transformers have emerged as a competitive alternative to convnets in vision tasks, yet they lack the architectural inductive bias of convnets, which may hinder their potential performance. Specifically, Vision Transformers (ViTs) are not translation-invariant and are more sensitive to minor image translations than standard convnets. Previous studies have shown, however, that convnets are also not perfectly shift-invariant, due to aliasing in downsampling and nonlinear layers. Consequently, anti-aliasing approaches have been proposed to certify convnets' translation robustness. Building on this line of work, we propose an Alias-Free ViT, which combines two main components. First, it uses alias-free downsampling and nonlinearities. Second, it uses linear cross-covariance attention that is shift-equivariant to both integer and fractional translations, enabling a shift-invariant global representation. Our model maintains competitive performance in image classification and outperforms similar-sized models in terms of robustness to adversarial translations.

* Accepted at NeurIPS 2025. Code is available at https://github.com/hmichaeli/alias_free_vit

Via

Access Paper or Ask Questions

Alias-Free Convnets: Fractional Shift Invariance via Polynomial Activations

Mar 15, 2023

Hagay Michaeli, Tomer Michaeli, Daniel Soudry

Abstract:Although CNNs are believed to be invariant to translations, recent works have shown this is not the case, due to aliasing effects that stem from downsampling layers. The existing architectural solutions to prevent aliasing are partial since they do not solve these effects, that originate in non-linearities. We propose an extended anti-aliasing method that tackles both downsampling and non-linear layers, thus creating truly alias-free, shift-invariant CNNs. We show that the presented model is invariant to integer as well as fractional (i.e., sub-pixel) translations, thus outperforming other shift-invariant methods in terms of robustness to adversarial translations.

* The paper was accepted to CVPR 2023. Our code is available at https://github.com/hmichaeli/alias_free_convnets/

Via

Access Paper or Ask Questions