Alert button
Picture for Lucas Beyer

Lucas Beyer

Alert button

A Simple Single-Scale Vision Transformer for Object Localization and Instance Segmentation

Add code
Bookmark button
Alert button
Dec 17, 2021
Wuyang Chen, Xianzhi Du, Fan Yang, Lucas Beyer, Xiaohua Zhai, Tsung-Yi Lin, Huizhong Chen, Jing Li, Xiaodan Song, Zhangyang Wang, Denny Zhou

Figure 1 for A Simple Single-Scale Vision Transformer for Object Localization and Instance Segmentation
Figure 2 for A Simple Single-Scale Vision Transformer for Object Localization and Instance Segmentation
Figure 3 for A Simple Single-Scale Vision Transformer for Object Localization and Instance Segmentation
Figure 4 for A Simple Single-Scale Vision Transformer for Object Localization and Instance Segmentation
Viaarxiv icon

LiT: Zero-Shot Transfer with Locked-image Text Tuning

Add code
Bookmark button
Alert button
Nov 15, 2021
Xiaohua Zhai, Xiao Wang, Basil Mustafa, Andreas Steiner, Daniel Keysers, Alexander Kolesnikov, Lucas Beyer

Figure 1 for LiT: Zero-Shot Transfer with Locked-image Text Tuning
Figure 2 for LiT: Zero-Shot Transfer with Locked-image Text Tuning
Figure 3 for LiT: Zero-Shot Transfer with Locked-image Text Tuning
Figure 4 for LiT: Zero-Shot Transfer with Locked-image Text Tuning
Viaarxiv icon

The Efficiency Misnomer

Add code
Bookmark button
Alert button
Oct 25, 2021
Mostafa Dehghani, Anurag Arnab, Lucas Beyer, Ashish Vaswani, Yi Tay

Figure 1 for The Efficiency Misnomer
Figure 2 for The Efficiency Misnomer
Figure 3 for The Efficiency Misnomer
Viaarxiv icon

How to train your ViT? Data, Augmentation, and Regularization in Vision Transformers

Add code
Bookmark button
Alert button
Jun 18, 2021
Andreas Steiner, Alexander Kolesnikov, Xiaohua Zhai, Ross Wightman, Jakob Uszkoreit, Lucas Beyer

Figure 1 for How to train your ViT? Data, Augmentation, and Regularization in Vision Transformers
Figure 2 for How to train your ViT? Data, Augmentation, and Regularization in Vision Transformers
Figure 3 for How to train your ViT? Data, Augmentation, and Regularization in Vision Transformers
Figure 4 for How to train your ViT? Data, Augmentation, and Regularization in Vision Transformers
Viaarxiv icon

Knowledge distillation: A good teacher is patient and consistent

Add code
Bookmark button
Alert button
Jun 09, 2021
Lucas Beyer, Xiaohua Zhai, Amélie Royer, Larisa Markeeva, Rohan Anil, Alexander Kolesnikov

Figure 1 for Knowledge distillation: A good teacher is patient and consistent
Figure 2 for Knowledge distillation: A good teacher is patient and consistent
Figure 3 for Knowledge distillation: A good teacher is patient and consistent
Figure 4 for Knowledge distillation: A good teacher is patient and consistent
Viaarxiv icon

Scaling Vision Transformers

Add code
Bookmark button
Alert button
Jun 08, 2021
Xiaohua Zhai, Alexander Kolesnikov, Neil Houlsby, Lucas Beyer

Figure 1 for Scaling Vision Transformers
Figure 2 for Scaling Vision Transformers
Figure 3 for Scaling Vision Transformers
Figure 4 for Scaling Vision Transformers
Viaarxiv icon

MLP-Mixer: An all-MLP Architecture for Vision

Add code
Bookmark button
Alert button
May 17, 2021
Ilya Tolstikhin, Neil Houlsby, Alexander Kolesnikov, Lucas Beyer, Xiaohua Zhai, Thomas Unterthiner, Jessica Yung, Andreas Steiner, Daniel Keysers, Jakob Uszkoreit, Mario Lucic, Alexey Dosovitskiy

Figure 1 for MLP-Mixer: An all-MLP Architecture for Vision
Figure 2 for MLP-Mixer: An all-MLP Architecture for Vision
Figure 3 for MLP-Mixer: An all-MLP Architecture for Vision
Figure 4 for MLP-Mixer: An all-MLP Architecture for Vision
Viaarxiv icon

SI-Score: An image dataset for fine-grained analysis of robustness to object location, rotation and size

Add code
Bookmark button
Alert button
Apr 09, 2021
Jessica Yung, Rob Romijnders, Alexander Kolesnikov, Lucas Beyer, Josip Djolonga, Neil Houlsby, Sylvain Gelly, Mario Lucic, Xiaohua Zhai

Figure 1 for SI-Score: An image dataset for fine-grained analysis of robustness to object location, rotation and size
Figure 2 for SI-Score: An image dataset for fine-grained analysis of robustness to object location, rotation and size
Figure 3 for SI-Score: An image dataset for fine-grained analysis of robustness to object location, rotation and size
Figure 4 for SI-Score: An image dataset for fine-grained analysis of robustness to object location, rotation and size
Viaarxiv icon

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

Add code
Bookmark button
Alert button
Oct 22, 2020
Alexey Dosovitskiy, Lucas Beyer, Alexander Kolesnikov, Dirk Weissenborn, Xiaohua Zhai, Thomas Unterthiner, Mostafa Dehghani, Matthias Minderer, Georg Heigold, Sylvain Gelly, Jakob Uszkoreit, Neil Houlsby

Figure 1 for An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Figure 2 for An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Figure 3 for An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Figure 4 for An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Viaarxiv icon