Aravindh Mahendran

Scaling Vision Transformers to 22 Billion Parameters

Feb 10, 2023
Mostafa Dehghani, Josip Djolonga, Basil Mustafa, Piotr Padlewski, Jonathan Heek, Justin Gilmer, Andreas Steiner, Mathilde Caron, Robert Geirhos, Ibrahim Alabdulmohsin, Rodolphe Jenatton, Lucas Beyer, Michael Tschannen, Anurag Arnab, Xiao Wang, Carlos Riquelme, Matthias Minderer, Joan Puigcerver, Utku Evci, Manoj Kumar, Sjoerd van Steenkiste, Gamaleldin F. Elsayed, Aravindh Mahendran, Fisher Yu, Avital Oliver, Fantine Huot, Jasmijn Bastings, Mark Patrick Collier, Alexey Gritsenko, Vighnesh Birodkar, Cristina Vasconcelos, Yi Tay, Thomas Mensink, Alexander Kolesnikov, Filip Pavetić, Dustin Tran, Thomas Kipf, Mario Lučić, Xiaohua Zhai, Daniel Keysers, Jeremiah Harmsen, Neil Houlsby

Invariant Slot Attention: Object Discovery with Slot-Centric Reference Frames

Feb 09, 2023
Ondrej Biza, Sjoerd van Steenkiste, Mehdi S. M. Sajjadi, Gamaleldin F. Elsayed, Aravindh Mahendran, Thomas Kipf

RUST: Latent Neural Scene Representations from Unposed Imagery

Nov 25, 2022
Mehdi S. M. Sajjadi, Aravindh Mahendran, Thomas Kipf, Etienne Pot, Daniel Duckworth, Mario Lučić, Klaus Greff

Iterative Patch Selection for High-Resolution Image Recognition

Oct 24, 2022
Benjamin Bergner, Christoph Lippert, Aravindh Mahendran

SAVi++: Towards End-to-End Object-Centric Learning from Real-World Videos

Jun 15, 2022
Gamaleldin F. Elsayed, Aravindh Mahendran, Sjoerd van Steenkiste, Klaus Greff, Michael C. Mozer, Thomas Kipf

Object Scene Representation Transformer

Jun 14, 2022
Mehdi S. M. Sajjadi, Daniel Duckworth, Aravindh Mahendran, Sjoerd van Steenkiste, Filip Pavetić, Mario Lučić, Leonidas J. Guibas, Klaus Greff, Thomas Kipf

Simple Open-Vocabulary Object Detection with Vision Transformers

May 12, 2022
Matthias Minderer, Alexey Gritsenko, Austin Stone, Maxim Neumann, Dirk Weissenborn, Alexey Dosovitskiy, Aravindh Mahendran, Anurag Arnab, Mostafa Dehghani, Zhuoran Shen, Xiao Wang, Xiaohua Zhai, Thomas Kipf, Neil Houlsby

Conditional Object-Centric Learning from Video

Nov 24, 2021
Thomas Kipf, Gamaleldin F. Elsayed, Aravindh Mahendran, Austin Stone, Sara Sabour, Georg Heigold, Rico Jonschkowski, Alexey Dosovitskiy, Klaus Greff

Differentiable Patch Selection for Image Recognition

Apr 07, 2021
Jean-Baptiste Cordonnier, Aravindh Mahendran, Alexey Dosovitskiy, Dirk Weissenborn, Jakob Uszkoreit, Thomas Unterthiner

Representation learning from videos in-the-wild: An object-centric approach

Oct 06, 2020
Rob Romijnders, Aravindh Mahendran, Michael Tschannen, Josip Djolonga, Marvin Ritter, Neil Houlsby, Mario Lučić
