Alert button
Picture for Anurag Arnab

Anurag Arnab

Alert button

End-to-End Spatio-Temporal Action Localisation with Video Transformers

Add code
Bookmark button
Alert button
Apr 24, 2023
Alexey Gritsenko, Xuehan Xiong, Josip Djolonga, Mostafa Dehghani, Chen Sun, Mario Lučić, Cordelia Schmid, Anurag Arnab

Figure 1 for End-to-End Spatio-Temporal Action Localisation with Video Transformers
Figure 2 for End-to-End Spatio-Temporal Action Localisation with Video Transformers
Figure 3 for End-to-End Spatio-Temporal Action Localisation with Video Transformers
Figure 4 for End-to-End Spatio-Temporal Action Localisation with Video Transformers
Viaarxiv icon

VicTR: Video-conditioned Text Representations for Activity Recognition

Add code
Bookmark button
Alert button
Apr 05, 2023
Kumara Kahatapitiya, Anurag Arnab, Arsha Nagrani, Michael S. Ryoo

Figure 1 for VicTR: Video-conditioned Text Representations for Activity Recognition
Figure 2 for VicTR: Video-conditioned Text Representations for Activity Recognition
Figure 3 for VicTR: Video-conditioned Text Representations for Activity Recognition
Figure 4 for VicTR: Video-conditioned Text Representations for Activity Recognition
Viaarxiv icon

CAT-Seg: Cost Aggregation for Open-Vocabulary Semantic Segmentation

Add code
Bookmark button
Alert button
Mar 21, 2023
Seokju Cho, Heeseong Shin, Sunghwan Hong, Seungjun An, Seungjun Lee, Anurag Arnab, Paul Hongsuck Seo, Seungryong Kim

Figure 1 for CAT-Seg: Cost Aggregation for Open-Vocabulary Semantic Segmentation
Figure 2 for CAT-Seg: Cost Aggregation for Open-Vocabulary Semantic Segmentation
Figure 3 for CAT-Seg: Cost Aggregation for Open-Vocabulary Semantic Segmentation
Figure 4 for CAT-Seg: Cost Aggregation for Open-Vocabulary Semantic Segmentation
Viaarxiv icon

Scaling Vision Transformers to 22 Billion Parameters

Add code
Bookmark button
Alert button
Feb 10, 2023
Mostafa Dehghani, Josip Djolonga, Basil Mustafa, Piotr Padlewski, Jonathan Heek, Justin Gilmer, Andreas Steiner, Mathilde Caron, Robert Geirhos, Ibrahim Alabdulmohsin, Rodolphe Jenatton, Lucas Beyer, Michael Tschannen, Anurag Arnab, Xiao Wang, Carlos Riquelme, Matthias Minderer, Joan Puigcerver, Utku Evci, Manoj Kumar, Sjoerd van Steenkiste, Gamaleldin F. Elsayed, Aravindh Mahendran, Fisher Yu, Avital Oliver, Fantine Huot, Jasmijn Bastings, Mark Patrick Collier, Alexey Gritsenko, Vighnesh Birodkar, Cristina Vasconcelos, Yi Tay, Thomas Mensink, Alexander Kolesnikov, Filip Pavetić, Dustin Tran, Thomas Kipf, Mario Lučić, Xiaohua Zhai, Daniel Keysers, Jeremiah Harmsen, Neil Houlsby

Figure 1 for Scaling Vision Transformers to 22 Billion Parameters
Figure 2 for Scaling Vision Transformers to 22 Billion Parameters
Figure 3 for Scaling Vision Transformers to 22 Billion Parameters
Figure 4 for Scaling Vision Transformers to 22 Billion Parameters
Viaarxiv icon

Adaptive Computation with Elastic Input Sequence

Add code
Bookmark button
Alert button
Jan 30, 2023
Fuzhao Xue, Valerii Likhosherstov, Anurag Arnab, Neil Houlsby, Mostafa Dehghani, Yang You

Figure 1 for Adaptive Computation with Elastic Input Sequence
Figure 2 for Adaptive Computation with Elastic Input Sequence
Figure 3 for Adaptive Computation with Elastic Input Sequence
Figure 4 for Adaptive Computation with Elastic Input Sequence
Viaarxiv icon

Audiovisual Masked Autoencoders

Add code
Bookmark button
Alert button
Dec 09, 2022
Mariana-Iuliana Georgescu, Eduardo Fonseca, Radu Tudor Ionescu, Mario Lucic, Cordelia Schmid, Anurag Arnab

Figure 1 for Audiovisual Masked Autoencoders
Figure 2 for Audiovisual Masked Autoencoders
Figure 3 for Audiovisual Masked Autoencoders
Figure 4 for Audiovisual Masked Autoencoders
Viaarxiv icon

Token Turing Machines

Add code
Bookmark button
Alert button
Nov 16, 2022
Michael S. Ryoo, Keerthana Gopalakrishnan, Kumara Kahatapitiya, Ted Xiao, Kanishka Rao, Austin Stone, Yao Lu, Julian Ibarz, Anurag Arnab

Figure 1 for Token Turing Machines
Figure 2 for Token Turing Machines
Figure 3 for Token Turing Machines
Figure 4 for Token Turing Machines
Viaarxiv icon

Dynamic Graph Message Passing Networks for Visual Recognition

Add code
Bookmark button
Alert button
Sep 20, 2022
Li Zhang, Mohan Chen, Anurag Arnab, Xiangyang Xue, Philip H. S. Torr

Figure 1 for Dynamic Graph Message Passing Networks for Visual Recognition
Figure 2 for Dynamic Graph Message Passing Networks for Visual Recognition
Figure 3 for Dynamic Graph Message Passing Networks for Visual Recognition
Figure 4 for Dynamic Graph Message Passing Networks for Visual Recognition
Viaarxiv icon

Beyond Transfer Learning: Co-finetuning for Action Localisation

Add code
Bookmark button
Alert button
Jul 08, 2022
Anurag Arnab, Xuehan Xiong, Alexey Gritsenko, Rob Romijnders, Josip Djolonga, Mostafa Dehghani, Chen Sun, Mario Lučić, Cordelia Schmid

Figure 1 for Beyond Transfer Learning: Co-finetuning for Action Localisation
Figure 2 for Beyond Transfer Learning: Co-finetuning for Action Localisation
Figure 3 for Beyond Transfer Learning: Co-finetuning for Action Localisation
Figure 4 for Beyond Transfer Learning: Co-finetuning for Action Localisation
Viaarxiv icon

M&M Mix: A Multimodal Multiview Transformer Ensemble

Add code
Bookmark button
Alert button
Jun 20, 2022
Xuehan Xiong, Anurag Arnab, Arsha Nagrani, Cordelia Schmid

Figure 1 for M&M Mix: A Multimodal Multiview Transformer Ensemble
Figure 2 for M&M Mix: A Multimodal Multiview Transformer Ensemble
Figure 3 for M&M Mix: A Multimodal Multiview Transformer Ensemble
Figure 4 for M&M Mix: A Multimodal Multiview Transformer Ensemble
Viaarxiv icon