Manoj Kumar

Frozen Feature Augmentation for Few-Shot Image Classification

Mar 15, 2024
Andreas Bär, Neil Houlsby, Mostafa Dehghani, Manoj Kumar

Image Captioners Are Scalable Vision Learners Too

Jun 13, 2023
Michael Tschannen, Manoj Kumar, Andreas Steiner, Xiaohua Zhai, Neil Houlsby, Lucas Beyer

Scaling Vision Transformers to 22 Billion Parameters

Feb 10, 2023
Mostafa Dehghani, Josip Djolonga, Basil Mustafa, Piotr Padlewski, Jonathan Heek, Justin Gilmer, Andreas Steiner, Mathilde Caron, Robert Geirhos, Ibrahim Alabdulmohsin, Rodolphe Jenatton, Lucas Beyer, Michael Tschannen, Anurag Arnab, Xiao Wang, Carlos Riquelme, Matthias Minderer, Joan Puigcerver, Utku Evci, Manoj Kumar, Sjoerd van Steenkiste, Gamaleldin F. Elsayed, Aravindh Mahendran, Fisher Yu, Avital Oliver, Fantine Huot, Jasmijn Bastings, Mark Patrick Collier, Alexey Gritsenko, Vighnesh Birodkar, Cristina Vasconcelos, Yi Tay, Thomas Mensink, Alexander Kolesnikov, Filip Pavetić, Dustin Tran, Thomas Kipf, Mario Lučić, Xiaohua Zhai, Daniel Keysers, Jeremiah Harmsen, Neil Houlsby

Dual PatchNorm

Feb 06, 2023
Manoj Kumar, Mostafa Dehghani, Neil Houlsby

Large language models can segment narrative events similarly to humans

Jan 24, 2023
Sebastian Michelmann, Manoj Kumar, Kenneth A. Norman, Mariya Toneva

A Unified Framework for Optimization-Based Graph Coarsening

Oct 02, 2022
Manoj Kumar, Anurag Sharma, Sandeep Kumar

Functional Optimization Reinforcement Learning for Real-Time Bidding

Jul 03, 2022
Changjie Lu, Yining Lu, Naina Bandyopadhyay, Manoj Kumar, Gaurav Gupta

On the surprising tradeoff between ImageNet accuracy and perceptual similarity

Mar 09, 2022
Manoj Kumar, Neil Houlsby, Nal Kalchbrenner, Ekin D. Cubuk
