Alert button
Picture for Karttikeya Mangalam

Karttikeya Mangalam

Alert button

Reversible Vision Transformers

Add code
Bookmark button
Alert button
Feb 09, 2023
Karttikeya Mangalam, Haoqi Fan, Yanghao Li, Chao-Yuan Wu, Bo Xiong, Christoph Feichtenhofer, Jitendra Malik

Figure 1 for Reversible Vision Transformers
Figure 2 for Reversible Vision Transformers
Figure 3 for Reversible Vision Transformers
Figure 4 for Reversible Vision Transformers
Viaarxiv icon

Does unsupervised grammar induction need pixels?

Add code
Bookmark button
Alert button
Dec 20, 2022
Boyi Li, Rodolfo Corona, Karttikeya Mangalam, Catherine Chen, Daniel Flaherty, Serge Belongie, Kilian Q. Weinberger, Jitendra Malik, Trevor Darrell, Dan Klein

Figure 1 for Does unsupervised grammar induction need pixels?
Figure 2 for Does unsupervised grammar induction need pixels?
Figure 3 for Does unsupervised grammar induction need pixels?
Figure 4 for Does unsupervised grammar induction need pixels?
Viaarxiv icon

Re^2TAL: Rewiring Pretrained Video Backbones for Reversible Temporal Action Localization

Add code
Bookmark button
Alert button
Nov 25, 2022
Chen Zhao, Shuming Liu, Karttikeya Mangalam, Bernard Ghanem

Figure 1 for Re^2TAL: Rewiring Pretrained Video Backbones for Reversible Temporal Action Localization
Figure 2 for Re^2TAL: Rewiring Pretrained Video Backbones for Reversible Temporal Action Localization
Figure 3 for Re^2TAL: Rewiring Pretrained Video Backbones for Reversible Temporal Action Localization
Figure 4 for Re^2TAL: Rewiring Pretrained Video Backbones for Reversible Temporal Action Localization
Viaarxiv icon

Structured Video Tokens @ Ego4D PNR Temporal Localization Challenge 2022

Add code
Bookmark button
Alert button
Jun 15, 2022
Elad Ben-Avraham, Roei Herzig, Karttikeya Mangalam, Amir Bar, Anna Rohrbach, Leonid Karlinsky, Trevor Darrell, Amir Globerson

Figure 1 for Structured Video Tokens @ Ego4D PNR Temporal Localization Challenge 2022
Figure 2 for Structured Video Tokens @ Ego4D PNR Temporal Localization Challenge 2022
Figure 3 for Structured Video Tokens @ Ego4D PNR Temporal Localization Challenge 2022
Viaarxiv icon

Bringing Image Scene Structure to Video via Frame-Clip Consistency of Object Tokens

Add code
Bookmark button
Alert button
Jun 15, 2022
Elad Ben-Avraham, Roei Herzig, Karttikeya Mangalam, Amir Bar, Anna Rohrbach, Leonid Karlinsky, Trevor Darrell, Amir Globerson

Figure 1 for Bringing Image Scene Structure to Video via Frame-Clip Consistency of Object Tokens
Figure 2 for Bringing Image Scene Structure to Video via Frame-Clip Consistency of Object Tokens
Figure 3 for Bringing Image Scene Structure to Video via Frame-Clip Consistency of Object Tokens
Figure 4 for Bringing Image Scene Structure to Video via Frame-Clip Consistency of Object Tokens
Viaarxiv icon

Squeezeformer: An Efficient Transformer for Automatic Speech Recognition

Add code
Bookmark button
Alert button
Jun 02, 2022
Sehoon Kim, Amir Gholami, Albert Shaw, Nicholas Lee, Karttikeya Mangalam, Jitendra Malik, Michael W. Mahoney, Kurt Keutzer

Figure 1 for Squeezeformer: An Efficient Transformer for Automatic Speech Recognition
Figure 2 for Squeezeformer: An Efficient Transformer for Automatic Speech Recognition
Figure 3 for Squeezeformer: An Efficient Transformer for Automatic Speech Recognition
Figure 4 for Squeezeformer: An Efficient Transformer for Automatic Speech Recognition
Viaarxiv icon

MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition

Add code
Bookmark button
Alert button
Jan 20, 2022
Chao-Yuan Wu, Yanghao Li, Karttikeya Mangalam, Haoqi Fan, Bo Xiong, Jitendra Malik, Christoph Feichtenhofer

Figure 1 for MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition
Figure 2 for MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition
Figure 3 for MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition
Figure 4 for MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition
Viaarxiv icon

Overcoming Mode Collapse with Adaptive Multi Adversarial Training

Add code
Bookmark button
Alert button
Dec 29, 2021
Karttikeya Mangalam, Rohin Garg

Figure 1 for Overcoming Mode Collapse with Adaptive Multi Adversarial Training
Figure 2 for Overcoming Mode Collapse with Adaptive Multi Adversarial Training
Figure 3 for Overcoming Mode Collapse with Adaptive Multi Adversarial Training
Figure 4 for Overcoming Mode Collapse with Adaptive Multi Adversarial Training
Viaarxiv icon

Improved Multiscale Vision Transformers for Classification and Detection

Add code
Bookmark button
Alert button
Dec 02, 2021
Yanghao Li, Chao-Yuan Wu, Haoqi Fan, Karttikeya Mangalam, Bo Xiong, Jitendra Malik, Christoph Feichtenhofer

Figure 1 for Improved Multiscale Vision Transformers for Classification and Detection
Figure 2 for Improved Multiscale Vision Transformers for Classification and Detection
Figure 3 for Improved Multiscale Vision Transformers for Classification and Detection
Figure 4 for Improved Multiscale Vision Transformers for Classification and Detection
Viaarxiv icon

Ego4D: Around the World in 3,000 Hours of Egocentric Video

Add code
Bookmark button
Alert button
Oct 13, 2021
Kristen Grauman, Andrew Westbury, Eugene Byrne, Zachary Chavis, Antonino Furnari, Rohit Girdhar, Jackson Hamburger, Hao Jiang, Miao Liu, Xingyu Liu, Miguel Martin, Tushar Nagarajan, Ilija Radosavovic, Santhosh Kumar Ramakrishnan, Fiona Ryan, Jayant Sharma, Michael Wray, Mengmeng Xu, Eric Zhongcong Xu, Chen Zhao, Siddhant Bansal, Dhruv Batra, Vincent Cartillier, Sean Crane, Tien Do, Morrie Doulaty, Akshay Erapalli, Christoph Feichtenhofer, Adriano Fragomeni, Qichen Fu, Christian Fuegen, Abrham Gebreselasie, Cristina Gonzalez, James Hillis, Xuhua Huang, Yifei Huang, Wenqi Jia, Weslie Khoo, Jachym Kolar, Satwik Kottur, Anurag Kumar, Federico Landini, Chao Li, Yanghao Li, Zhenqiang Li, Karttikeya Mangalam, Raghava Modhugu, Jonathan Munro, Tullie Murrell, Takumi Nishiyasu, Will Price, Paola Ruiz Puentes, Merey Ramazanova, Leda Sari, Kiran Somasundaram, Audrey Southerland, Yusuke Sugano, Ruijie Tao, Minh Vo, Yuchen Wang, Xindi Wu, Takuma Yagi, Yunyi Zhu, Pablo Arbelaez, David Crandall, Dima Damen, Giovanni Maria Farinella, Bernard Ghanem, Vamsi Krishna Ithapu, C. V. Jawahar, Hanbyul Joo, Kris Kitani, Haizhou Li, Richard Newcombe, Aude Oliva, Hyun Soo Park, James M. Rehg, Yoichi Sato, Jianbo Shi, Mike Zheng Shou, Antonio Torralba, Lorenzo Torresani, Mingfei Yan, Jitendra Malik

Figure 1 for Ego4D: Around the World in 3,000 Hours of Egocentric Video
Figure 2 for Ego4D: Around the World in 3,000 Hours of Egocentric Video
Figure 3 for Ego4D: Around the World in 3,000 Hours of Egocentric Video
Figure 4 for Ego4D: Around the World in 3,000 Hours of Egocentric Video
Viaarxiv icon