Alert button
Picture for Rohit Girdhar

Rohit Girdhar

Alert button

HierVL: Learning Hierarchical Video-Language Embeddings

Add code
Bookmark button
Alert button
Jan 05, 2023
Kumar Ashutosh, Rohit Girdhar, Lorenzo Torresani, Kristen Grauman

Figure 1 for HierVL: Learning Hierarchical Video-Language Embeddings
Figure 2 for HierVL: Learning Hierarchical Video-Language Embeddings
Figure 3 for HierVL: Learning Hierarchical Video-Language Embeddings
Figure 4 for HierVL: Learning Hierarchical Video-Language Embeddings
Viaarxiv icon

What You Say Is What You Show: Visual Narration Detection in Instructional Videos

Add code
Bookmark button
Alert button
Jan 05, 2023
Kumar Ashutosh, Rohit Girdhar, Lorenzo Torresani, Kristen Grauman

Figure 1 for What You Say Is What You Show: Visual Narration Detection in Instructional Videos
Figure 2 for What You Say Is What You Show: Visual Narration Detection in Instructional Videos
Figure 3 for What You Say Is What You Show: Visual Narration Detection in Instructional Videos
Figure 4 for What You Say Is What You Show: Visual Narration Detection in Instructional Videos
Viaarxiv icon

Learning Video Representations from Large Language Models

Add code
Bookmark button
Alert button
Dec 08, 2022
Yue Zhao, Ishan Misra, Philipp Krähenbühl, Rohit Girdhar

Figure 1 for Learning Video Representations from Large Language Models
Figure 2 for Learning Video Representations from Large Language Models
Figure 3 for Learning Video Representations from Large Language Models
Figure 4 for Learning Video Representations from Large Language Models
Viaarxiv icon

OmniMAE: Single Model Masked Pretraining on Images and Videos

Add code
Bookmark button
Alert button
Jun 16, 2022
Rohit Girdhar, Alaaeldin El-Nouby, Mannat Singh, Kalyan Vasudev Alwala, Armand Joulin, Ishan Misra

Figure 1 for OmniMAE: Single Model Masked Pretraining on Images and Videos
Figure 2 for OmniMAE: Single Model Masked Pretraining on Images and Videos
Figure 3 for OmniMAE: Single Model Masked Pretraining on Images and Videos
Figure 4 for OmniMAE: Single Model Masked Pretraining on Images and Videos
Viaarxiv icon

Omnivore: A Single Model for Many Visual Modalities

Add code
Bookmark button
Alert button
Jan 20, 2022
Rohit Girdhar, Mannat Singh, Nikhila Ravi, Laurens van der Maaten, Armand Joulin, Ishan Misra

Figure 1 for Omnivore: A Single Model for Many Visual Modalities
Figure 2 for Omnivore: A Single Model for Many Visual Modalities
Figure 3 for Omnivore: A Single Model for Many Visual Modalities
Figure 4 for Omnivore: A Single Model for Many Visual Modalities
Viaarxiv icon

Detecting Twenty-thousand Classes using Image-level Supervision

Add code
Bookmark button
Alert button
Jan 10, 2022
Xingyi Zhou, Rohit Girdhar, Armand Joulin, Phillip Krähenbühl, Ishan Misra

Figure 1 for Detecting Twenty-thousand Classes using Image-level Supervision
Figure 2 for Detecting Twenty-thousand Classes using Image-level Supervision
Figure 3 for Detecting Twenty-thousand Classes using Image-level Supervision
Figure 4 for Detecting Twenty-thousand Classes using Image-level Supervision
Viaarxiv icon

Mask2Former for Video Instance Segmentation

Add code
Bookmark button
Alert button
Dec 20, 2021
Bowen Cheng, Anwesa Choudhuri, Ishan Misra, Alexander Kirillov, Rohit Girdhar, Alexander G. Schwing

Figure 1 for Mask2Former for Video Instance Segmentation
Figure 2 for Mask2Former for Video Instance Segmentation
Figure 3 for Mask2Former for Video Instance Segmentation
Viaarxiv icon

Masked-attention Mask Transformer for Universal Image Segmentation

Add code
Bookmark button
Alert button
Dec 10, 2021
Bowen Cheng, Ishan Misra, Alexander G. Schwing, Alexander Kirillov, Rohit Girdhar

Figure 1 for Masked-attention Mask Transformer for Universal Image Segmentation
Figure 2 for Masked-attention Mask Transformer for Universal Image Segmentation
Figure 3 for Masked-attention Mask Transformer for Universal Image Segmentation
Figure 4 for Masked-attention Mask Transformer for Universal Image Segmentation
Viaarxiv icon

Ego4D: Around the World in 3,000 Hours of Egocentric Video

Add code
Bookmark button
Alert button
Oct 13, 2021
Kristen Grauman, Andrew Westbury, Eugene Byrne, Zachary Chavis, Antonino Furnari, Rohit Girdhar, Jackson Hamburger, Hao Jiang, Miao Liu, Xingyu Liu, Miguel Martin, Tushar Nagarajan, Ilija Radosavovic, Santhosh Kumar Ramakrishnan, Fiona Ryan, Jayant Sharma, Michael Wray, Mengmeng Xu, Eric Zhongcong Xu, Chen Zhao, Siddhant Bansal, Dhruv Batra, Vincent Cartillier, Sean Crane, Tien Do, Morrie Doulaty, Akshay Erapalli, Christoph Feichtenhofer, Adriano Fragomeni, Qichen Fu, Christian Fuegen, Abrham Gebreselasie, Cristina Gonzalez, James Hillis, Xuhua Huang, Yifei Huang, Wenqi Jia, Weslie Khoo, Jachym Kolar, Satwik Kottur, Anurag Kumar, Federico Landini, Chao Li, Yanghao Li, Zhenqiang Li, Karttikeya Mangalam, Raghava Modhugu, Jonathan Munro, Tullie Murrell, Takumi Nishiyasu, Will Price, Paola Ruiz Puentes, Merey Ramazanova, Leda Sari, Kiran Somasundaram, Audrey Southerland, Yusuke Sugano, Ruijie Tao, Minh Vo, Yuchen Wang, Xindi Wu, Takuma Yagi, Yunyi Zhu, Pablo Arbelaez, David Crandall, Dima Damen, Giovanni Maria Farinella, Bernard Ghanem, Vamsi Krishna Ithapu, C. V. Jawahar, Hanbyul Joo, Kris Kitani, Haizhou Li, Richard Newcombe, Aude Oliva, Hyun Soo Park, James M. Rehg, Yoichi Sato, Jianbo Shi, Mike Zheng Shou, Antonio Torralba, Lorenzo Torresani, Mingfei Yan, Jitendra Malik

Figure 1 for Ego4D: Around the World in 3,000 Hours of Egocentric Video
Figure 2 for Ego4D: Around the World in 3,000 Hours of Egocentric Video
Figure 3 for Ego4D: Around the World in 3,000 Hours of Egocentric Video
Figure 4 for Ego4D: Around the World in 3,000 Hours of Egocentric Video
Viaarxiv icon