Alert button
Picture for Rohit Girdhar

Rohit Girdhar

Alert button

InstanceDiffusion: Instance-level Control for Image Generation

Feb 05, 2024
Xudong Wang, Trevor Darrell, Sai Saketh Rambhatla, Rohit Girdhar, Ishan Misra

Viaarxiv icon

Generating Illustrated Instructions

Dec 07, 2023
Sachit Menon, Ishan Misra, Rohit Girdhar

Viaarxiv icon

Motion-Conditioned Image Animation for Video Editing

Nov 30, 2023
Wilson Yan, Andrew Brown, Pieter Abbeel, Rohit Girdhar, Samaneh Azadi

Viaarxiv icon

Emu Video: Factorizing Text-to-Video Generation by Explicit Image Conditioning

Nov 17, 2023
Rohit Girdhar, Mannat Singh, Andrew Brown, Quentin Duval, Samaneh Azadi, Sai Saketh Rambhatla, Akbar Shah, Xi Yin, Devi Parikh, Ishan Misra

Viaarxiv icon

VideoCutLER: Surprisingly Simple Unsupervised Video Instance Segmentation

Aug 28, 2023
Xudong Wang, Ishan Misra, Ziyun Zeng, Rohit Girdhar, Trevor Darrell

Figure 1 for VideoCutLER: Surprisingly Simple Unsupervised Video Instance Segmentation
Figure 2 for VideoCutLER: Surprisingly Simple Unsupervised Video Instance Segmentation
Figure 3 for VideoCutLER: Surprisingly Simple Unsupervised Video Instance Segmentation
Figure 4 for VideoCutLER: Surprisingly Simple Unsupervised Video Instance Segmentation
Viaarxiv icon

ImageBind: One Embedding Space To Bind Them All

May 09, 2023
Rohit Girdhar, Alaaeldin El-Nouby, Zhuang Liu, Mannat Singh, Kalyan Vasudev Alwala, Armand Joulin, Ishan Misra

Figure 1 for ImageBind: One Embedding Space To Bind Them All
Figure 2 for ImageBind: One Embedding Space To Bind Them All
Figure 3 for ImageBind: One Embedding Space To Bind Them All
Figure 4 for ImageBind: One Embedding Space To Bind Them All
Viaarxiv icon

The effectiveness of MAE pre-pretraining for billion-scale pretraining

Mar 23, 2023
Mannat Singh, Quentin Duval, Kalyan Vasudev Alwala, Haoqi Fan, Vaibhav Aggarwal, Aaron Adcock, Armand Joulin, Piotr Dollár, Christoph Feichtenhofer, Ross Girshick, Rohit Girdhar, Ishan Misra

Figure 1 for The effectiveness of MAE pre-pretraining for billion-scale pretraining
Figure 2 for The effectiveness of MAE pre-pretraining for billion-scale pretraining
Figure 3 for The effectiveness of MAE pre-pretraining for billion-scale pretraining
Figure 4 for The effectiveness of MAE pre-pretraining for billion-scale pretraining
Viaarxiv icon

Learning to Substitute Ingredients in Recipes

Feb 15, 2023
Bahare Fatemi, Quentin Duval, Rohit Girdhar, Michal Drozdzal, Adriana Romero-Soriano

Figure 1 for Learning to Substitute Ingredients in Recipes
Figure 2 for Learning to Substitute Ingredients in Recipes
Figure 3 for Learning to Substitute Ingredients in Recipes
Figure 4 for Learning to Substitute Ingredients in Recipes
Viaarxiv icon

Cut and Learn for Unsupervised Object Detection and Instance Segmentation

Jan 26, 2023
Xudong Wang, Rohit Girdhar, Stella X. Yu, Ishan Misra

Figure 1 for Cut and Learn for Unsupervised Object Detection and Instance Segmentation
Figure 2 for Cut and Learn for Unsupervised Object Detection and Instance Segmentation
Figure 3 for Cut and Learn for Unsupervised Object Detection and Instance Segmentation
Figure 4 for Cut and Learn for Unsupervised Object Detection and Instance Segmentation
Viaarxiv icon

HierVL: Learning Hierarchical Video-Language Embeddings

Jan 05, 2023
Kumar Ashutosh, Rohit Girdhar, Lorenzo Torresani, Kristen Grauman

Figure 1 for HierVL: Learning Hierarchical Video-Language Embeddings
Figure 2 for HierVL: Learning Hierarchical Video-Language Embeddings
Figure 3 for HierVL: Learning Hierarchical Video-Language Embeddings
Figure 4 for HierVL: Learning Hierarchical Video-Language Embeddings
Viaarxiv icon