Rohit Girdhar

SoundingActions: Learning How Actions Sound from Narrated Egocentric Videos

Apr 08, 2024
Changan Chen, Kumar Ashutosh, Rohit Girdhar, David Harwath, Kristen Grauman

InstanceDiffusion: Instance-level Control for Image Generation

Feb 05, 2024
Xudong Wang, Trevor Darrell, Sai Saketh Rambhatla, Rohit Girdhar, Ishan Misra

Generating Illustrated Instructions

Dec 07, 2023
Sachit Menon, Ishan Misra, Rohit Girdhar

Motion-Conditioned Image Animation for Video Editing

Nov 30, 2023
Wilson Yan, Andrew Brown, Pieter Abbeel, Rohit Girdhar, Samaneh Azadi

Emu Video: Factorizing Text-to-Video Generation by Explicit Image Conditioning

Nov 17, 2023
Rohit Girdhar, Mannat Singh, Andrew Brown, Quentin Duval, Samaneh Azadi, Sai Saketh Rambhatla, Akbar Shah, Xi Yin, Devi Parikh, Ishan Misra

VideoCutLER: Surprisingly Simple Unsupervised Video Instance Segmentation

Aug 28, 2023
Xudong Wang, Ishan Misra, Ziyun Zeng, Rohit Girdhar, Trevor Darrell

ImageBind: One Embedding Space To Bind Them All

May 09, 2023
Rohit Girdhar, Alaaeldin El-Nouby, Zhuang Liu, Mannat Singh, Kalyan Vasudev Alwala, Armand Joulin, Ishan Misra

The effectiveness of MAE pre-pretraining for billion-scale pretraining

Mar 23, 2023
Mannat Singh, Quentin Duval, Kalyan Vasudev Alwala, Haoqi Fan, Vaibhav Aggarwal, Aaron Adcock, Armand Joulin, Piotr Dollár, Christoph Feichtenhofer, Ross Girshick, Rohit Girdhar, Ishan Misra

Learning to Substitute Ingredients in Recipes

Feb 15, 2023
Bahare Fatemi, Quentin Duval, Rohit Girdhar, Michal Drozdzal, Adriana Romero-Soriano

Cut and Learn for Unsupervised Object Detection and Instance Segmentation

Jan 26, 2023
Xudong Wang, Rohit Girdhar, Stella X. Yu, Ishan Misra
