Picture for Rohit Girdhar

Rohit Girdhar

SoundingActions: Learning How Actions Sound from Narrated Egocentric Videos

Add code
Apr 08, 2024
Figure 1 for SoundingActions: Learning How Actions Sound from Narrated Egocentric Videos
Figure 2 for SoundingActions: Learning How Actions Sound from Narrated Egocentric Videos
Figure 3 for SoundingActions: Learning How Actions Sound from Narrated Egocentric Videos
Figure 4 for SoundingActions: Learning How Actions Sound from Narrated Egocentric Videos
Viaarxiv icon

InstanceDiffusion: Instance-level Control for Image Generation

Add code
Feb 05, 2024
Viaarxiv icon

Generating Illustrated Instructions

Add code
Dec 07, 2023
Figure 1 for Generating Illustrated Instructions
Figure 2 for Generating Illustrated Instructions
Figure 3 for Generating Illustrated Instructions
Figure 4 for Generating Illustrated Instructions
Viaarxiv icon

Motion-Conditioned Image Animation for Video Editing

Add code
Nov 30, 2023
Figure 1 for Motion-Conditioned Image Animation for Video Editing
Figure 2 for Motion-Conditioned Image Animation for Video Editing
Figure 3 for Motion-Conditioned Image Animation for Video Editing
Figure 4 for Motion-Conditioned Image Animation for Video Editing
Viaarxiv icon

Emu Video: Factorizing Text-to-Video Generation by Explicit Image Conditioning

Add code
Nov 17, 2023
Viaarxiv icon

VideoCutLER: Surprisingly Simple Unsupervised Video Instance Segmentation

Add code
Aug 28, 2023
Figure 1 for VideoCutLER: Surprisingly Simple Unsupervised Video Instance Segmentation
Figure 2 for VideoCutLER: Surprisingly Simple Unsupervised Video Instance Segmentation
Figure 3 for VideoCutLER: Surprisingly Simple Unsupervised Video Instance Segmentation
Figure 4 for VideoCutLER: Surprisingly Simple Unsupervised Video Instance Segmentation
Viaarxiv icon

ImageBind: One Embedding Space To Bind Them All

Add code
May 09, 2023
Figure 1 for ImageBind: One Embedding Space To Bind Them All
Figure 2 for ImageBind: One Embedding Space To Bind Them All
Figure 3 for ImageBind: One Embedding Space To Bind Them All
Figure 4 for ImageBind: One Embedding Space To Bind Them All
Viaarxiv icon

The effectiveness of MAE pre-pretraining for billion-scale pretraining

Add code
Mar 23, 2023
Figure 1 for The effectiveness of MAE pre-pretraining for billion-scale pretraining
Figure 2 for The effectiveness of MAE pre-pretraining for billion-scale pretraining
Figure 3 for The effectiveness of MAE pre-pretraining for billion-scale pretraining
Figure 4 for The effectiveness of MAE pre-pretraining for billion-scale pretraining
Viaarxiv icon

Learning to Substitute Ingredients in Recipes

Add code
Feb 15, 2023
Figure 1 for Learning to Substitute Ingredients in Recipes
Figure 2 for Learning to Substitute Ingredients in Recipes
Figure 3 for Learning to Substitute Ingredients in Recipes
Figure 4 for Learning to Substitute Ingredients in Recipes
Viaarxiv icon

Cut and Learn for Unsupervised Object Detection and Instance Segmentation

Add code
Jan 26, 2023
Figure 1 for Cut and Learn for Unsupervised Object Detection and Instance Segmentation
Figure 2 for Cut and Learn for Unsupervised Object Detection and Instance Segmentation
Figure 3 for Cut and Learn for Unsupervised Object Detection and Instance Segmentation
Figure 4 for Cut and Learn for Unsupervised Object Detection and Instance Segmentation
Viaarxiv icon