Rohit Girdhar

Emu Video: Factorizing Text-to-Video Generation by Explicit Image Conditioning

Nov 17, 2023
Via arXiv

VideoCutLER: Surprisingly Simple Unsupervised Video Instance Segmentation

Aug 28, 2023
Via arXiv

ImageBind: One Embedding Space To Bind Them All

May 09, 2023
Via arXiv

The effectiveness of MAE pre-pretraining for billion-scale pretraining

Mar 23, 2023
Via arXiv

Learning to Substitute Ingredients in Recipes

Feb 15, 2023
Via arXiv

Cut and Learn for Unsupervised Object Detection and Instance Segmentation

Jan 26, 2023
Via arXiv

What You Say Is What You Show: Visual Narration Detection in Instructional Videos

Jan 05, 2023
Via arXiv

HierVL: Learning Hierarchical Video-Language Embeddings

Jan 05, 2023
Via arXiv

Learning Video Representations from Large Language Models

Dec 08, 2022
Via arXiv

OmniMAE: Single Model Masked Pretraining on Images and Videos

Jun 16, 2022
Via arXiv