Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

ImageBind: One Embedding Space To Bind Them All


May 09, 2023
Rohit Girdhar, Alaaeldin El-Nouby, Zhuang Liu, Mannat Singh, Kalyan Vasudev Alwala, Armand Joulin, Ishan Misra

Add code

* CVPR 2023 (Highlighted Paper). Website: https://imagebind.metademolab.com/ Code/Models: https://github.com/facebookresearch/ImageBind 

   Access Paper or Ask Questions

The effectiveness of MAE pre-pretraining for billion-scale pretraining


Mar 23, 2023
Mannat Singh, Quentin Duval, Kalyan Vasudev Alwala, Haoqi Fan, Vaibhav Aggarwal, Aaron Adcock, Armand Joulin, Piotr Dollár, Christoph Feichtenhofer, Ross Girshick, Rohit Girdhar, Ishan Misra

Add code


   Access Paper or Ask Questions

Learning to Substitute Ingredients in Recipes


Feb 15, 2023
Bahare Fatemi, Quentin Duval, Rohit Girdhar, Michal Drozdzal, Adriana Romero-Soriano

Add code


   Access Paper or Ask Questions

Cut and Learn for Unsupervised Object Detection and Instance Segmentation


Jan 26, 2023
Xudong Wang, Rohit Girdhar, Stella X. Yu, Ishan Misra

Add code

* Tech report. Project page: http://people.eecs.berkeley.edu/~xdwang/projects/CutLER/. Code is available at https://github.com/facebookresearch/CutLER 

   Access Paper or Ask Questions

HierVL: Learning Hierarchical Video-Language Embeddings


Jan 05, 2023
Kumar Ashutosh, Rohit Girdhar, Lorenzo Torresani, Kristen Grauman

Add code

* Technical Report 

   Access Paper or Ask Questions

What You Say Is What You Show: Visual Narration Detection in Instructional Videos


Jan 05, 2023
Kumar Ashutosh, Rohit Girdhar, Lorenzo Torresani, Kristen Grauman

Add code

* Technical Report 

   Access Paper or Ask Questions

Learning Video Representations from Large Language Models


Dec 08, 2022
Yue Zhao, Ishan Misra, Philipp Krähenbühl, Rohit Girdhar

Add code

* Tech report. Project page: https://facebookresearch.github.io/LaViLa; Code is available at http://github.com/facebookresearch/LaViLa 

   Access Paper or Ask Questions

OmniMAE: Single Model Masked Pretraining on Images and Videos


Jun 16, 2022
Rohit Girdhar, Alaaeldin El-Nouby, Mannat Singh, Kalyan Vasudev Alwala, Armand Joulin, Ishan Misra

Add code


   Access Paper or Ask Questions

Omnivore: A Single Model for Many Visual Modalities


Jan 20, 2022
Rohit Girdhar, Mannat Singh, Nikhila Ravi, Laurens van der Maaten, Armand Joulin, Ishan Misra

Add code


   Access Paper or Ask Questions

Detecting Twenty-thousand Classes using Image-level Supervision


Jan 10, 2022
Xingyi Zhou, Rohit Girdhar, Armand Joulin, Phillip Krähenbühl, Ishan Misra

Add code

* Code is available at https://github.com/facebookresearch/Detic 

   Access Paper or Ask Questions

1
2
3
4
>>