Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning


Dec 06, 2022
AJ Piergiovanni, Weicheng Kuo, Anelia Angelova

Add code


   Access Paper or Ask Questions

Compound Tokens: Channel Fusion for Vision-Language Representation Learning


Dec 02, 2022
Maxwell Mbabilla Aladago, AJ Piergiovanni

Add code


   Access Paper or Ask Questions

F-VLM: Open-Vocabulary Object Detection upon Frozen Vision and Language Models


Sep 30, 2022
Weicheng Kuo, Yin Cui, Xiuye Gu, AJ Piergiovanni, Anelia Angelova

Add code

* 19 pages, 6 figures 

   Access Paper or Ask Questions

PaLI: A Jointly-Scaled Multilingual Language-Image Model


Sep 16, 2022
Xi Chen, Xiao Wang, Soravit Changpinyo, AJ Piergiovanni, Piotr Padlewski, Daniel Salz, Sebastian Goodman, Adam Grycner, Basil Mustafa, Lucas Beyer, Alexander Kolesnikov, Joan Puigcerver, Nan Ding, Keran Rong, Hassan Akbari, Gaurav Mishra, Linting Xue, Ashish Thapliyal, James Bradbury, Weicheng Kuo, Mojtaba Seyedhosseini, Chao Jia, Burcu Karagol Ayan, Carlos Riquelme, Andreas Steiner, Anelia Angelova, Xiaohua Zhai, Neil Houlsby, Radu Soricut

Add code


   Access Paper or Ask Questions

Pre-training image-language transformers for open-vocabulary tasks


Sep 09, 2022
AJ Piergiovanni, Weicheng Kuo, Anelia Angelova

Add code


   Access Paper or Ask Questions

Video Question Answering with Iterative Video-Text Co-Tokenization


Aug 01, 2022
AJ Piergiovanni, Kairo Morton, Weicheng Kuo, Michael S. Ryoo, Anelia Angelova

Add code

* ECCV 2022 

   Access Paper or Ask Questions

Answer-Me: Multi-Task Open-Vocabulary Visual Question Answering


May 02, 2022
AJ Piergiovanni, Wei Li, Weicheng Kuo, Mohammad Saffar, Fred Bertsch, Anelia Angelova

Add code


   Access Paper or Ask Questions

FindIt: Generalized Localization with Natural Language Queries


Mar 31, 2022
Weicheng Kuo, Fred Bertsch, Wei Li, AJ Piergiovanni, Mohammad Saffar, Anelia Angelova

Add code

* Tech report 

   Access Paper or Ask Questions

4D-Net for Learned Multi-Modal Alignment


Sep 02, 2021
AJ Piergiovanni, Vincent Casser, Michael S. Ryoo, Anelia Angelova

Add code

* ICCV 2021 

   Access Paper or Ask Questions

Unsupervised Discovery of Actions in Instructional Videos


Jun 28, 2021
AJ Piergiovanni, Anelia Angelova, Michael S. Ryoo, Irfan Essa

Add code

* Full paper 

   Access Paper or Ask Questions

1
2
3
4
>>