Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

Masked Autoencoders As Spatiotemporal Learners



Christoph Feichtenhofer , Haoqi Fan , Yanghao Li , Kaiming He

* Technical report 

   Access Paper or Ask Questions

MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition



Chao-Yuan Wu , Yanghao Li , Karttikeya Mangalam , Haoqi Fan , Bo Xiong , Jitendra Malik , Christoph Feichtenhofer

* Technical report 

   Access Paper or Ask Questions

A ConvNet for the 2020s



Zhuang Liu , Hanzi Mao , Chao-Yuan Wu , Christoph Feichtenhofer , Trevor Darrell , Saining Xie

* Technical report; Code: https://github.com/facebookresearch/ConvNeXt 

   Access Paper or Ask Questions

Masked Feature Prediction for Self-Supervised Visual Pre-Training



Chen Wei , Haoqi Fan , Saining Xie , Chao-Yuan Wu , Alan Yuille , Christoph Feichtenhofer

* Technical report 

   Access Paper or Ask Questions

Improved Multiscale Vision Transformers for Classification and Detection



Yanghao Li , Chao-Yuan Wu , Haoqi Fan , Karttikeya Mangalam , Bo Xiong , Jitendra Malik , Christoph Feichtenhofer

* Technical report 

   Access Paper or Ask Questions

PyTorchVideo: A Deep Learning Library for Video Understanding



Haoqi Fan , Tullie Murrell , Heng Wang , Kalyan Vasudev Alwala , Yanghao Li , Yilei Li , Bo Xiong , Nikhila Ravi , Meng Li , Haichuan Yang , Jitendra Malik , Ross Girshick , Matt Feiszli , Aaron Adcock , Wan-Yen Lo , Christoph Feichtenhofer

* Technical report 

   Access Paper or Ask Questions

Ego4D: Around the World in 3,000 Hours of Egocentric Video



Kristen Grauman , Andrew Westbury , Eugene Byrne , Zachary Chavis , Antonino Furnari , Rohit Girdhar , Jackson Hamburger , Hao Jiang , Miao Liu , Xingyu Liu , Miguel Martin , Tushar Nagarajan , Ilija Radosavovic , Santhosh Kumar Ramakrishnan , Fiona Ryan , Jayant Sharma , Michael Wray , Mengmeng Xu , Eric Zhongcong Xu , Chen Zhao , Siddhant Bansal , Dhruv Batra , Vincent Cartillier , Sean Crane , Tien Do , Morrie Doulaty , Akshay Erapalli , Christoph Feichtenhofer , Adriano Fragomeni , Qichen Fu , Christian Fuegen , Abrham Gebreselasie , Cristina Gonzalez , James Hillis , Xuhua Huang , Yifei Huang , Wenqi Jia , Weslie Khoo , Jachym Kolar , Satwik Kottur , Anurag Kumar , Federico Landini , Chao Li , Yanghao Li , Zhenqiang Li , Karttikeya Mangalam , Raghava Modhugu , Jonathan Munro , Tullie Murrell , Takumi Nishiyasu , Will Price , Paola Ruiz Puentes , Merey Ramazanova , Leda Sari , Kiran Somasundaram , Audrey Southerland , Yusuke Sugano , Ruijie Tao , Minh Vo , Yuchen Wang , Xindi Wu , Takuma Yagi , Yunyi Zhu , Pablo Arbelaez , David Crandall , Dima Damen , Giovanni Maria Farinella , Bernard Ghanem , Vamsi Krishna Ithapu , C. V. Jawahar , Hanbyul Joo , Kris Kitani , Haizhou Li , Richard Newcombe , Aude Oliva , Hyun Soo Park , James M. Rehg , Yoichi Sato , Jianbo Shi , Mike Zheng Shou , Antonio Torralba , Lorenzo Torresani , Mingfei Yan , Jitendra Malik


   Access Paper or Ask Questions

VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding



Hu Xu , Gargi Ghosh , Po-Yao Huang , Dmytro Okhonko , Armen Aghajanyan , Florian Metze , Luke Zettlemoyer , Christoph Feichtenhofer

* EMNLP 2021 

   Access Paper or Ask Questions

Keeping Your Eye on the Ball: Trajectory Attention in Video Transformers



Mandela Patrick , Dylan Campbell , Yuki M. Asano , Ishan Misra Florian Metze , Christoph Feichtenhofer , Andrea Vedaldi , Jo\ão F. Henriques

* Project page: https://facebookresearch.github.io/Motionformer 

   Access Paper or Ask Questions

VLM: Task-agnostic Video-Language Model Pre-training for Video Understanding



Hu Xu , Gargi Ghosh , Po-Yao Huang , Prahal Arora , Masoumeh Aminzadeh , Christoph Feichtenhofer , Florian Metze , Luke Zettlemoyer

* 9 pages, ACL Findings 2021 

   Access Paper or Ask Questions

1
2
3
4
>>