Affordance Grounding from Demonstration Video to Target Image


Mar 26, 2023
Joya Chen, Difei Gao, Kevin Qinghong Lin, Mike Zheng Shou

* CVPR 2023 

DeepfakeMAE: Facial Part Consistency Aware Masked Autoencoder for Deepfake Video Detection


Mar 03, 2023
Juan Hu, Xin Liao, Difei Gao, Satoshi Tsutsui, Zheng Qin, Mike Zheng Shou

MIST: Multi-modal Iterative Spatial-Temporal Transformer for Long-form Video Question Answering


Dec 19, 2022
Difei Gao, Luowei Zhou, Lei Ji, Linchao Zhu, Yi Yang, Mike Zheng Shou

An Efficient COarse-to-fiNE Alignment Framework @ Ego4D Natural Language Queries Challenge 2022


Nov 16, 2022
Zhijian Hou, Wanjun Zhong, Lei Ji, Difei Gao, Kun Yan, Wing-Kwong Chan, Chong-Wah Ngo, Zheng Shou, Nan Duan

* Technical report for ECCV 2022 Ego4D workshop, 4 pages, 2 figures, 2 tables. arXiv admin note: substantial text overlap with arXiv:2209.10918 

CONE: An Efficient COarse-to-fiNE Alignment Framework for Long Video Temporal Grounding


Sep 22, 2022
Zhijian Hou, Wanjun Zhong, Lei Ji, Difei Gao, Kun Yan, Wing-Kwong Chan, Chong-Wah Ngo, Zheng Shou, Nan Duan

* Preprint. 9 pages, 5 figures, 3 tables 

Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task


Aug 29, 2022
Stan Weixian Lei, Difei Gao, Jay Zhangjie Wu, Yuxuan Wang, Wei Liu, Mengmi Zhang, Mike Zheng Shou

* 18 pages, 13 figures 

Egocentric Video-Language Pretraining @ Ego4D Challenge 2022


Jul 04, 2022
Kevin Qinghong Lin, Alex Jinpeng Wang, Mattia Soldan, Michael Wray, Rui Yan, Eric Zhongcong Xu, Difei Gao, Rongcheng Tu, Wenzhe Zhao, Weijie Kong, Chengfei Cai, Hongfa Wang, Dima Damen, Bernard Ghanem, Wei Liu, Mike Zheng Shou

* To appear in CVPRW22. 4 pages, 2 figures, 5 tables. Code: https://github.com/showlab/EgoVLP. arXiv admin note: substantial text overlap with arXiv:2206.01670. substantial text overlap with arXiv:2207.01334

Egocentric Video-Language Pretraining


Jun 03, 2022
Kevin Qinghong Lin, Alex Jinpeng Wang, Mattia Soldan, Michael Wray, Rui Yan, Eric Zhongcong Xu, Difei Gao, Rongcheng Tu, Wenzhe Zhao, Weijie Kong, Chengfei Cai, Hongfa Wang, Dima Damen, Bernard Ghanem, Wei Liu, Mike Zheng Shou

* Preprint. 22 pages, 13 figures, 11 tables. Code: https://github.com/showlab/EgoVLP 

GEB+: A benchmark for generic event boundary captioning, grounding and text-based retrieval


Apr 10, 2022
Yuxuan Wang, Difei Gao, Licheng Yu, Stan Weixian Lei, Matt Feiszli, Mike Zheng Shou

* 24 pages 
