Mike Zheng Shou

Contrastive Learning of Semantic and Visual Representations for Text Tracking


Dec 30, 2021
Zhuang Li, Weijia Wu, Mike Zheng Shou, Jiahong Li, Size Li, Zhongyuan Wang, Hong Zhou

* 10 pages, 5 figures 


AssistSR: Affordance-centric Question-driven Video Segment Retrieval


Dec 06, 2021
Stan Weixian Lei, Yuxuan Wang, Dongxing Mao, Difei Gao, Mike Zheng Shou

* 15 pages, 11 figures 


AVA-AVD: Audio-visual Speaker Diarization in the Wild


Dec 06, 2021
Eric Zhongcong Xu, Zeyang Song, Chao Feng, Mang Ye, Mike Zheng Shou



Video-Text Pre-training with Learned Regions


Dec 06, 2021
Rui Yan, Mike Zheng Shou, Yixiao Ge, Alex Jinpeng Wang, Xudong Lin, Guanyu Cai, Jinhui Tang



Object-aware Video-language Pre-training for Retrieval


Dec 06, 2021
Alex Jinpeng Wang, Yixiao Ge, Guanyu Cai, Rui Yan, Xudong Lin, Ying Shan, Xiaohu Qie, Mike Zheng Shou

* 10 pages, 5 figures 


MorphMLP: A Self-Attention Free, MLP-Like Backbone for Image and Video


Nov 24, 2021
David Junhao Zhang, Kunchang Li, Yunpeng Chen, Yali Wang, Shashwat Chandra, Yu Qiao, Luoqi Liu, Mike Zheng Shou

* preprint version 


Ego4D: Around the World in 3,000 Hours of Egocentric Video


Oct 13, 2021
Kristen Grauman, Andrew Westbury, Eugene Byrne, Zachary Chavis, Antonino Furnari, Rohit Girdhar, Jackson Hamburger, Hao Jiang, Miao Liu, Xingyu Liu, Miguel Martin, Tushar Nagarajan, Ilija Radosavovic, Santhosh Kumar Ramakrishnan, Fiona Ryan, Jayant Sharma, Michael Wray, Mengmeng Xu, Eric Zhongcong Xu, Chen Zhao, Siddhant Bansal, Dhruv Batra, Vincent Cartillier, Sean Crane, Tien Do, Morrie Doulaty, Akshay Erapalli, Christoph Feichtenhofer, Adriano Fragomeni, Qichen Fu, Christian Fuegen, Abrham Gebreselasie, Cristina Gonzalez, James Hillis, Xuhua Huang, Yifei Huang, Wenqi Jia, Weslie Khoo, Jachym Kolar, Satwik Kottur, Anurag Kumar, Federico Landini, Chao Li, Yanghao Li, Zhenqiang Li, Karttikeya Mangalam, Raghava Modhugu, Jonathan Munro, Tullie Murrell, Takumi Nishiyasu, Will Price, Paola Ruiz Puentes, Merey Ramazanova, Leda Sari, Kiran Somasundaram, Audrey Southerland, Yusuke Sugano, Ruijie Tao, Minh Vo, Yuchen Wang, Xindi Wu, Takuma Yagi, Yunyi Zhu, Pablo Arbelaez, David Crandall, Dima Damen, Giovanni Maria Farinella, Bernard Ghanem, Vamsi Krishna Ithapu, C. V. Jawahar, Hanbyul Joo, Kris Kitani, Haizhou Li, Richard Newcombe, Aude Oliva, Hyun Soo Park, James M. Rehg, Yoichi Sato, Jianbo Shi, Mike Zheng Shou, Antonio Torralba, Lorenzo Torresani, Mingfei Yan, Jitendra Malik



On Pursuit of Designing Multi-modal Transformer for Video Grounding


Sep 13, 2021
Meng Cao, Long Chen, Mike Zheng Shou, Can Zhang, Yuexian Zou

* Accepted by the Conference on Empirical Methods in Natural Language Processing (EMNLP 2021)


Deep Motion Prior for Weakly-Supervised Temporal Action Localization


Aug 12, 2021
Meng Cao, Can Zhang, Long Chen, Mike Zheng Shou, Yuexian Zou



Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection


Jul 25, 2021
Ruijie Tao, Zexu Pan, Rohan Kumar Das, Xinyuan Qian, Mike Zheng Shou, Haizhou Li

* ACM Multimedia 2021 


Generic Event Boundary Detection: A Benchmark for Event Segmentation


Jan 26, 2021
Mike Zheng Shou, Deepti Ghadiyaram, Weiyao Wang, Matt Feiszli

