Yanghao Li

Where is my Wallet? Modeling Object Proposal Sets for Egocentric Visual Query Localization

Nov 18, 2022
Mengmeng Xu, Yanghao Li, Cheng-Yang Fu, Bernard Ghanem, Tao Xiang, Juan-Manuel Perez-Rua

Bit Allocation using Optimization

Sep 20, 2022
Tongda Xu, Han Gao, Chenjian Gao, Jinyong Pi, Yanghao Li, Yuanyuan Wang, Ziyu Zhu, Dailan He, Mao Ye, Hongwei Qin, Yan Wang

Negative Frames Matter in Egocentric Visual Query 2D Localization

Aug 03, 2022
Mengmeng Xu, Cheng-Yang Fu, Yanghao Li, Bernard Ghanem, Juan-Manuel Perez-Rua, Tao Xiang

Masked Autoencoders As Spatiotemporal Learners

May 18, 2022
Christoph Feichtenhofer, Haoqi Fan, Yanghao Li, Kaiming He

Exploring Plain Vision Transformer Backbones for Object Detection

Mar 30, 2022
Yanghao Li, Hanzi Mao, Ross Girshick, Kaiming He

MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition

Jan 20, 2022
Chao-Yuan Wu, Yanghao Li, Karttikeya Mangalam, Haoqi Fan, Bo Xiong, Jitendra Malik, Christoph Feichtenhofer

Improved Multiscale Vision Transformers for Classification and Detection

Dec 02, 2021
Yanghao Li, Chao-Yuan Wu, Haoqi Fan, Karttikeya Mangalam, Bo Xiong, Jitendra Malik, Christoph Feichtenhofer

Masked Autoencoders Are Scalable Vision Learners

Dec 02, 2021
Kaiming He, Xinlei Chen, Saining Xie, Yanghao Li, Piotr Dollár, Ross Girshick

Benchmarking Detection Transfer Learning with Vision Transformers

Nov 22, 2021
Yanghao Li, Saining Xie, Xinlei Chen, Piotr Dollár, Kaiming He, Ross Girshick

PyTorchVideo: A Deep Learning Library for Video Understanding

Nov 18, 2021
Haoqi Fan, Tullie Murrell, Heng Wang, Kalyan Vasudev Alwala, Yanghao Li, Yilei Li, Bo Xiong, Nikhila Ravi, Meng Li, Haichuan Yang, Jitendra Malik, Ross Girshick, Matt Feiszli, Aaron Adcock, Wan-Yen Lo, Christoph Feichtenhofer
