Picture for Yufei Xu

Yufei Xu

ViTPose+: Vision Transformer Foundation Model for Generic Body Pose Estimation

Add code
Dec 07, 2022
Viaarxiv icon

1st Workshop on Maritime Computer Vision 2023: Challenge Results

Add code
Nov 28, 2022
Figure 1 for 1st Workshop on Maritime Computer Vision  2023: Challenge Results
Figure 2 for 1st Workshop on Maritime Computer Vision  2023: Challenge Results
Figure 3 for 1st Workshop on Maritime Computer Vision  2023: Challenge Results
Figure 4 for 1st Workshop on Maritime Computer Vision  2023: Challenge Results
Viaarxiv icon

Rethinking Hierarchies in Pre-trained Plain Vision Transformer

Add code
Nov 08, 2022
Viaarxiv icon

Advancing Plain Vision Transformer Towards Remote Sensing Foundation Model

Add code
Aug 10, 2022
Figure 1 for Advancing Plain Vision Transformer Towards Remote Sensing Foundation Model
Figure 2 for Advancing Plain Vision Transformer Towards Remote Sensing Foundation Model
Figure 3 for Advancing Plain Vision Transformer Towards Remote Sensing Foundation Model
Figure 4 for Advancing Plain Vision Transformer Towards Remote Sensing Foundation Model
Viaarxiv icon

Transformer-based Context Condensation for Boosting Feature Pyramids in Object Detection

Add code
Jul 14, 2022
Figure 1 for Transformer-based Context Condensation for Boosting Feature Pyramids in Object Detection
Figure 2 for Transformer-based Context Condensation for Boosting Feature Pyramids in Object Detection
Figure 3 for Transformer-based Context Condensation for Boosting Feature Pyramids in Object Detection
Figure 4 for Transformer-based Context Condensation for Boosting Feature Pyramids in Object Detection
Viaarxiv icon

APT-36K: A Large-scale Benchmark for Animal Pose Estimation and Tracking

Add code
Jun 12, 2022
Figure 1 for APT-36K: A Large-scale Benchmark for Animal Pose Estimation and Tracking
Figure 2 for APT-36K: A Large-scale Benchmark for Animal Pose Estimation and Tracking
Figure 3 for APT-36K: A Large-scale Benchmark for Animal Pose Estimation and Tracking
Figure 4 for APT-36K: A Large-scale Benchmark for Animal Pose Estimation and Tracking
Viaarxiv icon

ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation

Add code
Apr 26, 2022
Figure 1 for ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation
Figure 2 for ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation
Figure 3 for ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation
Figure 4 for ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation
Viaarxiv icon

VSA: Learning Varied-Size Window Attention in Vision Transformers

Add code
Apr 18, 2022
Figure 1 for VSA: Learning Varied-Size Window Attention in Vision Transformers
Figure 2 for VSA: Learning Varied-Size Window Attention in Vision Transformers
Figure 3 for VSA: Learning Varied-Size Window Attention in Vision Transformers
Figure 4 for VSA: Learning Varied-Size Window Attention in Vision Transformers
Viaarxiv icon

ViTAEv2: Vision Transformer Advanced by Exploring Inductive Bias for Image Recognition and Beyond

Add code
Feb 21, 2022
Figure 1 for ViTAEv2: Vision Transformer Advanced by Exploring Inductive Bias for Image Recognition and Beyond
Figure 2 for ViTAEv2: Vision Transformer Advanced by Exploring Inductive Bias for Image Recognition and Beyond
Figure 3 for ViTAEv2: Vision Transformer Advanced by Exploring Inductive Bias for Image Recognition and Beyond
Figure 4 for ViTAEv2: Vision Transformer Advanced by Exploring Inductive Bias for Image Recognition and Beyond
Viaarxiv icon

RegionCL: Can Simple Region Swapping Contribute to Contrastive Learning?

Add code
Nov 24, 2021
Figure 1 for RegionCL: Can Simple Region Swapping Contribute to Contrastive Learning?
Figure 2 for RegionCL: Can Simple Region Swapping Contribute to Contrastive Learning?
Figure 3 for RegionCL: Can Simple Region Swapping Contribute to Contrastive Learning?
Figure 4 for RegionCL: Can Simple Region Swapping Contribute to Contrastive Learning?
Viaarxiv icon