Qiming Zhang

Frank

BEVSimDet: Simulated Multi-modal Distillation in Bird's-Eye View for Multi-view 3D Object Detection

Apr 15, 2023, via arXiv

Vision Transformer with Quadrangle Attention

Mar 27, 2023, via arXiv

ViTPose+: Vision Transformer Foundation Model for Generic Body Pose Estimation

Dec 07, 2022, via arXiv

1st Workshop on Maritime Computer Vision 2023: Challenge Results

Nov 28, 2022, via arXiv

Rethinking Hierarchies in Pre-trained Plain Vision Transformer

Nov 08, 2022, via arXiv

Advancing Plain Vision Transformer Towards Remote Sensing Foundation Model

Aug 10, 2022, via arXiv

Toward Real-world Single Image Deraining: A New Benchmark and Beyond

Jun 11, 2022, via arXiv

ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation

Apr 26, 2022, via arXiv

VSA: Learning Varied-Size Window Attention in Vision Transformers

Apr 18, 2022, via arXiv

ViTAEv2: Vision Transformer Advanced by Exploring Inductive Bias for Image Recognition and Beyond

Feb 21, 2022, via arXiv