Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

Rethinking Hierarchies in Pre-trained Plain Vision Transformer


Nov 08, 2022
Yufei Xu, Jing Zhang, Qiming Zhang, Dacheng Tao

* Tech report, work in progress 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Rethinking Hierarchicies in Pre-trained Plain Vision Transformer


Nov 03, 2022
Yufei Xu, Jing Zhang, Qiming Zhang, Dacheng Tao

* Tech report, work in progress 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Advancing Plain Vision Transformer Towards Remote Sensing Foundation Model


Aug 10, 2022
Di Wang, Qiming Zhang, Yufei Xu, Jing Zhang, Bo Du, Dacheng Tao, Liangpei Zhang

* The code and models will be released at https://github.com/ViTAE-Transformer/Remote-Sensing-RVSA 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Transformer-based Context Condensation for Boosting Feature Pyramids in Object Detection


Jul 14, 2022
Zhe Chen, Jing Zhang, Yufei Xu, Dacheng Tao


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

APT-36K: A Large-scale Benchmark for Animal Pose Estimation and Tracking


Jun 12, 2022
Yuxiang Yang, Junjie Yang, Yufei Xu, Jing Zhang, Long Lan, Dacheng Tao


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation


Apr 26, 2022
Yufei Xu, Jing Zhang, Qiming Zhang, Dacheng Tao

* Tech report. 81.1 mAP on MS COCO Keypoint Detection test-dev set 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

VSA: Learning Varied-Size Window Attention in Vision Transformers


Apr 18, 2022
Qiming Zhang, Yufei Xu, Jing Zhang, Dacheng Tao

* 23 pages, 13 tables, and 5 figures 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

ViTAEv2: Vision Transformer Advanced by Exploring Inductive Bias for Image Recognition and Beyond


Feb 21, 2022
Qiming Zhang, Yufei Xu, Jing Zhang, Dacheng Tao

* An extended version of the Neurips paper "ViTAE: Vision Transformer Advanced by Exploring Intrinsic Inductive Bias". arXiv admin note: substantial text overlap with arXiv:2106.03348 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

RegionCL: Can Simple Region Swapping Contribute to Contrastive Learning?


Nov 24, 2021
Yufei Xu, Qiming Zhang, Jing Zhang, Dacheng Tao

* 15 pages, 8 figures 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email
1
2
>>