Semantic-Conditional Diffusion Networks for Image Captioning


Dec 06, 2022
Jianjie Luo, Yehao Li, Yingwei Pan, Ting Yao, Jianlin Feng, Hongyang Chao, Tao Mei

* Source code is available at https://github.com/YehLi/xmodaler/tree/master/configs/image_caption/scdnet


SPE-Net: Boosting Point Cloud Analysis via Rotation Robustness Enhancement


Nov 15, 2022
Zhaofan Qiu, Yehao Li, Yu Wang, Yingwei Pan, Ting Yao, Tao Mei

* ECCV 2022. Source code is available at https://github.com/ZhaofanQiu/SPE-Net 


Dual Vision Transformer


Jul 12, 2022
Ting Yao, Yehao Li, Yingwei Pan, Yu Wang, Xiao-Ping Zhang, Tao Mei

* Source code is available at https://github.com/YehLi/ImageNetModel


Wave-ViT: Unifying Wavelet and Transformers for Visual Representation Learning


Jul 11, 2022
Ting Yao, Yingwei Pan, Yehao Li, Chong-Wah Ngo, Tao Mei

* ECCV 2022. Source code is available at https://github.com/YehLi/ImageNetModel


Comprehending and Ordering Semantics for Image Captioning


Jun 14, 2022
Yehao Li, Yingwei Pan, Ting Yao, Tao Mei

* CVPR 2022; Code is publicly available at: https://github.com/YehLi/xmodaler/tree/master/configs/image_caption/cosnet 


Silver-Bullet-3D at ManiSkill 2021: Learning-from-Demonstrations and Heuristic Rule-based Methods for Object Manipulation


Jun 13, 2022
Yingwei Pan, Yehao Li, Yiheng Zhang, Qi Cai, Fuchen Long, Zhaofan Qiu, Ting Yao, Tao Mei

* Accepted by ICLR 2022 Workshop on Generalizable Policy Learning in Physical World. Top-performing systems for both the no-interaction and no-restriction tracks in the SAPIEN ManiSkill Challenge 2021. The source code and model are publicly available at: https://github.com/caiqi/Silver-Bullet-3D/


Uni-EDEN: Universal Encoder-Decoder Network by Multi-Granular Vision-Language Pre-training


Jan 11, 2022
Yehao Li, Jiahao Fan, Yingwei Pan, Ting Yao, Weiyao Lin, Tao Mei

* ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM) 


CoCo-BERT: Improving Video-Language Pre-training with Contrastive Cross-modal Matching and Denoising


Dec 14, 2021
Jianjie Luo, Yehao Li, Yingwei Pan, Ting Yao, Hongyang Chao, Tao Mei

* ACM Multimedia 2021 


X-modaler: A Versatile and High-performance Codebase for Cross-modal Analytics


Aug 18, 2021
Yehao Li, Yingwei Pan, Jingwen Chen, Ting Yao, Tao Mei

* Accepted by 2021 ACMMM Open Source Software Competition. Source code: https://github.com/YehLi/xmodaler 


Contextual Transformer Networks for Visual Recognition


Jul 26, 2021
Yehao Li, Ting Yao, Yingwei Pan, Tao Mei

* Rank 1 in the open-set image classification task of the Open World Vision Challenge @ CVPR 2021. The source code and models are publicly available at: https://github.com/JDAI-CV/CoTNet
