Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

TransVG++: End-to-End Visual Grounding with Language Conditioned Vision Transformer



Jiajun Deng , Zhengyuan Yang , Daqing Liu , Tianlang Chen , Wengang Zhou , Yanyong Zhang , Houqiang Li , Wanli Ouyang

* arXiv admin note: text overlap with arXiv:2104.08541 

   Access Paper or Ask Questions

GIT: A Generative Image-to-text Transformer for Vision and Language



Jianfeng Wang , Zhengyuan Yang , Xiaowei Hu , Linjie Li , Kevin Lin , Zhe Gan , Zicheng Liu , Ce Liu , Lijuan Wang


   Access Paper or Ask Questions

Cross-modal Contrastive Distillation for Instructional Activity Anticipation



Zhengyuan Yang , Jingen Liu , Jing Huang , Xiaodong He , Tao Mei , Chenliang Xu , Jiebo Luo


   Access Paper or Ask Questions

Scaling Up Vision-Language Pre-training for Image Captioning



Xiaowei Hu , Zhe Gan , Jianfeng Wang , Zhengyuan Yang , Zicheng Liu , Yumao Lu , Lijuan Wang


   Access Paper or Ask Questions

Crossing the Format Boundary of Text and Boxes: Towards Unified Vision-Language Modeling



Zhengyuan Yang , Zhe Gan , Jianfeng Wang , Xiaowei Hu , Faisal Ahmed , Zicheng Liu , Yumao Lu , Lijuan Wang


   Access Paper or Ask Questions

UFO: A UniFied TransfOrmer for Vision-Language Representation Learning



Jianfeng Wang , Xiaowei Hu , Zhe Gan , Zhengyuan Yang , Xiyang Dai , Zicheng Liu , Yumao Lu , Lijuan Wang


   Access Paper or Ask Questions

An Empirical Study of GPT-3 for Few-Shot Knowledge-Based VQA



Zhengyuan Yang , Zhe Gan , Jianfeng Wang , Xiaowei Hu , Yumao Lu , Zicheng Liu , Lijuan Wang


   Access Paper or Ask Questions

SAT: 2D Semantics Assisted Training for 3D Visual Grounding



Zhengyuan Yang , Songyang Zhang , Liwei Wang , Jiebo Luo


   Access Paper or Ask Questions

TransVG: End-to-End Visual Grounding with Transformers



Jiajun Deng , Zhengyuan Yang , Tianlang Chen , Wengang Zhou , Houqiang Li


   Access Paper or Ask Questions

TAP: Text-Aware Pre-training for Text-VQA and Text-Caption



Zhengyuan Yang , Yijuan Lu , Jianfeng Wang , Xi Yin , Dinei Florencio , Lijuan Wang , Cha Zhang , Lei Zhang , Jiebo Luo


   Access Paper or Ask Questions

1
2
3
>>