Jianfeng Wang

Violet

An Empirical Study of Training End-to-End Vision-and-Language Transformers

Nov 25, 2021
Zi-Yi Dou, Yichong Xu, Zhe Gan, Jianfeng Wang, Shuohang Wang, Lijuan Wang, Chenguang Zhu, Pengchuan Zhang, Lu Yuan, Nanyun Peng, Zicheng Liu, Michael Zeng

Via arXiv

Scaling Up Vision-Language Pre-training for Image Captioning

Nov 24, 2021
Xiaowei Hu, Zhe Gan, Jianfeng Wang, Zhengyuan Yang, Zicheng Liu, Yumao Lu, Lijuan Wang

Via arXiv

Crossing the Format Boundary of Text and Boxes: Towards Unified Vision-Language Modeling

Nov 23, 2021
Zhengyuan Yang, Zhe Gan, Jianfeng Wang, Xiaowei Hu, Faisal Ahmed, Zicheng Liu, Yumao Lu, Lijuan Wang

Via arXiv

Florence: A New Foundation Model for Computer Vision

Nov 22, 2021
Lu Yuan, Dongdong Chen, Yi-Ling Chen, Noel Codella, Xiyang Dai, Jianfeng Gao, Houdong Hu, Xuedong Huang, Boxin Li, Chunyuan Li, Ce Liu, Mengchen Liu, Zicheng Liu, Yumao Lu, Yu Shi, Lijuan Wang, Jianfeng Wang, Bin Xiao, Zhen Xiao, Jianwei Yang, Michael Zeng, Luowei Zhou, Pengchuan Zhang

Via arXiv

UFO: A UniFied TransfOrmer for Vision-Language Representation Learning

Nov 19, 2021
Jianfeng Wang, Xiaowei Hu, Zhe Gan, Zhengyuan Yang, Xiyang Dai, Zicheng Liu, Yumao Lu, Lijuan Wang

Via arXiv

Edge Prior Augmented Networks for Motion Deblurring on Naturally Blurry Images

Sep 18, 2021
Yuedong Chen, Junjia Huang, Jianfeng Wang, Xiaohua Xie

Via arXiv

An Empirical Study of GPT-3 for Few-Shot Knowledge-Based VQA

Sep 10, 2021
Zhengyuan Yang, Zhe Gan, Jianfeng Wang, Xiaowei Hu, Yumao Lu, Zicheng Liu, Lijuan Wang

Via arXiv

Is Object Detection Necessary for Human-Object Interaction Recognition?

Jul 27, 2021
Ying Jin, Yinpeng Chen, Lijuan Wang, Jianfeng Wang, Pei Yu, Zicheng Liu, Jenq-Neng Hwang

Via arXiv

RSG: A Simple but Effective Module for Learning Imbalanced Datasets

Jun 18, 2021
Jianfeng Wang, Thomas Lukasiewicz, Xiaolin Hu, Jianfei Cai, Zhenghua Xu

Via arXiv