Xiaowei Hu

Scaling Up Vision-Language Pre-training for Image Captioning

Nov 24, 2021
Xiaowei Hu, Zhe Gan, Jianfeng Wang, Zhengyuan Yang, Zicheng Liu, Yumao Lu, Lijuan Wang

Via arXiv

Crossing the Format Boundary of Text and Boxes: Towards Unified Vision-Language Modeling

Nov 23, 2021
Zhengyuan Yang, Zhe Gan, Jianfeng Wang, Xiaowei Hu, Faisal Ahmed, Zicheng Liu, Yumao Lu, Lijuan Wang

Via arXiv

UFO: A UniFied TransfOrmer for Vision-Language Representation Learning

Nov 19, 2021
Jianfeng Wang, Xiaowei Hu, Zhe Gan, Zhengyuan Yang, Xiyang Dai, Zicheng Liu, Yumao Lu, Lijuan Wang

Via arXiv

An Empirical Study of GPT-3 for Few-Shot Knowledge-Based VQA

Sep 10, 2021
Zhengyuan Yang, Zhe Gan, Jianfeng Wang, Xiaowei Hu, Yumao Lu, Zicheng Liu, Lijuan Wang

Via arXiv

Compressing Visual-linguistic Model via Knowledge Distillation

Apr 05, 2021
Zhiyuan Fang, Jianfeng Wang, Xiaowei Hu, Lijuan Wang, Yezhou Yang, Zicheng Liu

Via arXiv

Global Guidance Network for Breast Lesion Segmentation in Ultrasound Images

Apr 05, 2021
Cheng Xue, Lei Zhu, Huazhu Fu, Xiaowei Hu, Xiaomeng Li, Hai Zhang, Pheng Ann Heng

Via arXiv

Deep Texture-Aware Features for Camouflaged Object Detection

Feb 05, 2021
Jingjing Ren, Xiaowei Hu, Lei Zhu, Xuemiao Xu, Yangyang Xu, Weiming Wang, Zijun Deng, Pheng-Ann Heng

Via arXiv

Incorporating Vision Bias into Click Models for Image-oriented Search Engine

Jan 07, 2021
Ningxin Xu, Cheng Yang, Yixin Zhu, Xiaowei Hu, Changhu Wang

Via arXiv

VinVL: Making Visual Representations Matter in Vision-Language Models

Jan 02, 2021
Pengchuan Zhang, Xiujun Li, Xiaowei Hu, Jianwei Yang, Lei Zhang, Lijuan Wang, Yejin Choi, Jianfeng Gao

Via arXiv

MiniVLM: A Smaller and Faster Vision-Language Model

Dec 13, 2020
Jianfeng Wang, Xiaowei Hu, Pengchuan Zhang, Xiujun Li, Lijuan Wang, Lei Zhang, Jianfeng Gao, Zicheng Liu

Via arXiv