Alert button
Picture for Lu Yuan

Lu Yuan

Alert button

Learning Visual Representation from Modality-Shared Contrastive Language-Image Pre-training

Add code
Bookmark button
Alert button
Jul 26, 2022
Haoxuan You, Luowei Zhou, Bin Xiao, Noel Codella, Yu Cheng, Ruochen Xu, Shih-Fu Chang, Lu Yuan

Figure 1 for Learning Visual Representation from Modality-Shared Contrastive Language-Image Pre-training
Figure 2 for Learning Visual Representation from Modality-Shared Contrastive Language-Image Pre-training
Figure 3 for Learning Visual Representation from Modality-Shared Contrastive Language-Image Pre-training
Figure 4 for Learning Visual Representation from Modality-Shared Contrastive Language-Image Pre-training
Viaarxiv icon

TinyViT: Fast Pretraining Distillation for Small Vision Transformers

Add code
Bookmark button
Alert button
Jul 21, 2022
Kan Wu, Jinnian Zhang, Houwen Peng, Mengchen Liu, Bin Xiao, Jianlong Fu, Lu Yuan

Figure 1 for TinyViT: Fast Pretraining Distillation for Small Vision Transformers
Figure 2 for TinyViT: Fast Pretraining Distillation for Small Vision Transformers
Figure 3 for TinyViT: Fast Pretraining Distillation for Small Vision Transformers
Figure 4 for TinyViT: Fast Pretraining Distillation for Small Vision Transformers
Viaarxiv icon

Bootstrapped Masked Autoencoders for Vision BERT Pretraining

Add code
Bookmark button
Alert button
Jul 14, 2022
Xiaoyi Dong, Jianmin Bao, Ting Zhang, Dongdong Chen, Weiming Zhang, Lu Yuan, Dong Chen, Fang Wen, Nenghai Yu

Figure 1 for Bootstrapped Masked Autoencoders for Vision BERT Pretraining
Figure 2 for Bootstrapped Masked Autoencoders for Vision BERT Pretraining
Figure 3 for Bootstrapped Masked Autoencoders for Vision BERT Pretraining
Figure 4 for Bootstrapped Masked Autoencoders for Vision BERT Pretraining
Viaarxiv icon

Should All Proposals be Treated Equally in Object Detection?

Add code
Bookmark button
Alert button
Jul 07, 2022
Yunsheng Li, Yinpeng Chen, Xiyang Dai, Dongdong Chen, Mengchen Liu, Pei Yu, Jing Yin, Lu Yuan, Zicheng Liu, Nuno Vasconcelos

Figure 1 for Should All Proposals be Treated Equally in Object Detection?
Figure 2 for Should All Proposals be Treated Equally in Object Detection?
Figure 3 for Should All Proposals be Treated Equally in Object Detection?
Figure 4 for Should All Proposals be Treated Equally in Object Detection?
Viaarxiv icon

Semantic Image Synthesis via Diffusion Models

Add code
Bookmark button
Alert button
Jun 30, 2022
Weilun Wang, Jianmin Bao, Wengang Zhou, Dongdong Chen, Dong Chen, Lu Yuan, Houqiang Li

Figure 1 for Semantic Image Synthesis via Diffusion Models
Figure 2 for Semantic Image Synthesis via Diffusion Models
Figure 3 for Semantic Image Synthesis via Diffusion Models
Figure 4 for Semantic Image Synthesis via Diffusion Models
Viaarxiv icon

GLIPv2: Unifying Localization and Vision-Language Understanding

Add code
Bookmark button
Alert button
Jun 12, 2022
Haotian Zhang, Pengchuan Zhang, Xiaowei Hu, Yen-Chun Chen, Liunian Harold Li, Xiyang Dai, Lijuan Wang, Lu Yuan, Jenq-Neng Hwang, Jianfeng Gao

Figure 1 for GLIPv2: Unifying Localization and Vision-Language Understanding
Figure 2 for GLIPv2: Unifying Localization and Vision-Language Understanding
Figure 3 for GLIPv2: Unifying Localization and Vision-Language Understanding
Figure 4 for GLIPv2: Unifying Localization and Vision-Language Understanding
Viaarxiv icon

Detection Hub: Unifying Object Detection Datasets via Query Adaptation on Language Embedding

Add code
Bookmark button
Alert button
Jun 07, 2022
Lingchen Meng, Xiyang Dai, Yinpeng Chen, Pengchuan Zhang, Dongdong Chen, Mengchen Liu, Jianfeng Wang, Zuxuan Wu, Lu Yuan, Yu-Gang Jiang

Figure 1 for Detection Hub: Unifying Object Detection Datasets via Query Adaptation on Language Embedding
Figure 2 for Detection Hub: Unifying Object Detection Datasets via Query Adaptation on Language Embedding
Figure 3 for Detection Hub: Unifying Object Detection Datasets via Query Adaptation on Language Embedding
Figure 4 for Detection Hub: Unifying Object Detection Datasets via Query Adaptation on Language Embedding
Viaarxiv icon

Visual Clues: Bridging Vision and Language Foundations for Image Paragraph Captioning

Add code
Bookmark button
Alert button
Jun 03, 2022
Yujia Xie, Luowei Zhou, Xiyang Dai, Lu Yuan, Nguyen Bach, Ce Liu, Michael Zeng

Figure 1 for Visual Clues: Bridging Vision and Language Foundations for Image Paragraph Captioning
Figure 2 for Visual Clues: Bridging Vision and Language Foundations for Image Paragraph Captioning
Figure 3 for Visual Clues: Bridging Vision and Language Foundations for Image Paragraph Captioning
Figure 4 for Visual Clues: Bridging Vision and Language Foundations for Image Paragraph Captioning
Viaarxiv icon

REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question Answering

Add code
Bookmark button
Alert button
Jun 02, 2022
Yuanze Lin, Yujia Xie, Dongdong Chen, Yichong Xu, Chenguang Zhu, Lu Yuan

Figure 1 for REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question Answering
Figure 2 for REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question Answering
Figure 3 for REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question Answering
Figure 4 for REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question Answering
Viaarxiv icon

Reduce Information Loss in Transformers for Pluralistic Image Inpainting

Add code
Bookmark button
Alert button
May 15, 2022
Qiankun Liu, Zhentao Tan, Dongdong Chen, Qi Chu, Xiyang Dai, Yinpeng Chen, Mengchen Liu, Lu Yuan, Nenghai Yu

Figure 1 for Reduce Information Loss in Transformers for Pluralistic Image Inpainting
Figure 2 for Reduce Information Loss in Transformers for Pluralistic Image Inpainting
Figure 3 for Reduce Information Loss in Transformers for Pluralistic Image Inpainting
Figure 4 for Reduce Information Loss in Transformers for Pluralistic Image Inpainting
Viaarxiv icon