Alert button
Picture for Xiujun Li

Xiujun Li

Alert button

VIM: Probing Multimodal Large Language Models for Visual Embedded Instruction Following

Add code
Bookmark button
Alert button
Nov 29, 2023
Yujie Lu, Xiujun Li, William Yang Wang, Yejin Choi

Viaarxiv icon

LLMScore: Unveiling the Power of Large Language Models in Text-to-Image Synthesis Evaluation

Add code
Bookmark button
Alert button
May 18, 2023
Yujie Lu, Xianjun Yang, Xiujun Li, Xin Eric Wang, William Yang Wang

Figure 1 for LLMScore: Unveiling the Power of Large Language Models in Text-to-Image Synthesis Evaluation
Figure 2 for LLMScore: Unveiling the Power of Large Language Models in Text-to-Image Synthesis Evaluation
Figure 3 for LLMScore: Unveiling the Power of Large Language Models in Text-to-Image Synthesis Evaluation
Figure 4 for LLMScore: Unveiling the Power of Large Language Models in Text-to-Image Synthesis Evaluation
Viaarxiv icon

Self-supervised Pre-training with Hard Examples Improves Visual Representations

Add code
Bookmark button
Alert button
Jan 04, 2021
Chunyuan Li, Xiujun Li, Lei Zhang, Baolin Peng, Mingyuan Zhou, Jianfeng Gao

Figure 1 for Self-supervised Pre-training with Hard Examples Improves Visual Representations
Figure 2 for Self-supervised Pre-training with Hard Examples Improves Visual Representations
Figure 3 for Self-supervised Pre-training with Hard Examples Improves Visual Representations
Figure 4 for Self-supervised Pre-training with Hard Examples Improves Visual Representations
Viaarxiv icon

VinVL: Making Visual Representations Matter in Vision-Language Models

Add code
Bookmark button
Alert button
Jan 02, 2021
Pengchuan Zhang, Xiujun Li, Xiaowei Hu, Jianwei Yang, Lei Zhang, Lijuan Wang, Yejin Choi, Jianfeng Gao

Figure 1 for VinVL: Making Visual Representations Matter in Vision-Language Models
Figure 2 for VinVL: Making Visual Representations Matter in Vision-Language Models
Figure 3 for VinVL: Making Visual Representations Matter in Vision-Language Models
Figure 4 for VinVL: Making Visual Representations Matter in Vision-Language Models
Viaarxiv icon

MiniVLM: A Smaller and Faster Vision-Language Model

Add code
Bookmark button
Alert button
Dec 13, 2020
Jianfeng Wang, Xiaowei Hu, Pengchuan Zhang, Xiujun Li, Lijuan Wang, Lei Zhang, Jianfeng Gao, Zicheng Liu

Figure 1 for MiniVLM: A Smaller and Faster Vision-Language Model
Figure 2 for MiniVLM: A Smaller and Faster Vision-Language Model
Figure 3 for MiniVLM: A Smaller and Faster Vision-Language Model
Figure 4 for MiniVLM: A Smaller and Faster Vision-Language Model
Viaarxiv icon

Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks

Add code
Bookmark button
Alert button
May 18, 2020
Xiujun Li, Xi Yin, Chunyuan Li, Pengchuan Zhang, Xiaowei Hu, Lei Zhang, Lijuan Wang, Houdong Hu, Li Dong, Furu Wei, Yejin Choi, Jianfeng Gao

Figure 1 for Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks
Figure 2 for Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks
Figure 3 for Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks
Figure 4 for Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks
Viaarxiv icon

Optimus: Organizing Sentences via Pre-trained Modeling of a Latent Space

Add code
Bookmark button
Alert button
Apr 05, 2020
Chunyuan Li, Xiang Gao, Yuan Li, Xiujun Li, Baolin Peng, Yizhe Zhang, Jianfeng Gao

Figure 1 for Optimus: Organizing Sentences via Pre-trained Modeling of a Latent Space
Figure 2 for Optimus: Organizing Sentences via Pre-trained Modeling of a Latent Space
Figure 3 for Optimus: Organizing Sentences via Pre-trained Modeling of a Latent Space
Figure 4 for Optimus: Organizing Sentences via Pre-trained Modeling of a Latent Space
Viaarxiv icon