Bin Bi

BUS: Efficient and Effective Vision-language Pre-training with Bottom-Up Patch Summarization

Jul 17, 2023
Chaoya Jiang, Haiyang Xu, Wei Ye, Qinghao Ye, Chenliang Li, Ming Yan, Bin Bi, Shikun Zhang, Fei Huang, Songfang Huang

Via arXiv

mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video

Feb 01, 2023
Haiyang Xu, Qinghao Ye, Ming Yan, Yaya Shi, Jiabo Ye, Yuanhong Xu, Chenliang Li, Bin Bi, Qi Qian, Wei Wang, Guohai Xu, Ji Zhang, Songfang Huang, Fei Huang, Jingren Zhou

Via arXiv

mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections

May 25, 2022
Chenliang Li, Haiyang Xu, Junfeng Tian, Wei Wang, Ming Yan, Bin Bi, Jiabo Ye, Hehong Chen, Guohai Xu, Zheng Cao, Ji Zhang, Songfang Huang, Fei Huang, Jingren Zhou, Luo Si

Via arXiv

Achieving Human Parity on Visual Question Answering

Nov 19, 2021
Ming Yan, Haiyang Xu, Chenliang Li, Junfeng Tian, Bin Bi, Wei Wang, Weihua Chen, Xianzhe Xu, Fan Wang, Zheng Cao, Zhicheng Zhang, Qiyu Zhang, Ji Zhang, Songfang Huang, Fei Huang, Luo Si, Rong Jin

Via arXiv

Grid-VLP: Revisiting Grid Features for Vision-Language Pre-training

Aug 21, 2021
Ming Yan, Haiyang Xu, Chenliang Li, Bin Bi, Junfeng Tian, Min Gui, Wei Wang

Via arXiv

E2E-VLP: End-to-End Vision-Language Pre-training Enhanced by Visual Learning

Jun 04, 2021
Haiyang Xu, Ming Yan, Chenliang Li, Bin Bi, Songfang Huang, Wenming Xiao, Fei Huang

Via arXiv

StructuralLM: Structural Pre-training for Form Understanding

May 24, 2021
Chenliang Li, Bin Bi, Ming Yan, Wei Wang, Songfang Huang, Fei Huang, Luo Si

Via arXiv