Weichong Yin

ERNIE-UniX2: A Unified Cross-lingual Cross-modal Framework for Understanding and Generation

Nov 09, 2022
Bin Shan, Yaqian Han, Weichong Yin, Shuohuan Wang, Yu Sun, Hao Tian, Hua Wu, Haifeng Wang

Via arXiv

ERNIE-ViLG 2.0: Improving Text-to-Image Diffusion Model with Knowledge-Enhanced Mixture-of-Denoising-Experts

Oct 27, 2022
Zhida Feng, Zhenyu Zhang, Xintong Yu, Yewei Fang, Lanxin Li, Xuyi Chen, Yuxiang Lu, Jiaxiang Liu, Weichong Yin, Shikun Feng, Yu Sun, Hao Tian, Hua Wu, Haifeng Wang

Via arXiv

ERNIE-Layout: Layout Knowledge Enhanced Pre-training for Visually-rich Document Understanding

Oct 14, 2022
Qiming Peng, Yinxu Pan, Wenjin Wang, Bin Luo, Zhenyu Zhang, Zhengjie Huang, Teng Hu, Weichong Yin, Yongfeng Chen, Yin Zhang, Shikun Feng, Yu Sun, Hao Tian, Hua Wu, Haifeng Wang

Via arXiv

ERNIE-ViL 2.0: Multi-view Contrastive Learning for Image-Text Pre-training

Sep 30, 2022
Bin Shan, Weichong Yin, Yu Sun, Hao Tian, Hua Wu, Haifeng Wang

Via arXiv

ERNIE-mmLayout: Multi-grained MultiModal Transformer for Document Understanding

Sep 18, 2022
Wenjin Wang, Zhengjie Huang, Bin Luo, Qianglong Chen, Qiming Peng, Yinxu Pan, Weichong Yin, Shikun Feng, Yu Sun, Dianhai Yu, Yin Zhang

Via arXiv

ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language Generation

Dec 31, 2021
Han Zhang, Weichong Yin, Yewei Fang, Lanxin Li, Boqiang Duan, Zhihua Wu, Yu Sun, Hao Tian, Hua Wu, Haifeng Wang

Via arXiv

ERNIE-ViL: Knowledge Enhanced Vision-Language Representations Through Scene Graph

Jun 30, 2020
Fei Yu, Jiji Tang, Weichong Yin, Yu Sun, Hao Tian, Hua Wu, Haifeng Wang

Via arXiv