Alert button
Picture for Zhiliang Peng

Zhiliang Peng

Alert button

Kosmos-G: Generating Images in Context with Multimodal Large Language Models

Add code
Bookmark button
Alert button
Oct 04, 2023
Xichen Pan, Li Dong, Shaohan Huang, Zhiliang Peng, Wenhu Chen, Furu Wei

Figure 1 for Kosmos-G: Generating Images in Context with Multimodal Large Language Models
Figure 2 for Kosmos-G: Generating Images in Context with Multimodal Large Language Models
Figure 3 for Kosmos-G: Generating Images in Context with Multimodal Large Language Models
Figure 4 for Kosmos-G: Generating Images in Context with Multimodal Large Language Models
Viaarxiv icon

Kosmos-2: Grounding Multimodal Large Language Models to the World

Add code
Bookmark button
Alert button
Jul 13, 2023
Zhiliang Peng, Wenhui Wang, Li Dong, Yaru Hao, Shaohan Huang, Shuming Ma, Furu Wei

Figure 1 for Kosmos-2: Grounding Multimodal Large Language Models to the World
Figure 2 for Kosmos-2: Grounding Multimodal Large Language Models to the World
Figure 3 for Kosmos-2: Grounding Multimodal Large Language Models to the World
Figure 4 for Kosmos-2: Grounding Multimodal Large Language Models to the World
Viaarxiv icon

Generic-to-Specific Distillation of Masked Autoencoders

Add code
Bookmark button
Alert button
Feb 28, 2023
Wei Huang, Zhiliang Peng, Li Dong, Furu Wei, Jianbin Jiao, Qixiang Ye

Figure 1 for Generic-to-Specific Distillation of Masked Autoencoders
Figure 2 for Generic-to-Specific Distillation of Masked Autoencoders
Figure 3 for Generic-to-Specific Distillation of Masked Autoencoders
Figure 4 for Generic-to-Specific Distillation of Masked Autoencoders
Viaarxiv icon

A Unified View of Masked Image Modeling

Add code
Bookmark button
Alert button
Oct 19, 2022
Zhiliang Peng, Li Dong, Hangbo Bao, Qixiang Ye, Furu Wei

Figure 1 for A Unified View of Masked Image Modeling
Figure 2 for A Unified View of Masked Image Modeling
Figure 3 for A Unified View of Masked Image Modeling
Figure 4 for A Unified View of Masked Image Modeling
Viaarxiv icon

Foundation Transformers

Add code
Bookmark button
Alert button
Oct 19, 2022
Hongyu Wang, Shuming Ma, Shaohan Huang, Li Dong, Wenhui Wang, Zhiliang Peng, Yu Wu, Payal Bajaj, Saksham Singhal, Alon Benhaim, Barun Patra, Zhun Liu, Vishrav Chaudhary, Xia Song, Furu Wei

Figure 1 for Foundation Transformers
Figure 2 for Foundation Transformers
Figure 3 for Foundation Transformers
Figure 4 for Foundation Transformers
Viaarxiv icon

Image as a Foreign Language: BEiT Pretraining for All Vision and Vision-Language Tasks

Add code
Bookmark button
Alert button
Aug 31, 2022
Wenhui Wang, Hangbo Bao, Li Dong, Johan Bjorck, Zhiliang Peng, Qiang Liu, Kriti Aggarwal, Owais Khan Mohammed, Saksham Singhal, Subhojit Som, Furu Wei

Figure 1 for Image as a Foreign Language: BEiT Pretraining for All Vision and Vision-Language Tasks
Figure 2 for Image as a Foreign Language: BEiT Pretraining for All Vision and Vision-Language Tasks
Figure 3 for Image as a Foreign Language: BEiT Pretraining for All Vision and Vision-Language Tasks
Figure 4 for Image as a Foreign Language: BEiT Pretraining for All Vision and Vision-Language Tasks
Viaarxiv icon

BEiT v2: Masked Image Modeling with Vector-Quantized Visual Tokenizers

Add code
Bookmark button
Alert button
Aug 12, 2022
Zhiliang Peng, Li Dong, Hangbo Bao, Qixiang Ye, Furu Wei

Figure 1 for BEiT v2: Masked Image Modeling with Vector-Quantized Visual Tokenizers
Figure 2 for BEiT v2: Masked Image Modeling with Vector-Quantized Visual Tokenizers
Figure 3 for BEiT v2: Masked Image Modeling with Vector-Quantized Visual Tokenizers
Figure 4 for BEiT v2: Masked Image Modeling with Vector-Quantized Visual Tokenizers
Viaarxiv icon