Alert button
Picture for Zhe Gan

Zhe Gan

Alert button

MOFI: Learning Image Representations from Noisy Entity Annotated Images

Jun 24, 2023
Wentao Wu, Aleksei Timofeev, Chen Chen, Bowen Zhang, Kun Duan, Shuangning Liu, Yantao Zheng, Jon Shlens, Xianzhi Du, Zhe Gan, Yinfei Yang

Figure 1 for MOFI: Learning Image Representations from Noisy Entity Annotated Images
Figure 2 for MOFI: Learning Image Representations from Noisy Entity Annotated Images
Figure 3 for MOFI: Learning Image Representations from Noisy Entity Annotated Images
Figure 4 for MOFI: Learning Image Representations from Noisy Entity Annotated Images
Viaarxiv icon

An Empirical Study of Multimodal Model Merging

Apr 28, 2023
Yi-Lin Sung, Linjie Li, Kevin Lin, Zhe Gan, Mohit Bansal, Lijuan Wang

Figure 1 for An Empirical Study of Multimodal Model Merging
Figure 2 for An Empirical Study of Multimodal Model Merging
Figure 3 for An Empirical Study of Multimodal Model Merging
Figure 4 for An Empirical Study of Multimodal Model Merging
Viaarxiv icon

Diagnostic Benchmark and Iterative Inpainting for Layout-Guided Image Generation

Apr 14, 2023
Jaemin Cho, Linjie Li, Zhengyuan Yang, Zhe Gan, Lijuan Wang, Mohit Bansal

Figure 1 for Diagnostic Benchmark and Iterative Inpainting for Layout-Guided Image Generation
Figure 2 for Diagnostic Benchmark and Iterative Inpainting for Layout-Guided Image Generation
Figure 3 for Diagnostic Benchmark and Iterative Inpainting for Layout-Guided Image Generation
Figure 4 for Diagnostic Benchmark and Iterative Inpainting for Layout-Guided Image Generation
Viaarxiv icon

Generalized Decoding for Pixel, Image, and Language

Dec 21, 2022
Xueyan Zou, Zi-Yi Dou, Jianwei Yang, Zhe Gan, Linjie Li, Chunyuan Li, Xiyang Dai, Harkirat Behl, Jianfeng Wang, Lu Yuan, Nanyun Peng, Lijuan Wang, Yong Jae Lee, Jianfeng Gao

Figure 1 for Generalized Decoding for Pixel, Image, and Language
Figure 2 for Generalized Decoding for Pixel, Image, and Language
Figure 3 for Generalized Decoding for Pixel, Image, and Language
Figure 4 for Generalized Decoding for Pixel, Image, and Language
Viaarxiv icon

Exploring Discrete Diffusion Models for Image Captioning

Dec 09, 2022
Zixin Zhu, Yixuan Wei, Jianfeng Wang, Zhe Gan, Zheng Zhang, Le Wang, Gang Hua, Lijuan Wang, Zicheng Liu, Han Hu

Figure 1 for Exploring Discrete Diffusion Models for Image Captioning
Figure 2 for Exploring Discrete Diffusion Models for Image Captioning
Figure 3 for Exploring Discrete Diffusion Models for Image Captioning
Figure 4 for Exploring Discrete Diffusion Models for Image Captioning
Viaarxiv icon

GRiT: A Generative Region-to-text Transformer for Object Understanding

Dec 01, 2022
Jialian Wu, Jianfeng Wang, Zhengyuan Yang, Zhe Gan, Zicheng Liu, Junsong Yuan, Lijuan Wang

Figure 1 for GRiT: A Generative Region-to-text Transformer for Object Understanding
Figure 2 for GRiT: A Generative Region-to-text Transformer for Object Understanding
Figure 3 for GRiT: A Generative Region-to-text Transformer for Object Understanding
Figure 4 for GRiT: A Generative Region-to-text Transformer for Object Understanding
Viaarxiv icon

ReCo: Region-Controlled Text-to-Image Generation

Nov 23, 2022
Zhengyuan Yang, Jianfeng Wang, Zhe Gan, Linjie Li, Kevin Lin, Chenfei Wu, Nan Duan, Zicheng Liu, Ce Liu, Michael Zeng, Lijuan Wang

Figure 1 for ReCo: Region-Controlled Text-to-Image Generation
Figure 2 for ReCo: Region-Controlled Text-to-Image Generation
Figure 3 for ReCo: Region-Controlled Text-to-Image Generation
Figure 4 for ReCo: Region-Controlled Text-to-Image Generation
Viaarxiv icon

Non-Contrastive Learning Meets Language-Image Pre-Training

Oct 17, 2022
Jinghao Zhou, Li Dong, Zhe Gan, Lijuan Wang, Furu Wei

Figure 1 for Non-Contrastive Learning Meets Language-Image Pre-Training
Figure 2 for Non-Contrastive Learning Meets Language-Image Pre-Training
Figure 3 for Non-Contrastive Learning Meets Language-Image Pre-Training
Figure 4 for Non-Contrastive Learning Meets Language-Image Pre-Training
Viaarxiv icon