Wenyi Hong

CogCoM: Train Large Vision-Language Models Diving into Details through Chain of Manipulations

Feb 06, 2024
Ji Qi, Ming Ding, Weihan Wang, Yushi Bai, Qingsong Lv, Wenyi Hong, Bin Xu, Lei Hou, Juanzi Li, Yuxiao Dong, Jie Tang

Via arXiv

CogAgent: A Visual Language Model for GUI Agents

Dec 21, 2023
Wenyi Hong, Weihan Wang, Qingsong Lv, Jiazheng Xu, Wenmeng Yu, Junhui Ji, Yan Wang, Zihan Wang, Yuxuan Zhang, Juanzi Li, Bin Xu, Yuxiao Dong, Ming Ding, Jie Tang

Via arXiv

CogVLM: Visual Expert for Pretrained Language Models

Nov 06, 2023
Weihan Wang, Qingsong Lv, Wenmeng Yu, Wenyi Hong, Ji Qi, Yan Wang, Junhui Ji, Zhuoyi Yang, Lei Zhao, Xixuan Song, Jiazheng Xu, Bin Xu, Juanzi Li, Yuxiao Dong, Ming Ding, Jie Tang

Via arXiv

Relay Diffusion: Unifying diffusion process across resolutions for image synthesis

Sep 04, 2023
Jiayan Teng, Wendi Zheng, Ming Ding, Wenyi Hong, Jianqiao Wangni, Zhuoyi Yang, Jie Tang

Via arXiv

CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers

May 29, 2022
Wenyi Hong, Ming Ding, Wendi Zheng, Xinghan Liu, Jie Tang

Via arXiv

CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers

Apr 28, 2022
Ming Ding, Wendi Zheng, Wenyi Hong, Jie Tang

Via arXiv

CogView: Mastering Text-to-Image Generation via Transformers

May 28, 2021
Ming Ding, Zhuoyi Yang, Wenyi Hong, Wendi Zheng, Chang Zhou, Da Yin, Junyang Lin, Xu Zou, Zhou Shao, Hongxia Yang, Jie Tang

Via arXiv