Alert button
Picture for Zhengyuan Yang

Zhengyuan Yang

Alert button

MM-VID: Advancing Video Understanding with GPT-4V(ision)

Add code
Bookmark button
Alert button
Oct 30, 2023
Kevin Lin, Faisal Ahmed, Linjie Li, Chung-Ching Lin, Ehsan Azarnasab, Zhengyuan Yang, Jianfeng Wang, Lin Liang, Zicheng Liu, Yumao Lu, Ce Liu, Lijuan Wang

Figure 1 for MM-VID: Advancing Video Understanding with GPT-4V(ision)
Figure 2 for MM-VID: Advancing Video Understanding with GPT-4V(ision)
Figure 3 for MM-VID: Advancing Video Understanding with GPT-4V(ision)
Figure 4 for MM-VID: Advancing Video Understanding with GPT-4V(ision)
Viaarxiv icon

DEsignBench: Exploring and Benchmarking DALL-E 3 for Imagining Visual Design

Add code
Bookmark button
Alert button
Oct 23, 2023
Kevin Lin, Zhengyuan Yang, Linjie Li, Jianfeng Wang, Lijuan Wang

Viaarxiv icon

Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic Image Design and Generation

Add code
Bookmark button
Alert button
Oct 12, 2023
Zhengyuan Yang, Jianfeng Wang, Linjie Li, Kevin Lin, Chung-Ching Lin, Zicheng Liu, Lijuan Wang

Figure 1 for Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic Image Design and Generation
Figure 2 for Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic Image Design and Generation
Figure 3 for Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic Image Design and Generation
Figure 4 for Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic Image Design and Generation
Viaarxiv icon

OpenLEAF: Open-Domain Interleaved Image-Text Generation and Evaluation

Add code
Bookmark button
Alert button
Oct 11, 2023
Jie An, Zhengyuan Yang, Linjie Li, Jianfeng Wang, Kevin Lin, Zicheng Liu, Lijuan Wang, Jiebo Luo

Figure 1 for OpenLEAF: Open-Domain Interleaved Image-Text Generation and Evaluation
Figure 2 for OpenLEAF: Open-Domain Interleaved Image-Text Generation and Evaluation
Figure 3 for OpenLEAF: Open-Domain Interleaved Image-Text Generation and Evaluation
Figure 4 for OpenLEAF: Open-Domain Interleaved Image-Text Generation and Evaluation
Viaarxiv icon

The Dawn of LMMs: Preliminary Explorations with GPT-4V(ision)

Add code
Bookmark button
Alert button
Oct 11, 2023
Zhengyuan Yang, Linjie Li, Kevin Lin, Jianfeng Wang, Chung-Ching Lin, Zicheng Liu, Lijuan Wang

Figure 1 for The Dawn of LMMs: Preliminary Explorations with GPT-4V(ision)
Figure 2 for The Dawn of LMMs: Preliminary Explorations with GPT-4V(ision)
Figure 3 for The Dawn of LMMs: Preliminary Explorations with GPT-4V(ision)
Figure 4 for The Dawn of LMMs: Preliminary Explorations with GPT-4V(ision)
Viaarxiv icon

Multimodal Foundation Models: From Specialists to General-Purpose Assistants

Add code
Bookmark button
Alert button
Sep 18, 2023
Chunyuan Li, Zhe Gan, Zhengyuan Yang, Jianwei Yang, Linjie Li, Lijuan Wang, Jianfeng Gao

Figure 1 for Multimodal Foundation Models: From Specialists to General-Purpose Assistants
Figure 2 for Multimodal Foundation Models: From Specialists to General-Purpose Assistants
Figure 3 for Multimodal Foundation Models: From Specialists to General-Purpose Assistants
Figure 4 for Multimodal Foundation Models: From Specialists to General-Purpose Assistants
Viaarxiv icon

MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities

Add code
Bookmark button
Alert button
Aug 04, 2023
Weihao Yu, Zhengyuan Yang, Linjie Li, Jianfeng Wang, Kevin Lin, Zicheng Liu, Xinchao Wang, Lijuan Wang

Figure 1 for MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities
Figure 2 for MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities
Figure 3 for MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities
Figure 4 for MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities
Viaarxiv icon

Spatial-Frequency U-Net for Denoising Diffusion Probabilistic Models

Add code
Bookmark button
Alert button
Jul 27, 2023
Xin Yuan, Linjie Li, Jianfeng Wang, Zhengyuan Yang, Kevin Lin, Zicheng Liu, Lijuan Wang

Figure 1 for Spatial-Frequency U-Net for Denoising Diffusion Probabilistic Models
Figure 2 for Spatial-Frequency U-Net for Denoising Diffusion Probabilistic Models
Figure 3 for Spatial-Frequency U-Net for Denoising Diffusion Probabilistic Models
Figure 4 for Spatial-Frequency U-Net for Denoising Diffusion Probabilistic Models
Viaarxiv icon

DisCo: Disentangled Control for Referring Human Dance Generation in Real World

Add code
Bookmark button
Alert button
Jun 30, 2023
Tan Wang, Linjie Li, Kevin Lin, Chung-Ching Lin, Zhengyuan Yang, Hanwang Zhang, Zicheng Liu, Lijuan Wang

Figure 1 for DisCo: Disentangled Control for Referring Human Dance Generation in Real World
Figure 2 for DisCo: Disentangled Control for Referring Human Dance Generation in Real World
Figure 3 for DisCo: Disentangled Control for Referring Human Dance Generation in Real World
Figure 4 for DisCo: Disentangled Control for Referring Human Dance Generation in Real World
Viaarxiv icon