Baining Guo

Simplified Diffusion Schrödinger Bridge

Mar 21, 2024
Zhicong Tang, Tiankai Hang, Shuyang Gu, Dong Chen, Baining Guo

Via arXiv

VisualCritic: Making LMMs Perceive Visual Quality Like Humans

Mar 19, 2024
Zhipeng Huang, Zhizheng Zhang, Yiting Lu, Zheng-Jun Zha, Zhibo Chen, Baining Guo

Via arXiv

RelationVLM: Making Large Vision-Language Models Understand Visual Relations

Mar 19, 2024
Zhipeng Huang, Zhizheng Zhang, Zheng-Jun Zha, Yan Lu, Baining Guo

Via arXiv

CCA: Collaborative Competitive Agents for Image Editing

Jan 23, 2024
Tiankai Hang, Shuyang Gu, Dong Chen, Xin Geng, Baining Guo

Via arXiv

VolumeDiffusion: Flexible Text-to-3D Generation with Efficient Volumetric Encoder

Dec 18, 2023
Zhicong Tang, Shuyang Gu, Chunyu Wang, Ting Zhang, Jianmin Bao, Dong Chen, Baining Guo

Via arXiv

MicroCinema: A Divide-and-Conquer Approach for Text-to-Video Generation

Nov 30, 2023
Yanhui Wang, Jianmin Bao, Wenming Weng, Ruoyu Feng, Dacheng Yin, Tao Yang, Jingxu Zhang, Qi Dai, Zhiyuan Zhao, Chunyu Wang, Kai Qiu, Yuhui Yuan, Xiaoyan Sun, Chong Luo, Baining Guo

Via arXiv

COLE: A Hierarchical Generation Framework for Graphic Design

Nov 28, 2023
Peidong Jia, Chenxuan Li, Zeyu Liu, Yichao Shen, Xingru Chen, Yuhui Yuan, Yinglin Zheng, Dong Chen, Ji Li, Xiaodong Xie, Shanghang Zhang, Baining Guo

Via arXiv

CCEdit: Creative and Controllable Video Editing via Diffusion Models

Sep 28, 2023
Ruoyu Feng, Wenming Weng, Yanhui Wang, Yuhui Yuan, Jianmin Bao, Chong Luo, Zhibo Chen, Baining Guo

Via arXiv

InstructDiffusion: A Generalist Modeling Interface for Vision Tasks

Sep 07, 2023
Zigang Geng, Binxin Yang, Tiankai Hang, Chen Li, Shuyang Gu, Ting Zhang, Jianmin Bao, Zheng Zhang, Han Hu, Dong Chen, Baining Guo

Via arXiv

V-DETR: DETR with Vertex Relative Position Encoding for 3D Object Detection

Aug 08, 2023
Yichao Shen, Zigang Geng, Yuhui Yuan, Yutong Lin, Ze Liu, Chunyu Wang, Han Hu, Nanning Zheng, Baining Guo

Via arXiv