Alert button
Picture for Zhuofan Zong

Zhuofan Zong

Alert button

MoVA: Adapting Mixture of Vision Experts to Multimodal Context

Add code
Bookmark button
Alert button
Apr 19, 2024
Zhuofan Zong, Bingqi Ma, Dazhong Shen, Guanglu Song, Hao Shao, Dongzhi Jiang, Hongsheng Li, Yu Liu

Viaarxiv icon

CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching

Add code
Bookmark button
Alert button
Apr 04, 2024
Dongzhi Jiang, Guanglu Song, Xiaoshi Wu, Renrui Zhang, Dazhong Shen, Zhuofan Zong, Yu Liu, Hongsheng Li

Viaarxiv icon

Visual CoT: Unleashing Chain-of-Thought Reasoning in Multi-Modal Language Models

Add code
Bookmark button
Alert button
Mar 25, 2024
Hao Shao, Shengju Qian, Han Xiao, Guanglu Song, Zhuofan Zong, Letian Wang, Yu Liu, Hongsheng Li

Viaarxiv icon

RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths

Add code
Bookmark button
Alert button
May 29, 2023
Zeyue Xue, Guanglu Song, Qiushan Guo, Boxiao Liu, Zhuofan Zong, Yu Liu, Ping Luo

Figure 1 for RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths
Figure 2 for RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths
Figure 3 for RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths
Figure 4 for RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths
Viaarxiv icon

Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction

Add code
Bookmark button
Alert button
Apr 03, 2023
Zhuofan Zong, Dongzhi Jiang, Guanglu Song, Zeyue Xue, Jingyong Su, Hongsheng Li, Yu Liu

Figure 1 for Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction
Figure 2 for Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction
Figure 3 for Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction
Figure 4 for Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction
Viaarxiv icon

DETRs with Collaborative Hybrid Assignments Training

Add code
Bookmark button
Alert button
Nov 22, 2022
Zhuofan Zong, Guanglu Song, Yu Liu

Figure 1 for DETRs with Collaborative Hybrid Assignments Training
Figure 2 for DETRs with Collaborative Hybrid Assignments Training
Figure 3 for DETRs with Collaborative Hybrid Assignments Training
Figure 4 for DETRs with Collaborative Hybrid Assignments Training
Viaarxiv icon

Self-slimmed Vision Transformer

Add code
Bookmark button
Alert button
Nov 24, 2021
Zhuofan Zong, Kunchang Li, Guanglu Song, Yali Wang, Yu Qiao, Biao Leng, Yu Liu

Figure 1 for Self-slimmed Vision Transformer
Figure 2 for Self-slimmed Vision Transformer
Figure 3 for Self-slimmed Vision Transformer
Figure 4 for Self-slimmed Vision Transformer
Viaarxiv icon

RCNet: Reverse Feature Pyramid and Cross-scale Shift Network for Object Detection

Add code
Bookmark button
Alert button
Oct 23, 2021
Zhuofan Zong, Qianggang Cao, Biao Leng

Figure 1 for RCNet: Reverse Feature Pyramid and Cross-scale Shift Network for Object Detection
Figure 2 for RCNet: Reverse Feature Pyramid and Cross-scale Shift Network for Object Detection
Figure 3 for RCNet: Reverse Feature Pyramid and Cross-scale Shift Network for Object Detection
Figure 4 for RCNet: Reverse Feature Pyramid and Cross-scale Shift Network for Object Detection
Viaarxiv icon