Alert button
Picture for Dongzhi Jiang

Dongzhi Jiang

Alert button

MoVA: Adapting Mixture of Vision Experts to Multimodal Context

Add code
Bookmark button
Alert button
Apr 19, 2024
Zhuofan Zong, Bingqi Ma, Dazhong Shen, Guanglu Song, Hao Shao, Dongzhi Jiang, Hongsheng Li, Yu Liu

Viaarxiv icon

CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching

Add code
Bookmark button
Alert button
Apr 04, 2024
Dongzhi Jiang, Guanglu Song, Xiaoshi Wu, Renrui Zhang, Dazhong Shen, Zhuofan Zong, Yu Liu, Hongsheng Li

Viaarxiv icon

MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?

Add code
Bookmark button
Alert button
Mar 21, 2024
Renrui Zhang, Dongzhi Jiang, Yichi Zhang, Haokun Lin, Ziyu Guo, Pengshuo Qiu, Aojun Zhou, Pan Lu, Kai-Wei Chang, Peng Gao, Hongsheng Li

Figure 1 for MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?
Figure 2 for MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?
Figure 3 for MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?
Figure 4 for MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?
Viaarxiv icon

Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction

Add code
Bookmark button
Alert button
Apr 03, 2023
Zhuofan Zong, Dongzhi Jiang, Guanglu Song, Zeyue Xue, Jingyong Su, Hongsheng Li, Yu Liu

Figure 1 for Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction
Figure 2 for Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction
Figure 3 for Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction
Figure 4 for Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction
Viaarxiv icon