Alert button
Picture for Pan Zhang

Pan Zhang

Alert button

Concealed Object Segmentation with Hierarchical Coherence Modeling

Add code
Bookmark button
Alert button
Jan 22, 2024
Fengyang Xiao, Pan Zhang, Chunming He, Runze Hu, Yutao Liu

Viaarxiv icon

Alpha-CLIP: A CLIP Model Focusing on Wherever You Want

Add code
Bookmark button
Alert button
Dec 13, 2023
Zeyi Sun, Ye Fang, Tong Wu, Pan Zhang, Yuhang Zang, Shu Kong, Yuanjun Xiong, Dahua Lin, Jiaqi Wang

Viaarxiv icon

HyperDreamer: Hyper-Realistic 3D Content Generation and Editing from a Single Image

Add code
Bookmark button
Alert button
Dec 07, 2023
Tong Wu, Zhibing Li, Shuai Yang, Pan Zhang, Xinggang Pan, Jiaqi Wang, Dahua Lin, Ziwei Liu

Viaarxiv icon

OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation

Add code
Bookmark button
Alert button
Nov 29, 2023
Qidong Huang, Xiaoyi Dong, Pan Zhang, Bin Wang, Conghui He, Jiaqi Wang, Dahua Lin, Weiming Zhang, Nenghai Yu

Viaarxiv icon

ShareGPT4V: Improving Large Multi-Modal Models with Better Captions

Add code
Bookmark button
Alert button
Nov 28, 2023
Lin Chen, Jinsong Li, Xiaoyi Dong, Pan Zhang, Conghui He, Jiaqi Wang, Feng Zhao, Dahua Lin

Viaarxiv icon

InternLM-XComposer: A Vision-Language Large Model for Advanced Text-image Comprehension and Composition

Add code
Bookmark button
Alert button
Sep 29, 2023
Pan Zhang, Xiaoyi Dong, Bin Wang, Yuhang Cao, Chao Xu, Linke Ouyang, Zhiyuan Zhao, Shuangrui Ding, Songyang Zhang, Haodong Duan, Hang Yan, Xinyue Zhang, Wei Li, Jingwen Li, Kai Chen, Conghui He, Xingcheng Zhang, Yu Qiao, Dahua Lin, Jiaqi Wang

Figure 1 for InternLM-XComposer: A Vision-Language Large Model for Advanced Text-image Comprehension and Composition
Figure 2 for InternLM-XComposer: A Vision-Language Large Model for Advanced Text-image Comprehension and Composition
Figure 3 for InternLM-XComposer: A Vision-Language Large Model for Advanced Text-image Comprehension and Composition
Figure 4 for InternLM-XComposer: A Vision-Language Large Model for Advanced Text-image Comprehension and Composition
Viaarxiv icon

VIGC: Visual Instruction Generation and Correction

Add code
Bookmark button
Alert button
Sep 11, 2023
Bin Wang, Fan Wu, Xiao Han, Jiahui Peng, Huaping Zhong, Pan Zhang, Xiaoyi Dong, Weijia Li, Wei Li, Jiaqi Wang, Conghui He

Figure 1 for VIGC: Visual Instruction Generation and Correction
Figure 2 for VIGC: Visual Instruction Generation and Correction
Figure 3 for VIGC: Visual Instruction Generation and Correction
Figure 4 for VIGC: Visual Instruction Generation and Correction
Viaarxiv icon

MLLM-DataEngine: An Iterative Refinement Approach for MLLM

Add code
Bookmark button
Alert button
Sep 11, 2023
Zhiyuan Zhao, Linke Ouyang, Bin Wang, Siyuan Huang, Pan Zhang, Xiaoyi Dong, Jiaqi Wang, Conghui He

Figure 1 for MLLM-DataEngine: An Iterative Refinement Approach for MLLM
Figure 2 for MLLM-DataEngine: An Iterative Refinement Approach for MLLM
Figure 3 for MLLM-DataEngine: An Iterative Refinement Approach for MLLM
Figure 4 for MLLM-DataEngine: An Iterative Refinement Approach for MLLM
Viaarxiv icon