Alert button
Picture for Pan Zhang

Pan Zhang

Alert button

MLLM-DataEngine: An Iterative Refinement Approach for MLLM

Add code
Bookmark button
Alert button
Aug 25, 2023
Zhiyuan Zhao, Linke Ouyang, Bin Wang, Siyuan Huang, Pan Zhang, Xiaoyi Dong, Jiaqi Wang, Conghui He

Figure 1 for MLLM-DataEngine: An Iterative Refinement Approach for MLLM
Figure 2 for MLLM-DataEngine: An Iterative Refinement Approach for MLLM
Figure 3 for MLLM-DataEngine: An Iterative Refinement Approach for MLLM
Figure 4 for MLLM-DataEngine: An Iterative Refinement Approach for MLLM
Viaarxiv icon

VIGC: Visual Instruction Generation and Correction

Add code
Bookmark button
Alert button
Aug 24, 2023
Bin Wang, Fan Wu, Xiao Han, Jiahui Peng, Huaping Zhong, Pan Zhang, Xiaoyi Dong, Weijia Li, Wei Li, Jiaqi Wang, Conghui He

Figure 1 for VIGC: Visual Instruction Generation and Correction
Figure 2 for VIGC: Visual Instruction Generation and Correction
Figure 3 for VIGC: Visual Instruction Generation and Correction
Figure 4 for VIGC: Visual Instruction Generation and Correction
Viaarxiv icon

HGDNet: A Height-Hierarchy Guided Dual-Decoder Network for Single View Building Extraction and Height Estimation

Add code
Bookmark button
Alert button
Aug 10, 2023
Chaoran Lu, Ningning Cao, Pan Zhang, Ting Liu, Baochai Peng, Guozhang Liu, Mengke Yuan, Sen Zhang, Simin Huang, Tao Wang

Figure 1 for HGDNet: A Height-Hierarchy Guided Dual-Decoder Network for Single View Building Extraction and Height Estimation
Figure 2 for HGDNet: A Height-Hierarchy Guided Dual-Decoder Network for Single View Building Extraction and Height Estimation
Figure 3 for HGDNet: A Height-Hierarchy Guided Dual-Decoder Network for Single View Building Extraction and Height Estimation
Figure 4 for HGDNet: A Height-Hierarchy Guided Dual-Decoder Network for Single View Building Extraction and Height Estimation
Viaarxiv icon

Fine-grained building roof instance segmentation based on domain adapted pretraining and composite dual-backbone

Add code
Bookmark button
Alert button
Aug 10, 2023
Guozhang Liu, Baochai Peng, Ting Liu, Pan Zhang, Mengke Yuan, Chaoran Lu, Ningning Cao, Sen Zhang, Simin Huang, Tao Wang

Figure 1 for Fine-grained building roof instance segmentation based on domain adapted pretraining and composite dual-backbone
Figure 2 for Fine-grained building roof instance segmentation based on domain adapted pretraining and composite dual-backbone
Figure 3 for Fine-grained building roof instance segmentation based on domain adapted pretraining and composite dual-backbone
Figure 4 for Fine-grained building roof instance segmentation based on domain adapted pretraining and composite dual-backbone
Viaarxiv icon

FreeDrag: Point Tracking is Not What You Need for Interactive Point-based Image Editing

Add code
Bookmark button
Alert button
Jul 29, 2023
Pengyang Ling, Lin Chen, Pan Zhang, Huaian Chen, Yi Jin

Figure 1 for FreeDrag: Point Tracking is Not What You Need for Interactive Point-based Image Editing
Figure 2 for FreeDrag: Point Tracking is Not What You Need for Interactive Point-based Image Editing
Figure 3 for FreeDrag: Point Tracking is Not What You Need for Interactive Point-based Image Editing
Figure 4 for FreeDrag: Point Tracking is Not What You Need for Interactive Point-based Image Editing
Viaarxiv icon

qecGPT: decoding Quantum Error-correcting Codes with Generative Pre-trained Transformers

Add code
Bookmark button
Alert button
Jul 18, 2023
Hanyan Cao, Feng Pan, Yijia Wang, Pan Zhang

Figure 1 for qecGPT: decoding Quantum Error-correcting Codes with Generative Pre-trained Transformers
Figure 2 for qecGPT: decoding Quantum Error-correcting Codes with Generative Pre-trained Transformers
Figure 3 for qecGPT: decoding Quantum Error-correcting Codes with Generative Pre-trained Transformers
Figure 4 for qecGPT: decoding Quantum Error-correcting Codes with Generative Pre-trained Transformers
Viaarxiv icon

FreeDrag: Point Tracking is Not You Need for Interactive Point-based Image Editing

Add code
Bookmark button
Alert button
Jul 10, 2023
Pengyang Ling, Lin Chen, Pan Zhang, Huaian Chen, Yi Jin

Figure 1 for FreeDrag: Point Tracking is Not You Need for Interactive Point-based Image Editing
Figure 2 for FreeDrag: Point Tracking is Not You Need for Interactive Point-based Image Editing
Figure 3 for FreeDrag: Point Tracking is Not You Need for Interactive Point-based Image Editing
Figure 4 for FreeDrag: Point Tracking is Not You Need for Interactive Point-based Image Editing
Viaarxiv icon

BUOL: A Bottom-Up Framework with Occupancy-aware Lifting for Panoptic 3D Scene Reconstruction From A Single Image

Add code
Bookmark button
Alert button
Jun 01, 2023
Tao Chu, Pan Zhang, Qiong Liu, Jiaqi Wang

Figure 1 for BUOL: A Bottom-Up Framework with Occupancy-aware Lifting for Panoptic 3D Scene Reconstruction From A Single Image
Figure 2 for BUOL: A Bottom-Up Framework with Occupancy-aware Lifting for Panoptic 3D Scene Reconstruction From A Single Image
Figure 3 for BUOL: A Bottom-Up Framework with Occupancy-aware Lifting for Panoptic 3D Scene Reconstruction From A Single Image
Figure 4 for BUOL: A Bottom-Up Framework with Occupancy-aware Lifting for Panoptic 3D Scene Reconstruction From A Single Image
Viaarxiv icon

V3Det: Vast Vocabulary Visual Detection Dataset

Add code
Bookmark button
Alert button
Apr 07, 2023
Jiaqi Wang, Pan Zhang, Tao Chu, Yuhang Cao, Yujie Zhou, Tong Wu, Bin Wang, Conghui He, Dahua Lin

Figure 1 for V3Det: Vast Vocabulary Visual Detection Dataset
Figure 2 for V3Det: Vast Vocabulary Visual Detection Dataset
Figure 3 for V3Det: Vast Vocabulary Visual Detection Dataset
Figure 4 for V3Det: Vast Vocabulary Visual Detection Dataset
Viaarxiv icon

MetaPortrait: Identity-Preserving Talking Head Generation with Fast Personalized Adaptation

Add code
Bookmark button
Alert button
Dec 17, 2022
Bowen Zhang, Chenyang Qi, Pan Zhang, Bo Zhang, HsiangTao Wu, Dong Chen, Qifeng Chen, Yong Wang, Fang Wen

Figure 1 for MetaPortrait: Identity-Preserving Talking Head Generation with Fast Personalized Adaptation
Figure 2 for MetaPortrait: Identity-Preserving Talking Head Generation with Fast Personalized Adaptation
Figure 3 for MetaPortrait: Identity-Preserving Talking Head Generation with Fast Personalized Adaptation
Figure 4 for MetaPortrait: Identity-Preserving Talking Head Generation with Fast Personalized Adaptation
Viaarxiv icon