Alert button
Picture for Mengdan Zhang

Mengdan Zhang

Alert button

A Challenger to GPT-4V? Early Explorations of Gemini in Visual Expertise

Dec 20, 2023
Chaoyou Fu, Renrui Zhang, Zihan Wang, Yubo Huang, Zhengye Zhang, Longtian Qiu, Gaoxiang Ye, Yunhang Shen, Mengdan Zhang, Peixian Chen, Sirui Zhao, Shaohui Lin, Deqiang Jiang, Di Yin, Peng Gao, Ke Li, Hongsheng Li, Xing Sun

Viaarxiv icon

Aligning and Prompting Everything All at Once for Universal Visual Perception

Dec 04, 2023
Yunhang Shen, Chaoyou Fu, Peixian Chen, Mengdan Zhang, Ke Li, Xing Sun, Yunsheng Wu, Shaohui Lin, Rongrong Ji

Viaarxiv icon

Exploring Multi-Modal Contextual Knowledge for Open-Vocabulary Object Detection

Aug 30, 2023
Yifan Xu, Mengdan Zhang, Xiaoshan Yang, Changsheng Xu

Figure 1 for Exploring Multi-Modal Contextual Knowledge for Open-Vocabulary Object Detection
Figure 2 for Exploring Multi-Modal Contextual Knowledge for Open-Vocabulary Object Detection
Figure 3 for Exploring Multi-Modal Contextual Knowledge for Open-Vocabulary Object Detection
Figure 4 for Exploring Multi-Modal Contextual Knowledge for Open-Vocabulary Object Detection
Viaarxiv icon

MME: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models

Jul 02, 2023
Chaoyou Fu, Peixian Chen, Yunhang Shen, Yulei Qin, Mengdan Zhang, Xu Lin, Zhenyu Qiu, Wei Lin, Jinrui Yang, Xiawu Zheng, Ke Li, Xing Sun, Rongrong Ji

Figure 1 for MME: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models
Figure 2 for MME: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models
Figure 3 for MME: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models
Figure 4 for MME: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models
Viaarxiv icon

Multi-modal Queried Object Detection in the Wild

May 30, 2023
Yifan Xu, Mengdan Zhang, Chaoyou Fu, Peixian Chen, Xiaoshan Yang, Ke Li, Changsheng Xu

Figure 1 for Multi-modal Queried Object Detection in the Wild
Figure 2 for Multi-modal Queried Object Detection in the Wild
Figure 3 for Multi-modal Queried Object Detection in the Wild
Figure 4 for Multi-modal Queried Object Detection in the Wild
Viaarxiv icon

Open Vocabulary Object Detection with Proposal Mining and Prediction Equalization

Jun 24, 2022
Peixian Chen, Kekai Sheng, Mengdan Zhang, Yunhang Shen, Ke Li, Chunhua Shen

Figure 1 for Open Vocabulary Object Detection with Proposal Mining and Prediction Equalization
Figure 2 for Open Vocabulary Object Detection with Proposal Mining and Prediction Equalization
Figure 3 for Open Vocabulary Object Detection with Proposal Mining and Prediction Equalization
Figure 4 for Open Vocabulary Object Detection with Proposal Mining and Prediction Equalization
Viaarxiv icon

Efficient Decoder-free Object Detection with Transformers

Jun 17, 2022
Peixian Chen, Mengdan Zhang, Yunhang Shen, Kekai Sheng, Yuting Gao, Xing Sun, Ke Li, Chunhua Shen

Figure 1 for Efficient Decoder-free Object Detection with Transformers
Figure 2 for Efficient Decoder-free Object Detection with Transformers
Figure 3 for Efficient Decoder-free Object Detection with Transformers
Figure 4 for Efficient Decoder-free Object Detection with Transformers
Viaarxiv icon

ARM: Any-Time Super-Resolution Method

Mar 21, 2022
Bohong Chen, Mingbao Lin, Kekai Sheng, Mengdan Zhang, Peixian Chen, Ke Li, Liujuan Cao, Rongrong Ji

Figure 1 for ARM: Any-Time Super-Resolution Method
Figure 2 for ARM: Any-Time Super-Resolution Method
Figure 3 for ARM: Any-Time Super-Resolution Method
Figure 4 for ARM: Any-Time Super-Resolution Method
Viaarxiv icon