Alert button
Picture for Xing Sun

Xing Sun

Alert button

MME: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models

Add code
Bookmark button
Alert button
Jul 02, 2023
Chaoyou Fu, Peixian Chen, Yunhang Shen, Yulei Qin, Mengdan Zhang, Xu Lin, Zhenyu Qiu, Wei Lin, Jinrui Yang, Xiawu Zheng, Ke Li, Xing Sun, Rongrong Ji

Figure 1 for MME: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models
Figure 2 for MME: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models
Figure 3 for MME: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models
Figure 4 for MME: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models
Viaarxiv icon

A Survey on Multimodal Large Language Models

Add code
Bookmark button
Alert button
Jun 23, 2023
Shukang Yin, Chaoyou Fu, Sirui Zhao, Ke Li, Xing Sun, Tong Xu, Enhong Chen

Figure 1 for A Survey on Multimodal Large Language Models
Figure 2 for A Survey on Multimodal Large Language Models
Figure 3 for A Survey on Multimodal Large Language Models
Figure 4 for A Survey on Multimodal Large Language Models
Viaarxiv icon

Looking and Listening: Audio Guided Text Recognition

Add code
Bookmark button
Alert button
Jun 06, 2023
Wenwen Yu, Mingyu Liu, Biao Yang, Enming Zhang, Deqiang Jiang, Xing Sun, Yuliang Liu, Xiang Bai

Figure 1 for Looking and Listening: Audio Guided Text Recognition
Figure 2 for Looking and Listening: Audio Guided Text Recognition
Figure 3 for Looking and Listening: Audio Guided Text Recognition
Figure 4 for Looking and Listening: Audio Guided Text Recognition
Viaarxiv icon

ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document Images

Add code
Bookmark button
Alert button
Jun 05, 2023
Wenwen Yu, Chengquan Zhang, Haoyu Cao, Wei Hua, Bohan Li, Huang Chen, Mingyu Liu, Mingrui Chen, Jianfeng Kuang, Mengjun Cheng, Yuning Du, Shikun Feng, Xiaoguang Hu, Pengyuan Lyu, Kun Yao, Yuechen Yu, Yuliang Liu, Wanxiang Che, Errui Ding, Cheng-Lin Liu, Jiebo Luo, Shuicheng Yan, Min Zhang, Dimosthenis Karatzas, Xing Sun, Jingdong Wang, Xiang Bai

Figure 1 for ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document Images
Figure 2 for ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document Images
Figure 3 for ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document Images
Figure 4 for ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document Images
Viaarxiv icon

SoftCLIP: Softer Cross-modal Alignment Makes CLIP Stronger

Add code
Bookmark button
Alert button
Mar 30, 2023
Yuting Gao, Jinfeng Liu, Zihan Xu, Tong Wu, Wei Liu, Jie Yang, Ke Li, Xing Sun

Figure 1 for SoftCLIP: Softer Cross-modal Alignment Makes CLIP Stronger
Figure 2 for SoftCLIP: Softer Cross-modal Alignment Makes CLIP Stronger
Figure 3 for SoftCLIP: Softer Cross-modal Alignment Makes CLIP Stronger
Figure 4 for SoftCLIP: Softer Cross-modal Alignment Makes CLIP Stronger
Viaarxiv icon

Grab What You Need: Rethinking Complex Table Structure Recognition with Flexible Components Deliberation

Add code
Bookmark button
Alert button
Mar 16, 2023
Hao Liu, Xin Li, Mingming Gong, Bing Liu, Yunfei Wu, Deqiang Jiang, Yinsong Liu, Xing Sun

Figure 1 for Grab What You Need: Rethinking Complex Table Structure Recognition with Flexible Components Deliberation
Figure 2 for Grab What You Need: Rethinking Complex Table Structure Recognition with Flexible Components Deliberation
Figure 3 for Grab What You Need: Rethinking Complex Table Structure Recognition with Flexible Components Deliberation
Figure 4 for Grab What You Need: Rethinking Complex Table Structure Recognition with Flexible Components Deliberation
Viaarxiv icon

Co-Salient Object Detection with Co-Representation Purification

Add code
Bookmark button
Alert button
Mar 14, 2023
Ziyue Zhu, Zhao Zhang, Zheng Lin, Xing Sun, Ming-Ming Cheng

Figure 1 for Co-Salient Object Detection with Co-Representation Purification
Figure 2 for Co-Salient Object Detection with Co-Representation Purification
Figure 3 for Co-Salient Object Detection with Co-Representation Purification
Figure 4 for Co-Salient Object Detection with Co-Representation Purification
Viaarxiv icon

Efficient Decoder-free Object Detection with Transformers

Add code
Bookmark button
Alert button
Jun 17, 2022
Peixian Chen, Mengdan Zhang, Yunhang Shen, Kekai Sheng, Yuting Gao, Xing Sun, Ke Li, Chunhua Shen

Figure 1 for Efficient Decoder-free Object Detection with Transformers
Figure 2 for Efficient Decoder-free Object Detection with Transformers
Figure 3 for Efficient Decoder-free Object Detection with Transformers
Figure 4 for Efficient Decoder-free Object Detection with Transformers
Viaarxiv icon

Training-free Transformer Architecture Search

Add code
Bookmark button
Alert button
Mar 23, 2022
Qinqin Zhou, Kekai Sheng, Xiawu Zheng, Ke Li, Xing Sun, Yonghong Tian, Jie Chen, Rongrong Ji

Figure 1 for Training-free Transformer Architecture Search
Figure 2 for Training-free Transformer Architecture Search
Figure 3 for Training-free Transformer Architecture Search
Figure 4 for Training-free Transformer Architecture Search
Viaarxiv icon