Xing Sun

Enhancing Visual Document Understanding with Contrastive Learning in Large Visual-Language Models

Feb 29, 2024
Xin Li, Yunfei Wu, Xinghua Jiang, Zhihao Guo, Mingming Gong, Haoyu Cao, Yinsong Liu, Deqiang Jiang, Xing Sun

Sinkhorn Distance Minimization for Knowledge Distillation

Feb 27, 2024
Xiao Cui, Yulei Qin, Yuting Gao, Enwei Zhang, Zihan Xu, Tong Wu, Ke Li, Xing Sun, Wengang Zhou, Houqiang Li

FIPO: Free-form Instruction-oriented Prompt Optimization with Preference Dataset and Modular Fine-tuning Schema

Feb 19, 2024
Junru Lu, Siyu An, Min Zhang, Yulan He, Di Yin, Xing Sun

A Challenger to GPT-4V? Early Explorations of Gemini in Visual Expertise

Dec 20, 2023
Chaoyou Fu, Renrui Zhang, Zihan Wang, Yubo Huang, Zhengye Zhang, Longtian Qiu, Gaoxiang Ye, Yunhang Shen, Mengdan Zhang, Peixian Chen, Sirui Zhao, Shaohui Lin, Deqiang Jiang, Di Yin, Peng Gao, Ke Li, Hongsheng Li, Xing Sun

SPD-DDPM: Denoising Diffusion Probabilistic Models in the Symmetric Positive Definite Space

Dec 13, 2023
Yunchen Li, Zhou Yu, Gaoqi He, Yunhang Shen, Ke Li, Xing Sun, Shaohui Lin

MMICT: Boosting Multi-Modal Fine-Tuning with In-Context Examples

Dec 12, 2023
Tao Chen, Enwei Zhang, Yuting Gao, Ke Li, Xing Sun, Yan Zhang, Hui Li

Aligning and Prompting Everything All at Once for Universal Visual Perception

Dec 04, 2023
Yunhang Shen, Chaoyou Fu, Peixian Chen, Mengdan Zhang, Ke Li, Xing Sun, Yunsheng Wu, Shaohui Lin, Rongrong Ji

Towards Robust Text Retrieval with Progressive Learning

Nov 20, 2023
Tong Wu, Yulei Qin, Enwei Zhang, Zihan Xu, Yuting Gao, Ke Li, Xing Sun

Woodpecker: Hallucination Correction for Multimodal Large Language Models

Oct 24, 2023
Shukang Yin, Chaoyou Fu, Sirui Zhao, Tong Xu, Hao Wang, Dianbo Sui, Yunhang Shen, Ke Li, Xing Sun, Enhong Chen

CAPro: Webly Supervised Learning with Cross-Modality Aligned Prototypes

Oct 15, 2023
Yulei Qin, Xingyu Chen, Yunhang Shen, Chaoyou Fu, Yun Gu, Ke Li, Xing Sun, Rongrong Ji
