Picture for Xirong Li

Xirong Li

Multi-Object Sketch Animation by Scene Decomposition and Motion Planning

Add code
Mar 25, 2025
Viaarxiv icon

FunBench: Benchmarking Fundus Reading Skills of MLLMs

Add code
Mar 02, 2025
Viaarxiv icon

Convolutional Prompting for Broad-Domain Retinal Vessel Segmentation

Add code
Dec 24, 2024
Viaarxiv icon

Mitigating Hallucination in Multimodal Large Language Model via Hallucination-targeted Direct Preference Optimization

Add code
Nov 15, 2024
Viaarxiv icon

Beyond Coarse-Grained Matching in Video-Text Retrieval

Add code
Oct 17, 2024
Figure 1 for Beyond Coarse-Grained Matching in Video-Text Retrieval
Figure 2 for Beyond Coarse-Grained Matching in Video-Text Retrieval
Figure 3 for Beyond Coarse-Grained Matching in Video-Text Retrieval
Figure 4 for Beyond Coarse-Grained Matching in Video-Text Retrieval
Viaarxiv icon

Magnifier Prompt: Tackling Multimodal Hallucination via Extremely Simple Instructions

Add code
Oct 15, 2024
Figure 1 for Magnifier Prompt: Tackling Multimodal Hallucination via Extremely Simple Instructions
Figure 2 for Magnifier Prompt: Tackling Multimodal Hallucination via Extremely Simple Instructions
Figure 3 for Magnifier Prompt: Tackling Multimodal Hallucination via Extremely Simple Instructions
Figure 4 for Magnifier Prompt: Tackling Multimodal Hallucination via Extremely Simple Instructions
Viaarxiv icon

D&M: Enriching E-commerce Videos with Sound Effects by Key Moment Detection and SFX Matching

Add code
Aug 23, 2024
Viaarxiv icon

ASR-enhanced Multimodal Representation Learning for Cross-Domain Product Retrieval

Add code
Aug 06, 2024
Figure 1 for ASR-enhanced Multimodal Representation Learning for Cross-Domain Product Retrieval
Figure 2 for ASR-enhanced Multimodal Representation Learning for Cross-Domain Product Retrieval
Figure 3 for ASR-enhanced Multimodal Representation Learning for Cross-Domain Product Retrieval
Figure 4 for ASR-enhanced Multimodal Representation Learning for Cross-Domain Product Retrieval
Viaarxiv icon

PhD: A Prompted Visual Hallucination Evaluation Dataset

Add code
Mar 17, 2024
Figure 1 for PhD: A Prompted Visual Hallucination Evaluation Dataset
Figure 2 for PhD: A Prompted Visual Hallucination Evaluation Dataset
Figure 3 for PhD: A Prompted Visual Hallucination Evaluation Dataset
Figure 4 for PhD: A Prompted Visual Hallucination Evaluation Dataset
Viaarxiv icon

Adaptive Fusion of Radiomics and Deep Features for Lung Adenocarcinoma Subtype Recognition

Add code
Aug 27, 2023
Viaarxiv icon