Picture for Shan Ning

Shan Ning

DA-DPO: Cost-efficient Difficulty-aware Preference Optimization for Reducing MLLM Hallucinations

Add code
Jan 02, 2026
Viaarxiv icon

Mining Fine-Grained Image-Text Alignment for Zero-Shot Captioning via Text-Only Training

Add code
Jan 04, 2024
Figure 1 for Mining Fine-Grained Image-Text Alignment for Zero-Shot Captioning via Text-Only Training
Figure 2 for Mining Fine-Grained Image-Text Alignment for Zero-Shot Captioning via Text-Only Training
Figure 3 for Mining Fine-Grained Image-Text Alignment for Zero-Shot Captioning via Text-Only Training
Figure 4 for Mining Fine-Grained Image-Text Alignment for Zero-Shot Captioning via Text-Only Training
Viaarxiv icon

HOICLIP: Efficient Knowledge Transfer for HOI Detection with Vision-Language Models

Add code
Mar 29, 2023
Viaarxiv icon