Content Based Image Retrieval


Content-based image retrieval is a well-studied problem in computer vision, with retrieval problems generally divided into two groups:category-level retrieval and instance-level retrieval. Given a query image of the Sydney Harbour bridge, for instance, category-level retrieval aims to find any bridge in a given dataset of images, whilst instance-level retrieval must find the Sydney Harbour bridge to be considered a match.

VDocRAG: Retrieval-Augmented Generation over Visually-Rich Documents

Add code
Apr 14, 2025
Viaarxiv icon

The Future of MLLM Prompting is Adaptive: A Comprehensive Experimental Evaluation of Prompt Engineering Methods for Robust Multimodal Performance

Add code
Apr 14, 2025
Viaarxiv icon

REJEPA: A Novel Joint-Embedding Predictive Architecture for Efficient Remote Sensing Image Retrieval

Add code
Apr 04, 2025
Viaarxiv icon

RS-RAG: Bridging Remote Sensing Imagery and Comprehensive Knowledge with a Multi-Modal Dataset and Retrieval-Augmented Generation Model

Add code
Apr 07, 2025
Viaarxiv icon

IDMR: Towards Instance-Driven Precise Visual Correspondence in Multimodal Retrieval

Add code
Apr 01, 2025
Viaarxiv icon

Comparative Analysis of Image, Video, and Audio Classifiers for Automated News Video Segmentation

Add code
Mar 27, 2025
Viaarxiv icon

WikiAutoGen: Towards Multi-Modal Wikipedia-Style Article Generation

Add code
Mar 24, 2025
Viaarxiv icon

Missing Target-Relevant Information Prediction with World Model for Accurate Zero-Shot Composed Image Retrieval

Add code
Mar 21, 2025
Viaarxiv icon

Fine-grained Textual Inversion Network for Zero-Shot Composed Image Retrieval

Add code
Mar 25, 2025
Viaarxiv icon

BizGen: Advancing Article-level Visual Text Rendering for Infographics Generation

Add code
Mar 26, 2025
Viaarxiv icon