Title: Experimental Evaluation of Static Image Sub-Region-Based Search Models Using CLIP

Jun 07, 2025

Bastian Jäckl, Vojtěch Kloda, Daniel A. Keim, Jakub Lokoč

Abstract: Advances in multimodal text-image models have enabled effective text-based querying in extensive image collections. While these models show convincing performance for everyday life scenes, querying in highly homogeneous, specialized domains remains challenging. The primary problem is that users can often provide only vague textual descriptions, as they lack the expert knowledge to discriminate between homogeneous entities. This work investigates whether adding location-based prompts to complement these vague text queries can enhance retrieval performance. Specifically, we collected a dataset of 741 human annotations, each containing short and long textual descriptions and bounding boxes indicating regions of interest in challenging underwater scenes. Using these annotations, we evaluate the performance of CLIP when queried on various static sub-regions of images compared to the full image. Our results show that both a simple 3-by-3 partitioning and a 5-grid overlap significantly improve retrieval effectiveness and remain robust to perturbations of the annotation box.

* 14 pages, 4 figures, 2 tables
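
To make the idea of a static 3-by-3 partitioning concrete, the following is a minimal sketch, not the authors' implementation. It splits an image into nine equal cells and picks the cell that overlaps a user's bounding box the most. The function names (`grid_regions`, `overlap_area`, `best_region`) and the "largest overlap" selection rule are assumptions made for illustration; the abstract does not specify how a box is matched to a sub-region, and the 5-grid overlap layout is not defined there, so it is not sketched here.

```python
from typing import List, Tuple

# A box is (x0, y0, x1, y1) in pixel coordinates (hypothetical convention).
Box = Tuple[float, float, float, float]


def grid_regions(width: int, height: int, rows: int = 3, cols: int = 3) -> List[Box]:
    """Split an image of the given size into rows * cols equal cells, in row-major order."""
    cell_w, cell_h = width / cols, height / rows
    return [
        (c * cell_w, r * cell_h, (c + 1) * cell_w, (r + 1) * cell_h)
        for r in range(rows)
        for c in range(cols)
    ]


def overlap_area(a: Box, b: Box) -> float:
    """Area of the intersection of two boxes, or 0.0 if they do not intersect."""
    w = min(a[2], b[2]) - max(a[0], b[0])
    h = min(a[3], b[3]) - max(a[1], b[1])
    return max(w, 0.0) * max(h, 0.0)


def best_region(query_box: Box, regions: List[Box]) -> int:
    """Index of the region with the largest overlap with the query box (assumed rule)."""
    return max(range(len(regions)), key=lambda i: overlap_area(query_box, regions[i]))


regions = grid_regions(900, 600)
center_index = best_region((320, 210, 500, 380), regions)
```

In a retrieval setting, each cell would presumably be cropped and embedded separately with CLIP, and a text query would then be compared against the embedding of the selected cell instead of the full image; that step is omitted here because it depends on the specific model and library used.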
