Image Retrieval


Generating a Paracosm for Training-Free Zero-Shot Composed Image Retrieval

Add code
Feb 03, 2026
Viaarxiv icon

ObjEmbed: Towards Universal Multimodal Object Embeddings

Add code
Feb 03, 2026
Viaarxiv icon

OmniRAG-Agent: Agentic Omnimodal Reasoning for Low-Resource Long Audio-Video Question Answering

Add code
Feb 03, 2026
Viaarxiv icon

ReCALL: Recalibrating Capability Degradation for MLLM-based Composed Image Retrieval

Add code
Feb 02, 2026
Viaarxiv icon

Failure is Feedback: History-Aware Backtracking for Agentic Traversal in Multimodal Graphs

Add code
Feb 03, 2026
Viaarxiv icon

TextME: Bridging Unseen Modalities Through Text Descriptions

Add code
Feb 03, 2026
Viaarxiv icon

Contextualized Visual Personalization in Vision-Language Models

Add code
Feb 03, 2026
Viaarxiv icon

Aligning Forest and Trees in Images and Long Captions for Visually Grounded Understanding

Add code
Feb 03, 2026
Viaarxiv icon

Origin Lens: A Privacy-First Mobile Framework for Cryptographic Image Provenance and AI Detection

Add code
Feb 03, 2026
Viaarxiv icon

ReasonEdit: Editing Vision-Language Models using Human Reasoning

Add code
Feb 03, 2026
Viaarxiv icon