Cross Modal Information Retrieval


Modality Curation: Building Universal Embeddings for Advanced Multimodal Information Retrieval

Add code
May 26, 2025
Viaarxiv icon

DocMMIR: A Framework for Document Multi-modal Information Retrieval

Add code
May 25, 2025
Viaarxiv icon

Co-AttenDWG: Co-Attentive Dimension-Wise Gating and Expert Fusion for Multi-Modal Offensive Content Detection

Add code
May 25, 2025
Viaarxiv icon

Benchmarking Retrieval-Augmented Multimomal Generation for Document Question Answering

Add code
May 22, 2025
Viaarxiv icon

DetailFusion: A Dual-branch Framework with Detail Enhancement for Composed Image Retrieval

Add code
May 23, 2025
Viaarxiv icon

Representation Discrepancy Bridging Method for Remote Sensing Image-Text Retrieval

Add code
May 22, 2025
Viaarxiv icon

Unified Cross-modal Translation of Score Images, Symbolic Music, and Performance Audio

Add code
May 19, 2025
Viaarxiv icon

DrugPilot: LLM-based Parameterized Reasoning Agent for Drug Discovery

Add code
May 20, 2025
Viaarxiv icon

Towards Cross-modal Retrieval in Chinese Cultural Heritage Documents: Dataset and Solution

Add code
May 16, 2025
Viaarxiv icon

OMGM: Orchestrate Multiple Granularities and Modalities for Efficient Multimodal Retrieval

Add code
May 10, 2025
Viaarxiv icon