Video To Text Retrieval


EagleNet: Energy-Aware Fine-Grained Relationship Learning Network for Text-Video Retrieval

Mar 26, 2026
Via arXiv

Cluster-Wise Spatio-Temporal Masking for Efficient Video-Language Pretraining

Mar 24, 2026
Via arXiv

Knowledge-Refined Dual Context-Aware Network for Partially Relevant Video Retrieval

Mar 25, 2026
Via arXiv

Mamba-VMR: Multimodal Query Augmentation via Generated Videos for Precise Temporal Grounding

Mar 23, 2026
Via arXiv

ForeSea: AI Forensic Search with Multi-modal Queries for Video Surveillance

Mar 24, 2026
Via arXiv

Attention-aware Inference Optimizations for Large Vision-Language Models with Memory-efficient Decoding

Mar 25, 2026
Via arXiv

Leum-VL Technical Report

Mar 20, 2026
Via arXiv

CoVR-R: Reason-Aware Composed Video Retrieval

Mar 20, 2026
Via arXiv

GenState-AI: State-Aware Dataset for Text-to-Video Retrieval on AI-Generated Videos

Mar 15, 2026
Via arXiv

SAVE: Speech-Aware Video Representation Learning for Video-Text Retrieval

Mar 11, 2026
Via arXiv