Text To Video Search


ForeSea: AI Forensic Search with Multi-modal Queries for Video Surveillance

Add code
Mar 24, 2026
Viaarxiv icon

CoVR-R:Reason-Aware Composed Video Retrieval

Add code
Mar 20, 2026
Viaarxiv icon

The Unreasonable Effectiveness of Text Embedding Interpolation for Continuous Image Steering

Add code
Mar 18, 2026
Viaarxiv icon

AMES: Approximate Multi-modal Enterprise Search via Late Interaction Retrieval

Add code
Mar 13, 2026
Viaarxiv icon

LatSearch: Latent Reward-Guided Search for Faster Inference-Time Scaling in Video Diffusion

Add code
Mar 15, 2026
Viaarxiv icon

VQQA: An Agentic Approach for Video Evaluation and Quality Improvement

Add code
Mar 12, 2026
Viaarxiv icon

LLandMark: A Multi-Agent Framework for Landmark-Aware Multimodal Interactive Video Retrieval

Add code
Mar 03, 2026
Viaarxiv icon

WISE: A Multimodal Search Engine for Visual Scenes, Audio, Objects, Faces, Speech, and Metadata

Add code
Feb 13, 2026
Viaarxiv icon

ShotFinder: Imagination-Driven Open-Domain Video Shot Retrieval via Web Search

Add code
Jan 30, 2026
Viaarxiv icon

AD-MIR: Bridging the Gap from Perception to Persuasion in Advertising Video Understanding via Structured Reasoning

Add code
Feb 07, 2026
Viaarxiv icon