Video Similarity


Hidden in Plain Sight: Evaluation of the Deception Detection Capabilities of LLMs in Multimodal Settings

Jun 11, 2025
Via arXiv

Safe-Sora: Safe Text-to-Video Generation via Graphical Watermarking

May 19, 2025
Via arXiv

MambaVSR: Content-Aware Scanning State Space Model for Video Super-Resolution

Jun 13, 2025
Via arXiv

VUDG: A Dataset for Video Understanding Domain Generalization

May 30, 2025
Via arXiv

CLaMR: Contextualized Late-Interaction for Multimodal Content Retrieval

Jun 06, 2025
Via arXiv

Breaking Down Video LLM Benchmarks: Knowledge, Spatial Perception, or True Temporal Understanding?

May 20, 2025
Via arXiv

Correspondence of high-dimensional emotion structures elicited by video clips between humans and Multimodal LLMs

May 19, 2025
Via arXiv

Looking Beyond Visible Cues: Implicit Video Question Answering via Dual-Clue Reasoning

Jun 09, 2025
Via arXiv

Higher fidelity perceptual image and video compression with a latent conditioned residual denoising diffusion model

May 19, 2025
Via arXiv

ZooplanktonBench: A Geo-Aware Zooplankton Recognition and Classification Dataset from Marine Observations

May 24, 2025
Via arXiv