Video Understanding


TUNA: Comprehensive Fine-grained Temporal Understanding Evaluation on Dense Dynamic Videos

May 26, 2025

Two Causally Related Needles in a Video Haystack

May 26, 2025

The Role of Video Generation in Enhancing Data-Limited Action Understanding

May 26, 2025

AdaTP: Attention-Debiased Token Pruning for Video Large Language Models

May 26, 2025

Vad-R1: Towards Video Anomaly Reasoning via Perception-to-Cognition Chain-of-Thought

May 26, 2025

Dynamic-I2V: Exploring Image-to-Video Generation Models via Multimodal LLM

May 26, 2025

Sparse-to-Dense: A Free Lunch for Lossless Acceleration of Video Understanding in LLMs

May 25, 2025

TDVE-Assessor: Benchmarking and Evaluating the Quality of Text-Driven Video Editing with LMMs

May 26, 2025

Force Prompting: Video Generation Models Can Learn and Generalize Physics-based Control Signals

May 26, 2025

Multi-modal brain encoding models for multi-modal stimuli

May 26, 2025