Video Alignment


Video alignment is the process of synchronizing or aligning multiple video sequences to create a coherent timeline or narrative.

SPKLIP: Aligning Spike Video Streams with Natural Language

Add code
May 19, 2025
Viaarxiv icon

HiERO: understanding the hierarchy of human behavior enhances reasoning on egocentric videos

Add code
May 19, 2025
Viaarxiv icon

PLAICraft: Large-Scale Time-Aligned Vision-Speech-Action Dataset for Embodied AI

Add code
May 19, 2025
Viaarxiv icon

RoPECraft: Training-Free Motion Transfer with Trajectory-Guided RoPE Optimization on Diffusion Transformers

Add code
May 19, 2025
Viaarxiv icon

SurveillanceVQA-589K: A Benchmark for Comprehensive Surveillance Video-Language Understanding with Large Models

Add code
May 19, 2025
Viaarxiv icon

Rebalancing Contrastive Alignment with Learnable Semantic Gaps in Text-Video Retrieval

Add code
May 18, 2025
Viaarxiv icon

SafeVid: Toward Safety Aligned Video Large Multimodal Models

Add code
May 17, 2025
Viaarxiv icon

Recollection from Pensieve: Novel View Synthesis via Learning from Uncalibrated Videos

Add code
May 19, 2025
Viaarxiv icon

LOVE: Benchmarking and Evaluating Text-to-Video Generation and Video-to-Text Interpretation

Add code
May 17, 2025
Viaarxiv icon

VideoRFT: Incentivizing Video Reasoning Capability in MLLMs via Reinforced Fine-Tuning

Add code
May 18, 2025
Viaarxiv icon