Video Synchronization


Fine-grained Video Dubbing Duration Alignment with Segment Supervised Preference Optimization

Add code
Aug 12, 2025
Viaarxiv icon

StableAvatar: Infinite-Length Audio-Driven Avatar Video Generation

Add code
Aug 11, 2025
Viaarxiv icon

Commentary Generation for Soccer Highlights

Add code
Aug 11, 2025
Viaarxiv icon

AURA: A Fine-Grained Benchmark and Decomposed Metric for Audio-Visual Reasoning

Add code
Aug 10, 2025
Viaarxiv icon

MotionSwap

Add code
Aug 08, 2025
Viaarxiv icon

Scaling Up Audio-Synchronized Visual Animation: An Efficient Training Paradigm

Add code
Aug 05, 2025
Viaarxiv icon

RAP: Real-time Audio-driven Portrait Animation with Video Diffusion Transformer

Add code
Aug 07, 2025
Viaarxiv icon

AudioGen-Omni: A Unified Multimodal Diffusion Transformer for Video-Synchronized Audio, Speech, and Song Generation

Add code
Aug 01, 2025
Viaarxiv icon

Beamformed 360° Sound Maps: U-Net-Driven Acoustic Source Segmentation and Localization

Add code
Aug 01, 2025
Viaarxiv icon

Mask-Free Audio-driven Talking Face Generation for Enhanced Visual Quality and Identity Preservation

Add code
Jul 28, 2025
Viaarxiv icon