Audio Visual Synchronization


CAFA: a Controllable Automatic Foley Artist

Add code
Apr 15, 2025
Viaarxiv icon

KeyVID: Keyframe-Aware Video Diffusion for Audio-Synchronized Visual Animation

Add code
Apr 13, 2025
Viaarxiv icon

Controllable Automatic Foley Artist

Add code
Apr 09, 2025
Viaarxiv icon

TARO: Timestep-Adaptive Representation Alignment with Onset-Aware Conditioning for Synchronized Video-to-Audio Synthesis

Add code
Apr 08, 2025
Viaarxiv icon

FluentLip: A Phonemes-Based Two-stage Approach for Audio-Driven Lip Synthesis with Optical Flow Consistency

Add code
Apr 06, 2025
Viaarxiv icon

OmniTalker: Real-Time Text-Driven Talking Head Generation with In-Context Audio-Visual Style Replication

Add code
Apr 03, 2025
Viaarxiv icon

Contrastive Decoupled Representation Learning and Regularization for Speech-Preserving Facial Expression Manipulation

Add code
Apr 08, 2025
Viaarxiv icon

FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis

Add code
Apr 07, 2025
Viaarxiv icon

VoiceCraft-Dub: Automated Video Dubbing with Neural Codec Language Models

Add code
Apr 03, 2025
Viaarxiv icon

DeepAudio-V1:Towards Multi-Modal Multi-Stage End-to-End Video to Speech and Audio Generation

Add code
Mar 28, 2025
Viaarxiv icon