Lip Sync


IP-Adapter Is All You Need: Towards Fine-Tuning-Free Diffusion-Based Talking Face Generation

Add code
May 28, 2026
Viaarxiv icon

MTAVG-Bench 2.0: Diagnosing Failure Modes of Cinematic Expressiveness in Multi-Talker Audio-Video Generation

Add code
May 27, 2026
Viaarxiv icon

Test-Time Self-Adaptive Conditioning for Stable Audio-Driven Talking-Head Generation

Add code
May 25, 2026
Viaarxiv icon

EAD-Net: Emotion-Aware Talking Head Generation with Spatial Refinement and Temporal Coherence

Add code
Apr 25, 2026
Viaarxiv icon

Personalizing Causal Audio-Driven Facial Motion via Dynamic Multi-modal Retrieval

Add code
Apr 26, 2026
Viaarxiv icon

Talker-T2AV: Joint Talking Audio-Video Generation with Autoregressive Diffusion Modeling

Add code
Apr 26, 2026
Viaarxiv icon

PS-TTS: Phonetic Synchronization in Text-to-Speech for Achieving Natural Automated Dubbing

Add code
Apr 14, 2026
Viaarxiv icon

CoSyncDiT: Cognitive Synchronous Diffusion Transformer for Movie Dubbing

Add code
Apr 14, 2026
Viaarxiv icon

Empowering Video Translation using Multimodal Large Language Models

Add code
Apr 13, 2026
Viaarxiv icon

MMTalker: Multiresolution 3D Talking Head Synthesis with Multimodal Feature Fusion

Add code
Apr 03, 2026
Viaarxiv icon