Talking Face Generation


Talking face generation is the process of generating videos of a person speaking based on an audio recording of their voice.

Video Editing for Audio-Visual Dubbing

Add code
May 29, 2025
Viaarxiv icon

TalkingHeadBench: A Multi-Modal Benchmark & Analysis of Talking-Head DeepFake Detection

Add code
May 30, 2025
Viaarxiv icon

RESOUND: Speech Reconstruction from Silent Videos via Acoustic-Semantic Decomposed Modeling

Add code
May 28, 2025
Viaarxiv icon

Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation

Add code
May 28, 2025
Viaarxiv icon

DualTalk: Dual-Speaker Interaction for 3D Talking Head Conversations

Add code
May 26, 2025
Viaarxiv icon

DisentTalk: Cross-lingual Talking Face Generation via Semantic Disentangled Diffusion Model

Add code
Mar 24, 2025
Viaarxiv icon

PC-Talk: Precise Facial Animation Control for Audio-Driven Talking Face Generation

Add code
Mar 20, 2025
Viaarxiv icon

UniSync: A Unified Framework for Audio-Visual Synchronization

Add code
Mar 20, 2025
Viaarxiv icon

Perceptually Accurate 3D Talking Head Generation: New Definitions, Speech-Mesh Representation, and Evaluation Metrics

Add code
Mar 27, 2025
Viaarxiv icon

ChatAnyone: Stylized Real-time Portrait Video Generation with Hierarchical Motion Diffusion Model

Add code
Mar 27, 2025
Viaarxiv icon