Talking Face Generation


Talking face generation is the process of generating videos of a person speaking based on an audio recording of their voice.

Mask-Free Audio-driven Talking Face Generation for Enhanced Visual Quality and Identity Preservation

Add code
Jul 28, 2025
Viaarxiv icon

JOLT3D: Joint Learning of Talking Heads and 3DMM Parameters with Application to Lip-Sync

Add code
Jul 28, 2025
Viaarxiv icon

Celeb-DF++: A Large-scale Challenging Video DeepFake Benchmark for Generalizable Forensics

Add code
Jul 24, 2025
Viaarxiv icon

SyncTalk++: High-Fidelity and Efficient Synchronized Talking Heads Synthesis Using Gaussian Splatting

Add code
Jun 17, 2025
Viaarxiv icon

Video Editing for Audio-Visual Dubbing

Add code
May 29, 2025
Viaarxiv icon

RESOUND: Speech Reconstruction from Silent Videos via Acoustic-Semantic Decomposed Modeling

Add code
May 28, 2025
Viaarxiv icon

DualTalk: Dual-Speaker Interaction for 3D Talking Head Conversations

Add code
May 26, 2025
Viaarxiv icon

Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation

Add code
May 28, 2025
Viaarxiv icon

TalkingHeadBench: A Multi-Modal Benchmark & Analysis of Talking-Head DeepFake Detection

Add code
May 30, 2025
Viaarxiv icon

DisentTalk: Cross-lingual Talking Face Generation via Semantic Disentangled Diffusion Model

Add code
Mar 24, 2025
Viaarxiv icon