Talking Face Generation


Talking face generation is the process of generating videos of a person speaking based on an audio recording of their voice.

EmoTaG: Emotion-Aware Talking Head Synthesis on Gaussian Splatting with Few-Shot Personalization

Add code
Mar 22, 2026
Viaarxiv icon

Face-to-Face: A Video Dataset for Multi-Person Interaction Modeling

Add code
Mar 16, 2026
Viaarxiv icon

UniSync: Towards Generalizable and High-Fidelity Lip Synchronization for Challenging Scenarios

Add code
Mar 04, 2026
Viaarxiv icon

LPIPS-AttnWav2Lip: Generic Audio-Driven lip synchronization for Talking Head Generation in the Wild

Add code
Jan 30, 2026
Viaarxiv icon

MIRRORTALK: Forging Personalized Avatars Via Disentangled Style and Hierarchical Motion Control

Add code
Jan 30, 2026
Viaarxiv icon

From Blurry to Believable: Enhancing Low-quality Talking Heads with 3D Generative Priors

Add code
Feb 05, 2026
Viaarxiv icon

RSATalker: Realistic Socially-Aware Talking Head Generation for Multi-Turn Conversation

Add code
Jan 15, 2026
Viaarxiv icon

Efficient and Robust Video Defense Framework against 3D-field Personalized Talking Face

Add code
Dec 24, 2025
Viaarxiv icon

FacEDiT: Unified Talking Face Editing and Generation via Facial Motion Infilling

Add code
Dec 16, 2025
Figure 1 for FacEDiT: Unified Talking Face Editing and Generation via Facial Motion Infilling
Figure 2 for FacEDiT: Unified Talking Face Editing and Generation via Facial Motion Infilling
Figure 3 for FacEDiT: Unified Talking Face Editing and Generation via Facial Motion Infilling
Figure 4 for FacEDiT: Unified Talking Face Editing and Generation via Facial Motion Infilling
Viaarxiv icon

Generalizable and Animatable 3D Full-Head Gaussian Avatar from a Single Image

Add code
Jan 19, 2026
Viaarxiv icon