Talking Head Generation


Talking head generation is the task of synthesizing video of a person speaking, driven by an audio recording of their voice.

KLASSify to Verify: Audio-Visual Deepfake Detection Using SSL-based Audio and Handcrafted Visual Features

Aug 10, 2025

RAP: Real-time Audio-driven Portrait Animation with Video Diffusion Transformer

Aug 07, 2025

JOLT3D: Joint Learning of Talking Heads and 3DMM Parameters with Application to Lip-Sync

Jul 28, 2025

Is It Really You? Exploring Biometric Verification Scenarios in Photorealistic Talking-Head Avatar Videos

Aug 01, 2025

FixTalk: Taming Identity Leakage for High-Quality Talking Head Generation in Extreme Cases

Jul 02, 2025

MEDTalk: Multimodal Controlled 3D Facial Animation with Dynamic Emotions by Disentangled Embedding

Jul 08, 2025

Bind-Your-Avatar: Multi-Talking-Character Video Generation with Dynamic 3D-mask-based Embedding Router

Jun 24, 2025

FaceEditTalker: Interactive Talking Head Generation with Facial Attribute Editing

May 28, 2025

GGTalker: Talking Head Systhesis with Generalizable Gaussian Priors and Identity-Specific Adaptation

Jun 26, 2025

DualTalk: Dual-Speaker Interaction for 3D Talking Head Conversations

May 26, 2025