Talking


Asymmetric Hierarchical Anchoring for Audio-Visual Joint Representation: Resolving Information Allocation Ambiguity for Robust Cross-Modal Generalization

Feb 03, 2026
Via arXiv

Making Avatars Interact: Towards Text-Driven Human-Object Interaction for Controllable Talking Avatars

Feb 02, 2026
Via arXiv

Verification Required: The Impact of Information Credibility on AI Persuasion

Feb 01, 2026
Via arXiv

JoyAvatar: Unlocking Highly Expressive Avatars via Harmonized Text-Audio Conditioning

Jan 31, 2026
Via arXiv

Should LLMs, $\textit{like}$, Generate How Users Talk? Building Dialect-Accurate Dialog[ue]s Beyond the American Default with MDial

Jan 30, 2026
Via arXiv

MIRRORTALK: Forging Personalized Avatars Via Disentangled Style and Hierarchical Motion Control

Jan 30, 2026
Via arXiv

LPIPS-AttnWav2Lip: Generic Audio-Driven lip synchronization for Talking Head Generation in the Wild

Jan 30, 2026
Via arXiv

Lightweight High-Fidelity Low-Bitrate Talking Face Compression for 3D Video Conference

Jan 29, 2026
Via arXiv

EditYourself: Audio-Driven Generation and Manipulation of Talking Head Videos with Diffusion Transformers

Jan 29, 2026
Via arXiv

Small Talk, Big Impact: The Energy Cost of Thanking AI

Jan 29, 2026
Via arXiv