Talking


Accelerating Audio Research with Robotic Dummy Heads

Add code
May 07, 2025
Viaarxiv icon

Proceedings The 13th International Workshop on Theorem proving components for Educational software

Add code
May 07, 2025
Viaarxiv icon

RGBX-DiffusionDet: A Framework for Multi-Modal RGB-X Object Detection Using DiffusionDet

Add code
May 05, 2025
Viaarxiv icon

What Is AI Safety? What Do We Want It to Be?

Add code
May 05, 2025
Viaarxiv icon

OT-Talk: Animating 3D Talking Head with Optimal Transportation

Add code
May 03, 2025
Viaarxiv icon

GenSync: A Generalized Talking Head Framework for Audio-driven Multi-Subject Lip-Sync using 3D Gaussian Splatting

Add code
May 03, 2025
Viaarxiv icon

Co$^{3}$Gesture: Towards Coherent Concurrent Co-speech 3D Gesture Generation with Interactive Diffusion

Add code
May 03, 2025
Viaarxiv icon

KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution

Add code
May 01, 2025
Viaarxiv icon

Talk Before You Retrieve: Agent-Led Discussions for Better RAG in Medical QA

Add code
Apr 30, 2025
Viaarxiv icon

IRL Dittos: Embodied Multimodal AI Agent Interactions in Open Spaces

Add code
Apr 30, 2025
Viaarxiv icon