Seongho Joo

MultiActor-Audiobook: Zero-Shot Audiobook Generation with Faces and Voices of Multiple Speakers

May 19, 2025
Via arXiv

Drift: Decoding-time Personalized Alignments with Implicit User Preferences

Feb 21, 2025
Via arXiv

DPP-TTS: Diversifying prosodic features of speech via determinantal point processes

Oct 23, 2023
Via arXiv