Picture for Heeseung Kim

Heeseung Kim

EdiText: Controllable Coarse-to-Fine Text Editing with Diffusion Language Models

Add code
Feb 27, 2025
Viaarxiv icon

Does Your Voice Assistant Remember? Analyzing Conversational Context Recall and Utilization in Voice Interaction Models

Add code
Feb 27, 2025
Viaarxiv icon

Large-Scale Text-to-Image Model with Inpainting is a Zero-Shot Subject-Driven Image Generator

Add code
Nov 23, 2024
Viaarxiv icon

Style-Friendly SNR Sampler for Style-Driven Generation

Add code
Nov 22, 2024
Figure 1 for Style-Friendly SNR Sampler for Style-Driven Generation
Figure 2 for Style-Friendly SNR Sampler for Style-Driven Generation
Figure 3 for Style-Friendly SNR Sampler for Style-Driven Generation
Figure 4 for Style-Friendly SNR Sampler for Style-Driven Generation
Viaarxiv icon

NanoVoice: Efficient Speaker-Adaptive Text-to-Speech for Multiple Speakers

Add code
Sep 24, 2024
Viaarxiv icon

VoiceGuider: Enhancing Out-of-Domain Performance in Parameter-Efficient Speaker-Adaptive Text-to-Speech via Autoguidance

Add code
Sep 24, 2024
Viaarxiv icon

VoiceTailor: Lightweight Plug-In Adapter for Diffusion-Based Personalized Text-to-Speech

Add code
Aug 27, 2024
Figure 1 for VoiceTailor: Lightweight Plug-In Adapter for Diffusion-Based Personalized Text-to-Speech
Figure 2 for VoiceTailor: Lightweight Plug-In Adapter for Diffusion-Based Personalized Text-to-Speech
Figure 3 for VoiceTailor: Lightweight Plug-In Adapter for Diffusion-Based Personalized Text-to-Speech
Figure 4 for VoiceTailor: Lightweight Plug-In Adapter for Diffusion-Based Personalized Text-to-Speech
Viaarxiv icon

HyperCLOVA X Technical Report

Add code
Apr 13, 2024
Viaarxiv icon

Unified Speech-Text Pretraining for Spoken Dialog Modeling

Add code
Feb 08, 2024
Figure 1 for Unified Speech-Text Pretraining for Spoken Dialog Modeling
Figure 2 for Unified Speech-Text Pretraining for Spoken Dialog Modeling
Figure 3 for Unified Speech-Text Pretraining for Spoken Dialog Modeling
Figure 4 for Unified Speech-Text Pretraining for Spoken Dialog Modeling
Viaarxiv icon

UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed Data

Add code
Jun 28, 2023
Viaarxiv icon