Picture for Hyeongju Kim

Hyeongju Kim

RobustSpeechFlow: Learning Robust Text-to-Speech Trajectories via Augmentation-based Contrastive Flow Matching

Add code
May 21, 2026
Viaarxiv icon

Robust TTS Training via Self-Purifying Flow Matching for the WildSpoof 2026 TTS Track

Add code
Dec 19, 2025
Figure 1 for Robust TTS Training via Self-Purifying Flow Matching for the WildSpoof 2026 TTS Track
Figure 2 for Robust TTS Training via Self-Purifying Flow Matching for the WildSpoof 2026 TTS Track
Viaarxiv icon

SupertonicTTS: Towards Highly Scalable and Efficient Text-to-Speech System

Add code
Mar 29, 2025
Figure 1 for SupertonicTTS: Towards Highly Scalable and Efficient Text-to-Speech System
Figure 2 for SupertonicTTS: Towards Highly Scalable and Efficient Text-to-Speech System
Figure 3 for SupertonicTTS: Towards Highly Scalable and Efficient Text-to-Speech System
Figure 4 for SupertonicTTS: Towards Highly Scalable and Efficient Text-to-Speech System
Viaarxiv icon

Super Monotonic Alignment Search

Add code
Sep 12, 2024
Figure 1 for Super Monotonic Alignment Search
Figure 2 for Super Monotonic Alignment Search
Figure 3 for Super Monotonic Alignment Search
Viaarxiv icon

DualSpeech: Enhancing Speaker-Fidelity and Text-Intelligibility Through Dual Classifier-Free Guidance

Add code
Aug 27, 2024
Figure 1 for DualSpeech: Enhancing Speaker-Fidelity and Text-Intelligibility Through Dual Classifier-Free Guidance
Figure 2 for DualSpeech: Enhancing Speaker-Fidelity and Text-Intelligibility Through Dual Classifier-Free Guidance
Figure 3 for DualSpeech: Enhancing Speaker-Fidelity and Text-Intelligibility Through Dual Classifier-Free Guidance
Viaarxiv icon

Towards trustworthy phoneme boundary detection with autoregressive model and improved evaluation metric

Add code
Dec 13, 2022
Figure 1 for Towards trustworthy phoneme boundary detection with autoregressive model and improved evaluation metric
Figure 2 for Towards trustworthy phoneme boundary detection with autoregressive model and improved evaluation metric
Figure 3 for Towards trustworthy phoneme boundary detection with autoregressive model and improved evaluation metric
Figure 4 for Towards trustworthy phoneme boundary detection with autoregressive model and improved evaluation metric
Viaarxiv icon

NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis

Add code
Nov 17, 2022
Viaarxiv icon

EdiTTS: Score-based Editing for Controllable Text-to-Speech

Add code
Oct 06, 2021
Figure 1 for EdiTTS: Score-based Editing for Controllable Text-to-Speech
Figure 2 for EdiTTS: Score-based Editing for Controllable Text-to-Speech
Figure 3 for EdiTTS: Score-based Editing for Controllable Text-to-Speech
Figure 4 for EdiTTS: Score-based Editing for Controllable Text-to-Speech
Viaarxiv icon

Diff-TTS: A Denoising Diffusion Model for Text-to-Speech

Add code
Apr 03, 2021
Figure 1 for Diff-TTS: A Denoising Diffusion Model for Text-to-Speech
Figure 2 for Diff-TTS: A Denoising Diffusion Model for Text-to-Speech
Figure 3 for Diff-TTS: A Denoising Diffusion Model for Text-to-Speech
Figure 4 for Diff-TTS: A Denoising Diffusion Model for Text-to-Speech
Viaarxiv icon

Continuous Monitoring of Blood Pressure with Evidential Regression

Add code
Feb 26, 2021
Figure 1 for Continuous Monitoring of Blood Pressure with Evidential Regression
Figure 2 for Continuous Monitoring of Blood Pressure with Evidential Regression
Figure 3 for Continuous Monitoring of Blood Pressure with Evidential Regression
Figure 4 for Continuous Monitoring of Blood Pressure with Evidential Regression
Viaarxiv icon