Hyung-Seok Oh

VibE-SVC: Vibrato Extraction with High-frequency F0 Contour for Singing Voice Conversion

May 27, 2025

DiEmo-TTS: Disentangled Emotion Representations via Self-Supervised Distillation for Cross-Speaker Emotion Transfer in Text-to-Speech

May 26, 2025

EmoSphere-SER: Enhancing Speech Emotion Recognition Through Spherical Representation with Auxiliary Classification

May 26, 2025

JELLY: Joint Emotion Recognition and Context Reasoning with LLMs for Conversational Speech Synthesis

Jan 09, 2025

EmoSphere++: Emotion-Controllable Zero-Shot Text-to-Speech via Emotion-Adaptive Spherical Vector

Nov 04, 2024

EmoSphere-TTS: Emotional Style and Intensity Modeling via Spherical Emotion Vector for Controllable Emotional Text-to-Speech

Jun 12, 2024

DiffProsody: Diffusion-based Latent Prosody Generation for Expressive Speech Synthesis with Prosody Conditional Adversarial Training

Jul 31, 2023

HierVST: Hierarchical Adaptive Zero-shot Voice Style Transfer

Jul 30, 2023