Picture for Ji-Hyun Lee

Ji-Hyun Lee

High Fidelity Text-to-Speech Via Discrete Tokens Using Token Transducer and Group Masked Language Model

Add code
Jun 25, 2024
Viaarxiv icon

Latent Filling: Latent Space Data Augmentation for Zero-shot Speech Synthesis

Add code
Oct 05, 2023
Figure 1 for Latent Filling: Latent Space Data Augmentation for Zero-shot Speech Synthesis
Figure 2 for Latent Filling: Latent Space Data Augmentation for Zero-shot Speech Synthesis
Figure 3 for Latent Filling: Latent Space Data Augmentation for Zero-shot Speech Synthesis
Figure 4 for Latent Filling: Latent Space Data Augmentation for Zero-shot Speech Synthesis
Viaarxiv icon

GC-TTS: Few-shot Speaker Adaptation with Geometric Constraints

Add code
Aug 16, 2021
Figure 1 for GC-TTS: Few-shot Speaker Adaptation with Geometric Constraints
Figure 2 for GC-TTS: Few-shot Speaker Adaptation with Geometric Constraints
Figure 3 for GC-TTS: Few-shot Speaker Adaptation with Geometric Constraints
Figure 4 for GC-TTS: Few-shot Speaker Adaptation with Geometric Constraints
Viaarxiv icon

Fre-GAN: Adversarial Frequency-consistent Audio Synthesis

Add code
Jun 14, 2021
Figure 1 for Fre-GAN: Adversarial Frequency-consistent Audio Synthesis
Figure 2 for Fre-GAN: Adversarial Frequency-consistent Audio Synthesis
Figure 3 for Fre-GAN: Adversarial Frequency-consistent Audio Synthesis
Figure 4 for Fre-GAN: Adversarial Frequency-consistent Audio Synthesis
Viaarxiv icon