Hoon-Young Cho

High Fidelity Text-to-Speech Via Discrete Tokens Using Token Transducer and Group Masked Language Model

Jun 25, 2024

Relational Proxy Loss for Audio-Text based Keyword Spotting

Jun 08, 2024

Latent Filling: Latent Space Data Augmentation for Zero-shot Speech Synthesis

Oct 05, 2023

An Empirical Study on L2 Accents of Cross-lingual Text-to-Speech Systems via Vowel Space

Nov 06, 2022

N-Singer: A Non-Autoregressive Korean Singing Voice Synthesis System for Pronunciation Enhancement

Jun 29, 2021

GANSpeech: Adversarial Training for High-Fidelity Multi-Speaker Speech Synthesis

Jun 29, 2021

Hierarchical Context-Aware Transformers for Non-Autoregressive Text to Speech

Jun 29, 2021

FastPitchFormant: Source-filter based Decomposed Modeling for Speech Synthesis

Jun 29, 2021

A Neural Text-to-Speech Model Utilizing Broadcast Data Mixed with Background Music

Mar 04, 2021

Detecting Mismatch between Text Script and Voice-over Using Utterance Verification Based on Phoneme Recognition Ranking

Mar 20, 2020