Na Hu

Learning Noise-independent Speech Representation for High-quality Voice Conversion for Noisy Target Speakers

Jul 02, 2022
Via arXiv

Controllable Context-aware Conversational Speech Synthesis

Jun 21, 2021
Via arXiv

VARA-TTS: Non-Autoregressive Text-to-Speech Synthesis based on Very Deep VAE with Residual Attention

Feb 12, 2021
Via arXiv

Phonetic Posteriorgrams based Many-to-Many Singing Voice Conversion via Adversarial Training

Dec 03, 2020
Via arXiv

DurIAN: Duration Informed Attention Network For Multimodal Synthesis

Sep 05, 2019
Via arXiv