Picture for Chenye Cui

Chenye Cui

RMSSinger: Realistic-Music-Score based Singing Voice Synthesis

Add code
May 18, 2023
Figure 1 for RMSSinger: Realistic-Music-Score based Singing Voice Synthesis
Figure 2 for RMSSinger: Realistic-Music-Score based Singing Voice Synthesis
Figure 3 for RMSSinger: Realistic-Music-Score based Singing Voice Synthesis
Figure 4 for RMSSinger: Realistic-Music-Score based Singing Voice Synthesis
Viaarxiv icon

VarietySound: Timbre-Controllable Video to Sound Generation via Unsupervised Information Disentanglement

Add code
Nov 19, 2022
Figure 1 for VarietySound: Timbre-Controllable Video to Sound Generation via Unsupervised Information Disentanglement
Figure 2 for VarietySound: Timbre-Controllable Video to Sound Generation via Unsupervised Information Disentanglement
Figure 3 for VarietySound: Timbre-Controllable Video to Sound Generation via Unsupervised Information Disentanglement
Figure 4 for VarietySound: Timbre-Controllable Video to Sound Generation via Unsupervised Information Disentanglement
Viaarxiv icon

ProDiff: Progressive Fast Diffusion Model For High-Quality Text-to-Speech

Add code
Jul 13, 2022
Figure 1 for ProDiff: Progressive Fast Diffusion Model For High-Quality Text-to-Speech
Figure 2 for ProDiff: Progressive Fast Diffusion Model For High-Quality Text-to-Speech
Figure 3 for ProDiff: Progressive Fast Diffusion Model For High-Quality Text-to-Speech
Figure 4 for ProDiff: Progressive Fast Diffusion Model For High-Quality Text-to-Speech
Viaarxiv icon

GenerSpeech: Towards Style Transfer for Generalizable Out-Of-Domain Text-to-Speech Synthesis

Add code
May 15, 2022
Figure 1 for GenerSpeech: Towards Style Transfer for Generalizable Out-Of-Domain Text-to-Speech Synthesis
Figure 2 for GenerSpeech: Towards Style Transfer for Generalizable Out-Of-Domain Text-to-Speech Synthesis
Figure 3 for GenerSpeech: Towards Style Transfer for Generalizable Out-Of-Domain Text-to-Speech Synthesis
Figure 4 for GenerSpeech: Towards Style Transfer for Generalizable Out-Of-Domain Text-to-Speech Synthesis
Viaarxiv icon

Multi-Singer: Fast Multi-Singer Singing Voice Vocoder With A Large-Scale Corpus

Add code
Dec 20, 2021
Figure 1 for Multi-Singer: Fast Multi-Singer Singing Voice Vocoder With A Large-Scale Corpus
Figure 2 for Multi-Singer: Fast Multi-Singer Singing Voice Vocoder With A Large-Scale Corpus
Figure 3 for Multi-Singer: Fast Multi-Singer Singing Voice Vocoder With A Large-Scale Corpus
Figure 4 for Multi-Singer: Fast Multi-Singer Singing Voice Vocoder With A Large-Scale Corpus
Viaarxiv icon

SingGAN: Generative Adversarial Network For High-Fidelity Singing Voice Generation

Add code
Oct 26, 2021
Figure 1 for SingGAN: Generative Adversarial Network For High-Fidelity Singing Voice Generation
Figure 2 for SingGAN: Generative Adversarial Network For High-Fidelity Singing Voice Generation
Figure 3 for SingGAN: Generative Adversarial Network For High-Fidelity Singing Voice Generation
Figure 4 for SingGAN: Generative Adversarial Network For High-Fidelity Singing Voice Generation
Viaarxiv icon

EMOVIE: A Mandarin Emotion Speech Dataset with a Simple Emotional Text-to-Speech Model

Add code
Jun 17, 2021
Figure 1 for EMOVIE: A Mandarin Emotion Speech Dataset with a Simple Emotional Text-to-Speech Model
Figure 2 for EMOVIE: A Mandarin Emotion Speech Dataset with a Simple Emotional Text-to-Speech Model
Figure 3 for EMOVIE: A Mandarin Emotion Speech Dataset with a Simple Emotional Text-to-Speech Model
Figure 4 for EMOVIE: A Mandarin Emotion Speech Dataset with a Simple Emotional Text-to-Speech Model
Viaarxiv icon