Picture for Guangyan Zhang

Guangyan Zhang

Comparing normalizing flows and diffusion models for prosody and acoustic modelling in text-to-speech

Add code
Jul 31, 2023
Figure 1 for Comparing normalizing flows and diffusion models for prosody and acoustic modelling in text-to-speech
Figure 2 for Comparing normalizing flows and diffusion models for prosody and acoustic modelling in text-to-speech
Figure 3 for Comparing normalizing flows and diffusion models for prosody and acoustic modelling in text-to-speech
Figure 4 for Comparing normalizing flows and diffusion models for prosody and acoustic modelling in text-to-speech
Viaarxiv icon

Creating Personalized Synthetic Voices from Post-Glossectomy Speech with Guided Diffusion Models

Add code
May 27, 2023
Figure 1 for Creating Personalized Synthetic Voices from Post-Glossectomy Speech with Guided Diffusion Models
Figure 2 for Creating Personalized Synthetic Voices from Post-Glossectomy Speech with Guided Diffusion Models
Figure 3 for Creating Personalized Synthetic Voices from Post-Glossectomy Speech with Guided Diffusion Models
Figure 4 for Creating Personalized Synthetic Voices from Post-Glossectomy Speech with Guided Diffusion Models
Viaarxiv icon

iEmoTTS: Toward Robust Cross-Speaker Emotion Transfer and Control for Speech Synthesis based on Disentanglement between Prosody and Timbre

Add code
Jun 29, 2022
Figure 1 for iEmoTTS: Toward Robust Cross-Speaker Emotion Transfer and Control for Speech Synthesis based on Disentanglement between Prosody and Timbre
Figure 2 for iEmoTTS: Toward Robust Cross-Speaker Emotion Transfer and Control for Speech Synthesis based on Disentanglement between Prosody and Timbre
Figure 3 for iEmoTTS: Toward Robust Cross-Speaker Emotion Transfer and Control for Speech Synthesis based on Disentanglement between Prosody and Timbre
Figure 4 for iEmoTTS: Toward Robust Cross-Speaker Emotion Transfer and Control for Speech Synthesis based on Disentanglement between Prosody and Timbre
Viaarxiv icon

Mixed-Phoneme BERT: Improving BERT with Mixed Phoneme and Sup-Phoneme Representations for Text to Speech

Add code
Mar 31, 2022
Figure 1 for Mixed-Phoneme BERT: Improving BERT with Mixed Phoneme and Sup-Phoneme Representations for Text to Speech
Figure 2 for Mixed-Phoneme BERT: Improving BERT with Mixed Phoneme and Sup-Phoneme Representations for Text to Speech
Figure 3 for Mixed-Phoneme BERT: Improving BERT with Mixed Phoneme and Sup-Phoneme Representations for Text to Speech
Figure 4 for Mixed-Phoneme BERT: Improving BERT with Mixed Phoneme and Sup-Phoneme Representations for Text to Speech
Viaarxiv icon

Environment Aware Text-to-Speech Synthesis

Add code
Oct 11, 2021
Figure 1 for Environment Aware Text-to-Speech Synthesis
Figure 2 for Environment Aware Text-to-Speech Synthesis
Figure 3 for Environment Aware Text-to-Speech Synthesis
Figure 4 for Environment Aware Text-to-Speech Synthesis
Viaarxiv icon

A study on the efficacy of model pre-training in developing neural text-to-speech system

Add code
Oct 08, 2021
Figure 1 for A study on the efficacy of model pre-training in developing neural text-to-speech system
Figure 2 for A study on the efficacy of model pre-training in developing neural text-to-speech system
Figure 3 for A study on the efficacy of model pre-training in developing neural text-to-speech system
Figure 4 for A study on the efficacy of model pre-training in developing neural text-to-speech system
Viaarxiv icon

Applying the Information Bottleneck Principle to Prosodic Representation Learning

Add code
Aug 05, 2021
Figure 1 for Applying the Information Bottleneck Principle to Prosodic Representation Learning
Figure 2 for Applying the Information Bottleneck Principle to Prosodic Representation Learning
Figure 3 for Applying the Information Bottleneck Principle to Prosodic Representation Learning
Figure 4 for Applying the Information Bottleneck Principle to Prosodic Representation Learning
Viaarxiv icon

AdaSpeech 3: Adaptive Text to Speech for Spontaneous Style

Add code
Jul 06, 2021
Figure 1 for AdaSpeech 3: Adaptive Text to Speech for Spontaneous Style
Figure 2 for AdaSpeech 3: Adaptive Text to Speech for Spontaneous Style
Figure 3 for AdaSpeech 3: Adaptive Text to Speech for Spontaneous Style
Figure 4 for AdaSpeech 3: Adaptive Text to Speech for Spontaneous Style
Viaarxiv icon

CUHK-EE Voice Cloning System for ICASSP 2021 M2VoC Challenge

Add code
Apr 03, 2021
Figure 1 for CUHK-EE Voice Cloning System for ICASSP 2021 M2VoC Challenge
Figure 2 for CUHK-EE Voice Cloning System for ICASSP 2021 M2VoC Challenge
Figure 3 for CUHK-EE Voice Cloning System for ICASSP 2021 M2VoC Challenge
Figure 4 for CUHK-EE Voice Cloning System for ICASSP 2021 M2VoC Challenge
Viaarxiv icon

PtLnc-BXE: Prediction of plant lncRNAs using a Bagging-XGBoost-ensemble method with multiple features

Add code
Nov 01, 2019
Figure 1 for PtLnc-BXE: Prediction of plant lncRNAs using a Bagging-XGBoost-ensemble method with multiple features
Figure 2 for PtLnc-BXE: Prediction of plant lncRNAs using a Bagging-XGBoost-ensemble method with multiple features
Figure 3 for PtLnc-BXE: Prediction of plant lncRNAs using a Bagging-XGBoost-ensemble method with multiple features
Figure 4 for PtLnc-BXE: Prediction of plant lncRNAs using a Bagging-XGBoost-ensemble method with multiple features
Viaarxiv icon