Alert button
Picture for Guangyan Zhang

Guangyan Zhang

Alert button

Comparing normalizing flows and diffusion models for prosody and acoustic modelling in text-to-speech

Add code
Bookmark button
Alert button
Jul 31, 2023
Guangyan Zhang, Thomas Merritt, Manuel Sam Ribeiro, Biel Tura-Vecino, Kayoko Yanagisawa, Kamil Pokora, Abdelhamid Ezzerg, Sebastian Cygert, Ammar Abbas, Piotr Bilinski, Roberto Barra-Chicote, Daniel Korzekwa, Jaime Lorenzo-Trueba

Figure 1 for Comparing normalizing flows and diffusion models for prosody and acoustic modelling in text-to-speech
Figure 2 for Comparing normalizing flows and diffusion models for prosody and acoustic modelling in text-to-speech
Figure 3 for Comparing normalizing flows and diffusion models for prosody and acoustic modelling in text-to-speech
Figure 4 for Comparing normalizing flows and diffusion models for prosody and acoustic modelling in text-to-speech
Viaarxiv icon

Creating Personalized Synthetic Voices from Post-Glossectomy Speech with Guided Diffusion Models

Add code
Bookmark button
Alert button
May 27, 2023
Yusheng Tian, Guangyan Zhang, Tan Lee

Figure 1 for Creating Personalized Synthetic Voices from Post-Glossectomy Speech with Guided Diffusion Models
Figure 2 for Creating Personalized Synthetic Voices from Post-Glossectomy Speech with Guided Diffusion Models
Figure 3 for Creating Personalized Synthetic Voices from Post-Glossectomy Speech with Guided Diffusion Models
Figure 4 for Creating Personalized Synthetic Voices from Post-Glossectomy Speech with Guided Diffusion Models
Viaarxiv icon

iEmoTTS: Toward Robust Cross-Speaker Emotion Transfer and Control for Speech Synthesis based on Disentanglement between Prosody and Timbre

Add code
Bookmark button
Alert button
Jun 29, 2022
Guangyan Zhang, Ying Qin, Wenjie Zhang, Jialun Wu, Mei Li, Yutao Gai, Feijun Jiang, Tan Lee

Figure 1 for iEmoTTS: Toward Robust Cross-Speaker Emotion Transfer and Control for Speech Synthesis based on Disentanglement between Prosody and Timbre
Figure 2 for iEmoTTS: Toward Robust Cross-Speaker Emotion Transfer and Control for Speech Synthesis based on Disentanglement between Prosody and Timbre
Figure 3 for iEmoTTS: Toward Robust Cross-Speaker Emotion Transfer and Control for Speech Synthesis based on Disentanglement between Prosody and Timbre
Figure 4 for iEmoTTS: Toward Robust Cross-Speaker Emotion Transfer and Control for Speech Synthesis based on Disentanglement between Prosody and Timbre
Viaarxiv icon

Mixed-Phoneme BERT: Improving BERT with Mixed Phoneme and Sup-Phoneme Representations for Text to Speech

Add code
Bookmark button
Alert button
Mar 31, 2022
Guangyan Zhang, Kaitao Song, Xu Tan, Daxin Tan, Yuzi Yan, Yanqing Liu, Gang Wang, Wei Zhou, Tao Qin, Tan Lee, Sheng Zhao

Figure 1 for Mixed-Phoneme BERT: Improving BERT with Mixed Phoneme and Sup-Phoneme Representations for Text to Speech
Figure 2 for Mixed-Phoneme BERT: Improving BERT with Mixed Phoneme and Sup-Phoneme Representations for Text to Speech
Figure 3 for Mixed-Phoneme BERT: Improving BERT with Mixed Phoneme and Sup-Phoneme Representations for Text to Speech
Figure 4 for Mixed-Phoneme BERT: Improving BERT with Mixed Phoneme and Sup-Phoneme Representations for Text to Speech
Viaarxiv icon

Environment Aware Text-to-Speech Synthesis

Add code
Bookmark button
Alert button
Oct 11, 2021
Daxin Tan, Guangyan Zhang, Tan Lee

Figure 1 for Environment Aware Text-to-Speech Synthesis
Figure 2 for Environment Aware Text-to-Speech Synthesis
Figure 3 for Environment Aware Text-to-Speech Synthesis
Figure 4 for Environment Aware Text-to-Speech Synthesis
Viaarxiv icon

A study on the efficacy of model pre-training in developing neural text-to-speech system

Add code
Bookmark button
Alert button
Oct 08, 2021
Guangyan Zhang, Yichong Leng, Daxin Tan, Ying Qin, Kaitao Song, Xu Tan, Sheng Zhao, Tan Lee

Figure 1 for A study on the efficacy of model pre-training in developing neural text-to-speech system
Figure 2 for A study on the efficacy of model pre-training in developing neural text-to-speech system
Figure 3 for A study on the efficacy of model pre-training in developing neural text-to-speech system
Figure 4 for A study on the efficacy of model pre-training in developing neural text-to-speech system
Viaarxiv icon

Applying the Information Bottleneck Principle to Prosodic Representation Learning

Add code
Bookmark button
Alert button
Aug 05, 2021
Guangyan Zhang, Ying Qin, Daxin Tan, Tan Lee

Figure 1 for Applying the Information Bottleneck Principle to Prosodic Representation Learning
Figure 2 for Applying the Information Bottleneck Principle to Prosodic Representation Learning
Figure 3 for Applying the Information Bottleneck Principle to Prosodic Representation Learning
Figure 4 for Applying the Information Bottleneck Principle to Prosodic Representation Learning
Viaarxiv icon

AdaSpeech 3: Adaptive Text to Speech for Spontaneous Style

Add code
Bookmark button
Alert button
Jul 06, 2021
Yuzi Yan, Xu Tan, Bohan Li, Guangyan Zhang, Tao Qin, Sheng Zhao, Yuan Shen, Wei-Qiang Zhang, Tie-Yan Liu

Figure 1 for AdaSpeech 3: Adaptive Text to Speech for Spontaneous Style
Figure 2 for AdaSpeech 3: Adaptive Text to Speech for Spontaneous Style
Figure 3 for AdaSpeech 3: Adaptive Text to Speech for Spontaneous Style
Figure 4 for AdaSpeech 3: Adaptive Text to Speech for Spontaneous Style
Viaarxiv icon

CUHK-EE Voice Cloning System for ICASSP 2021 M2VoC Challenge

Add code
Bookmark button
Alert button
Apr 03, 2021
Daxin Tan, Hingpang Huang, Guangyan Zhang, Tan Lee

Figure 1 for CUHK-EE Voice Cloning System for ICASSP 2021 M2VoC Challenge
Figure 2 for CUHK-EE Voice Cloning System for ICASSP 2021 M2VoC Challenge
Figure 3 for CUHK-EE Voice Cloning System for ICASSP 2021 M2VoC Challenge
Figure 4 for CUHK-EE Voice Cloning System for ICASSP 2021 M2VoC Challenge
Viaarxiv icon

CUHK-EE voice cloning system for ICASSP 2021 M2VoC challenge

Add code
Bookmark button
Alert button
Mar 24, 2021
Daxin Tan, Hingpang Huang, Guangyan Zhang, Tan Lee

Figure 1 for CUHK-EE voice cloning system for ICASSP 2021 M2VoC challenge
Figure 2 for CUHK-EE voice cloning system for ICASSP 2021 M2VoC challenge
Figure 3 for CUHK-EE voice cloning system for ICASSP 2021 M2VoC challenge
Figure 4 for CUHK-EE voice cloning system for ICASSP 2021 M2VoC challenge
Viaarxiv icon