
Shaofei Zhang

StyleSpeech: Self-supervised Style Enhancing with VQ-VAE-based Pre-training for Expressive Audiobook Speech Synthesis

Dec 19, 2023
Xueyuan Chen, Xi Wang, Shaofei Zhang, Lei He, Zhiyong Wu, Xixin Wu, Helen Meng

Via arXiv

MuLanTTS: The Microsoft Speech Synthesis System for Blizzard Challenge 2023

Sep 12, 2023
Zhihang Xu, Shaofei Zhang, Xi Wang, Jiajun Zhang, Wenning Wei, Lei He, Sheng Zhao

Via arXiv

Large-Scale Automatic Audiobook Creation

Sep 07, 2023
Brendan Walsh, Mark Hamilton, Greg Newby, Xi Wang, Serena Ruan, Sheng Zhao, Lei He, Shaofei Zhang, Eric Dettinger, William T. Freeman, Markus Weimer

Via arXiv

ContextSpeech: Expressive and Efficient Text-to-Speech for Paragraph Reading

Jul 03, 2023
Yujia Xiao, Shaofei Zhang, Xi Wang, Xu Tan, Lei He, Sheng Zhao, Frank K. Soong, Tan Lee

Via arXiv

ParaTTS: Learning Linguistic and Prosodic Cross-sentence Information in Paragraph-based TTS

Sep 14, 2022
Liumeng Xue, Frank K. Soong, Shaofei Zhang, Lei Xie

Via arXiv

Self-supervised Context-aware Style Representation for Expressive Speech Synthesis

Jun 25, 2022
Yihan Wu, Xi Wang, Shaofei Zhang, Lei He, Ruihua Song, Jian-Yun Nie

Via arXiv