Picture for Shaofei Zhang

Shaofei Zhang

StyleSpeech: Self-supervised Style Enhancing with VQ-VAE-based Pre-training for Expressive Audiobook Speech Synthesis

Add code
Dec 19, 2023
Figure 1 for StyleSpeech: Self-supervised Style Enhancing with VQ-VAE-based Pre-training for Expressive Audiobook Speech Synthesis
Figure 2 for StyleSpeech: Self-supervised Style Enhancing with VQ-VAE-based Pre-training for Expressive Audiobook Speech Synthesis
Figure 3 for StyleSpeech: Self-supervised Style Enhancing with VQ-VAE-based Pre-training for Expressive Audiobook Speech Synthesis
Figure 4 for StyleSpeech: Self-supervised Style Enhancing with VQ-VAE-based Pre-training for Expressive Audiobook Speech Synthesis
Viaarxiv icon

MuLanTTS: The Microsoft Speech Synthesis System for Blizzard Challenge 2023

Add code
Sep 12, 2023
Figure 1 for MuLanTTS: The Microsoft Speech Synthesis System for Blizzard Challenge 2023
Figure 2 for MuLanTTS: The Microsoft Speech Synthesis System for Blizzard Challenge 2023
Figure 3 for MuLanTTS: The Microsoft Speech Synthesis System for Blizzard Challenge 2023
Figure 4 for MuLanTTS: The Microsoft Speech Synthesis System for Blizzard Challenge 2023
Viaarxiv icon

Large-Scale Automatic Audiobook Creation

Sep 07, 2023
Figure 1 for Large-Scale Automatic Audiobook Creation
Viaarxiv icon

ContextSpeech: Expressive and Efficient Text-to-Speech for Paragraph Reading

Add code
Jul 03, 2023
Figure 1 for ContextSpeech: Expressive and Efficient Text-to-Speech for Paragraph Reading
Figure 2 for ContextSpeech: Expressive and Efficient Text-to-Speech for Paragraph Reading
Figure 3 for ContextSpeech: Expressive and Efficient Text-to-Speech for Paragraph Reading
Figure 4 for ContextSpeech: Expressive and Efficient Text-to-Speech for Paragraph Reading
Viaarxiv icon

ParaTTS: Learning Linguistic and Prosodic Cross-sentence Information in Paragraph-based TTS

Add code
Sep 14, 2022
Figure 1 for ParaTTS: Learning Linguistic and Prosodic Cross-sentence Information in Paragraph-based TTS
Figure 2 for ParaTTS: Learning Linguistic and Prosodic Cross-sentence Information in Paragraph-based TTS
Figure 3 for ParaTTS: Learning Linguistic and Prosodic Cross-sentence Information in Paragraph-based TTS
Figure 4 for ParaTTS: Learning Linguistic and Prosodic Cross-sentence Information in Paragraph-based TTS
Viaarxiv icon

Self-supervised Context-aware Style Representation for Expressive Speech Synthesis

Add code
Jun 25, 2022
Figure 1 for Self-supervised Context-aware Style Representation for Expressive Speech Synthesis
Figure 2 for Self-supervised Context-aware Style Representation for Expressive Speech Synthesis
Figure 3 for Self-supervised Context-aware Style Representation for Expressive Speech Synthesis
Figure 4 for Self-supervised Context-aware Style Representation for Expressive Speech Synthesis
Viaarxiv icon