Chunfeng Wang

Mega-TTS 2: Zero-Shot Text-to-Speech with Arbitrary Length Speech Prompts

Jul 14, 2023
Via arXiv

GenerTTS: Pronunciation Disentanglement for Timbre and Style Generalization in Cross-Lingual Text-to-Speech

Jun 27, 2023
Via arXiv

Mega-TTS: Zero-Shot Text-to-Speech at Scale with Intrinsic Inductive Bias

Jun 06, 2023
Via arXiv

StyleS2ST: Zero-shot Style Transfer for Direct Speech-to-speech Translation

Jun 01, 2023
Via arXiv

LiteG2P: A fast, light and high accuracy model for grapheme-to-phoneme conversion

Mar 02, 2023
Via arXiv

Overcoming Classifier Imbalance for Long-tail Object Detection with Balanced Group Softmax

Jun 18, 2020
Via arXiv