Xinsheng Wang

Opencpop: A High-Quality Open Source Chinese Popular Song Corpus for Singing Voice Synthesis

Jan 19, 2022
Yu Wang, Xinsheng Wang, Pengcheng Zhu, Jie Wu, Hanzhao Li, Heyang Xue, Yongmao Zhang, Lei Xie, Mengxiao Bi


MsEmoTTS: Multi-scale emotion transfer, prediction, and control for emotional speech synthesis

Jan 17, 2022
Yi Lei, Shan Yang, Xinsheng Wang, Lei Xie


Multi-speaker Multi-style Text-to-speech Synthesis With Single-speaker Single-style Training Data Scenarios

Dec 23, 2021
Qicong Xie, Tao Li, Xinsheng Wang, Zhichao Wang, Lei Xie, Guoqiao Yu, Guanglu Wan


Controllable cross-speaker emotion transfer for end-to-end speech synthesis

Sep 14, 2021
Tao Li, Xinsheng Wang, Qicong Xie, Zhichao Wang, Lei Xie


AnyoneNet: Synchronized Speech and Talking Head Generation for Arbitrary Person

Aug 11, 2021
Xinsheng Wang, Qicong Xie, Jihua Zhu, Lei Xie, Odette Scharenborg


Show and Speak: Directly Synthesize Spoken Description of Images

Oct 23, 2020
Xinsheng Wang, Siyuan Feng, Jihua Zhu, Mark Hasegawa-Johnson, Odette Scharenborg


S2IGAN: Speech-to-Image Generation via Adversarial Learning

May 14, 2020
Xinsheng Wang, Tingting Qiao, Jihua Zhu, Alan Hanjalic, Odette Scharenborg


Domain segmentation and adjustment for generalized zero-shot learning

Feb 01, 2020
Xinsheng Wang, Shanmin Pang, Jihua Zhu


Competing Ratio Loss for Discriminative Multi-class Image Classification

Dec 25, 2019
Ke Zhang, Xinsheng Wang, Yurong Guo, Dongliang Chang, Zhenbing Zhao, Zhanyu Ma, Tony X. Han
