Xinfa Zhu

Accent-VITS:accent transfer for end-to-end TTS

Dec 29, 2023
Linhan Ma, Yongmao Zhang, Xinfa Zhu, Yi Lei, Ziqian Ning, Pengcheng Zhu, Lei Xie

SELM: Speech Enhancement Using Discrete Tokens and Language Models

Dec 15, 2023
Ziqian Wang, Xinfa Zhu, Zihan Zhang, YuanJun Lv, Ning Jiang, Guoqing Zhao, Lei Xie

SponTTS: modeling and transferring spontaneous style for TTS

Nov 13, 2023
Hanzhao Li, Xinfa Zhu, Liumeng Xue, Yang Song, Yunlin Chen, Lei Xie

Multi-Speaker Expressive Speech Synthesis via Semi-supervised Contrastive Learning

Oct 26, 2023
Xinfa Zhu, Yuke Li, Yi Lei, Ning Jiang, Guoqing Zhao, Lei Xie

Vec-Tok Speech: speech vectorization and tokenization for neural speech generation

Oct 12, 2023
Xinfa Zhu, Yuanjun Lv, Yi Lei, Tao Li, Wendi He, Hongbin Zhou, Heng Lu, Lei Xie

U-Style: Cascading U-nets with Multi-level Speaker and Style Modeling for Zero-Shot Voice Cloning

Oct 06, 2023
Tao Li, Zhichao Wang, Xinfa Zhu, Jian Cong, Qiao Tian, Yuping Wang, Lei Xie

Zero-Shot Emotion Transfer For Cross-Lingual Speech Synthesis

Oct 06, 2023
Yuke Li, Xinfa Zhu, Yi Lei, Hai Li, Junhui Liu, Danming Xie, Lei Xie

HiGNN-TTS: Hierarchical Prosody Modeling with Graph Neural Networks for Expressive Long-form TTS

Sep 25, 2023
Dake Guo, Xinfa Zhu, Liumeng Xue, Tao Li, Yuanjun Lv, Yuepeng Jiang, Lei Xie

DiCLET-TTS: Diffusion Model based Cross-lingual Emotion Transfer for Text-to-Speech -- A Study between English and Mandarin

Sep 02, 2023
Tao Li, Chenxu Hu, Jian Cong, Xinfa Zhu, Jingbei Li, Qiao Tian, Yuping Wang, Lei Xie
