Alert button
Picture for Chenpeng Du

Chenpeng Du

Alert button

The X-LANCE Technical Report for Interspeech 2024 Speech Processing Using Discrete Speech Unit Challenge

Add code
Bookmark button
Alert button
Apr 10, 2024
Yiwei Guo, Chenrun Wang, Yifan Yang, Hankun Wang, Ziyang Ma, Chenpeng Du, Shuai Wang, Hanzheng Li, Shuai Fan, Hui Zhang, Xie Chen, Kai Yu

Viaarxiv icon

VALL-T: Decoder-Only Generative Transducer for Robust and Decoding-Controllable Text-to-Speech

Add code
Bookmark button
Alert button
Jan 30, 2024
Chenpeng Du, Yiwei Guo, Hankun Wang, Yifan Yang, Zhikang Niu, Shuai Wang, Hui Zhang, Xie Chen, Kai Yu

Viaarxiv icon

DiffDub: Person-generic Visual Dubbing Using Inpainting Renderer with Diffusion Auto-encoder

Add code
Bookmark button
Alert button
Nov 03, 2023
Tao Liu, Chenpeng Du, Shuai Fan, Feilong Chen, Kai Yu

Viaarxiv icon

Acoustic BPE for Speech Generation with Discrete Tokens

Add code
Bookmark button
Alert button
Oct 23, 2023
Feiyu Shen, Yiwei Guo, Chenpeng Du, Xie Chen, Kai Yu

Viaarxiv icon

Towards Universal Speech Discrete Tokens: A Case Study for ASR and TTS

Add code
Bookmark button
Alert button
Sep 14, 2023
Yifan Yang, Feiyu Shen, Chenpeng Du, Ziyang Ma, Kai Yu, Daniel Povey, Xie Chen

Figure 1 for Towards Universal Speech Discrete Tokens: A Case Study for ASR and TTS
Figure 2 for Towards Universal Speech Discrete Tokens: A Case Study for ASR and TTS
Figure 3 for Towards Universal Speech Discrete Tokens: A Case Study for ASR and TTS
Figure 4 for Towards Universal Speech Discrete Tokens: A Case Study for ASR and TTS
Viaarxiv icon

VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching

Add code
Bookmark button
Alert button
Sep 10, 2023
Yiwei Guo, Chenpeng Du, Ziyang Ma, Xie Chen, Kai Yu

Figure 1 for VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching
Figure 2 for VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching
Figure 3 for VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching
Figure 4 for VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching
Viaarxiv icon

DSE-TTS: Dual Speaker Embedding for Cross-Lingual Text-to-Speech

Add code
Bookmark button
Alert button
Jun 25, 2023
Sen Liu, Yiwei Guo, Chenpeng Du, Xie Chen, Kai Yu

Figure 1 for DSE-TTS: Dual Speaker Embedding for Cross-Lingual Text-to-Speech
Figure 2 for DSE-TTS: Dual Speaker Embedding for Cross-Lingual Text-to-Speech
Figure 3 for DSE-TTS: Dual Speaker Embedding for Cross-Lingual Text-to-Speech
Figure 4 for DSE-TTS: Dual Speaker Embedding for Cross-Lingual Text-to-Speech
Viaarxiv icon

UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding

Add code
Bookmark button
Alert button
Jun 18, 2023
Chenpeng Du, Yiwei Guo, Feiyu Shen, Zhijun Liu, Zheng Liang, Xie Chen, Shuai Wang, Hui Zhang, Kai Yu

Figure 1 for UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding
Figure 2 for UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding
Figure 3 for UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding
Figure 4 for UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding
Viaarxiv icon

Improving Code-Switching and Named Entity Recognition in ASR with Speech Editing based Data Augmentation

Add code
Bookmark button
Alert button
Jun 14, 2023
Zheng Liang, Zheshu Song, Ziyang Ma, Chenpeng Du, Kai Yu, Xie Chen

Figure 1 for Improving Code-Switching and Named Entity Recognition in ASR with Speech Editing based Data Augmentation
Figure 2 for Improving Code-Switching and Named Entity Recognition in ASR with Speech Editing based Data Augmentation
Figure 3 for Improving Code-Switching and Named Entity Recognition in ASR with Speech Editing based Data Augmentation
Figure 4 for Improving Code-Switching and Named Entity Recognition in ASR with Speech Editing based Data Augmentation
Viaarxiv icon