Alert button
Picture for Yiwei Guo

Yiwei Guo

Alert button

The X-LANCE Technical Report for Interspeech 2024 Speech Processing Using Discrete Speech Unit Challenge

Add code
Bookmark button
Alert button
Apr 10, 2024
Yiwei Guo, Chenrun Wang, Yifan Yang, Hankun Wang, Ziyang Ma, Chenpeng Du, Shuai Wang, Hanzheng Li, Shuai Fan, Hui Zhang, Xie Chen, Kai Yu

Viaarxiv icon

VALL-T: Decoder-Only Generative Transducer for Robust and Decoding-Controllable Text-to-Speech

Add code
Bookmark button
Alert button
Jan 30, 2024
Chenpeng Du, Yiwei Guo, Hankun Wang, Yifan Yang, Zhikang Niu, Shuai Wang, Hui Zhang, Xie Chen, Kai Yu

Viaarxiv icon

SEF-VC: Speaker Embedding Free Zero-Shot Voice Conversion with Cross Attention

Add code
Bookmark button
Alert button
Dec 14, 2023
Junjie Li, Yiwei Guo, Xie Chen, Kai Yu

Viaarxiv icon

Expressive TTS Driven by Natural Language Prompts Using Few Human Annotations

Add code
Bookmark button
Alert button
Nov 02, 2023
Hanglei Zhang, Yiwei Guo, Sen Liu, Xie Chen, Kai Yu

Viaarxiv icon

Acoustic BPE for Speech Generation with Discrete Tokens

Add code
Bookmark button
Alert button
Oct 23, 2023
Feiyu Shen, Yiwei Guo, Chenpeng Du, Xie Chen, Kai Yu

Viaarxiv icon

Leveraging Speech PTM, Text LLM, and Emotional TTS for Speech Emotion Recognition

Add code
Bookmark button
Alert button
Sep 19, 2023
Ziyang Ma, Wen Wu, Zhisheng Zheng, Yiwei Guo, Qian Chen, Shiliang Zhang, Xie Chen

Figure 1 for Leveraging Speech PTM, Text LLM, and Emotional TTS for Speech Emotion Recognition
Figure 2 for Leveraging Speech PTM, Text LLM, and Emotional TTS for Speech Emotion Recognition
Figure 3 for Leveraging Speech PTM, Text LLM, and Emotional TTS for Speech Emotion Recognition
Figure 4 for Leveraging Speech PTM, Text LLM, and Emotional TTS for Speech Emotion Recognition
Viaarxiv icon

VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching

Add code
Bookmark button
Alert button
Sep 10, 2023
Yiwei Guo, Chenpeng Du, Ziyang Ma, Xie Chen, Kai Yu

Figure 1 for VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching
Figure 2 for VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching
Figure 3 for VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching
Figure 4 for VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching
Viaarxiv icon

DSE-TTS: Dual Speaker Embedding for Cross-Lingual Text-to-Speech

Add code
Bookmark button
Alert button
Jun 25, 2023
Sen Liu, Yiwei Guo, Chenpeng Du, Xie Chen, Kai Yu

Figure 1 for DSE-TTS: Dual Speaker Embedding for Cross-Lingual Text-to-Speech
Figure 2 for DSE-TTS: Dual Speaker Embedding for Cross-Lingual Text-to-Speech
Figure 3 for DSE-TTS: Dual Speaker Embedding for Cross-Lingual Text-to-Speech
Figure 4 for DSE-TTS: Dual Speaker Embedding for Cross-Lingual Text-to-Speech
Viaarxiv icon

UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding

Add code
Bookmark button
Alert button
Jun 18, 2023
Chenpeng Du, Yiwei Guo, Feiyu Shen, Zhijun Liu, Zheng Liang, Xie Chen, Shuai Wang, Hui Zhang, Kai Yu

Figure 1 for UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding
Figure 2 for UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding
Figure 3 for UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding
Figure 4 for UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding
Viaarxiv icon