Xie Chen

Expressive TTS Driven by Natural Language Prompts Using Few Human Annotations

Nov 02, 2023
Hanglei Zhang, Yiwei Guo, Sen Liu, Xie Chen, Kai Yu

Acoustic BPE for Speech Generation with Discrete Tokens

Oct 23, 2023
Feiyu Shen, Yiwei Guo, Chenpeng Du, Xie Chen, Kai Yu

Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning

Sep 29, 2023
Guanrou Yang, Ziyang Ma, Zhisheng Zheng, Yakun Song, Zhikang Niu, Xie Chen

Leveraging Speech PTM, Text LLM, and Emotional TTS for Speech Emotion Recognition

Sep 19, 2023
Ziyang Ma, Wen Wu, Zhisheng Zheng, Yiwei Guo, Qian Chen, Shiliang Zhang, Xie Chen

Improved Factorized Neural Transducer Model For text-only Domain Adaptation

Sep 18, 2023
Junzhe Liu, Jianwei Yu, Xie Chen

Incorporating Class-based Language Model for Named Entity Recognition in Factorized Neural Transducer

Sep 14, 2023
Peng Wang, Yifan Yang, Zheng Liang, Tian Tan, Shiliang Zhang, Xie Chen

Towards Universal Speech Discrete Tokens: A Case Study for ASR and TTS

Sep 14, 2023
Yifan Yang, Feiyu Shen, Chenpeng Du, Ziyang Ma, Kai Yu, Daniel Povey, Xie Chen

VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching

Sep 10, 2023
Yiwei Guo, Chenpeng Du, Ziyang Ma, Xie Chen, Kai Yu

Unsupervised Active Learning: Optimizing Labeling Cost-Effectiveness for Automatic Speech Recognition

Aug 28, 2023
Zhisheng Zheng, Ziyang Ma, Yu Wang, Xie Chen

Towards Effective and Compact Contextual Representation for Conformer Transducer Speech Recognition Systems

Jun 26, 2023
Mingyu Cui, Jiawen Kang, Jiajun Deng, Xi Yin, Yutao Xie, Xie Chen, Xunying Liu
