Alert button
Picture for Xie Chen

Xie Chen

Alert button

The X-LANCE Technical Report for Interspeech 2024 Speech Processing Using Discrete Speech Unit Challenge

Add code
Bookmark button
Alert button
Apr 10, 2024
Yiwei Guo, Chenrun Wang, Yifan Yang, Hankun Wang, Ziyang Ma, Chenpeng Du, Shuai Wang, Hanzheng Li, Shuai Fan, Hui Zhang, Xie Chen, Kai Yu

Viaarxiv icon

Quantum State Generation with Structure-Preserving Diffusion Model

Add code
Bookmark button
Alert button
Apr 09, 2024
Yuchen Zhu, Tianrong Chen, Evangelos A. Theodorou, Xie Chen, Molei Tao

Viaarxiv icon

Advanced Long-Content Speech Recognition With Factorized Neural Transducer

Add code
Bookmark button
Alert button
Mar 20, 2024
Xun Gong, Yu Wu, Jinyu Li, Shujie Liu, Rui Zhao, Xie Chen, Yanmin Qian

Figure 1 for Advanced Long-Content Speech Recognition With Factorized Neural Transducer
Figure 2 for Advanced Long-Content Speech Recognition With Factorized Neural Transducer
Figure 3 for Advanced Long-Content Speech Recognition With Factorized Neural Transducer
Figure 4 for Advanced Long-Content Speech Recognition With Factorized Neural Transducer
Viaarxiv icon

An Embarrassingly Simple Approach for LLM with Strong ASR Capacity

Add code
Bookmark button
Alert button
Feb 13, 2024
Ziyang Ma, Guanrou Yang, Yifan Yang, Zhifu Gao, Jiaming Wang, Zhihao Du, Fan Yu, Qian Chen, Siqi Zheng, Shiliang Zhang, Xie Chen

Viaarxiv icon

BAT: Learning to Reason about Spatial Sounds with Large Language Models

Add code
Bookmark button
Alert button
Feb 02, 2024
Zhisheng Zheng, Puyuan Peng, Ziyang Ma, Xie Chen, Eunsol Choi, David Harwath

Viaarxiv icon

VALL-T: Decoder-Only Generative Transducer for Robust and Decoding-Controllable Text-to-Speech

Add code
Bookmark button
Alert button
Jan 30, 2024
Chenpeng Du, Yiwei Guo, Hankun Wang, Yifan Yang, Zhikang Niu, Shuai Wang, Hui Zhang, Xie Chen, Kai Yu

Viaarxiv icon

ELLA-V: Stable Neural Codec Language Modeling with Alignment-guided Sequence Reordering

Add code
Bookmark button
Alert button
Jan 14, 2024
Yakun Song, Zhuo Chen, Xiaofei Wang, Ziyang Ma, Xie Chen

Viaarxiv icon

EAT: Self-Supervised Pre-Training with Efficient Audio Transformer

Add code
Bookmark button
Alert button
Jan 07, 2024
Wenxi Chen, Yuzhe Liang, Ziyang Ma, Zhisheng Zheng, Xie Chen

Viaarxiv icon

emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

Add code
Bookmark button
Alert button
Dec 23, 2023
Ziyang Ma, Zhisheng Zheng, Jiaxin Ye, Jinchao Li, Zhifu Gao, Shiliang Zhang, Xie Chen

Viaarxiv icon

SEF-VC: Speaker Embedding Free Zero-Shot Voice Conversion with Cross Attention

Add code
Bookmark button
Alert button
Dec 14, 2023
Junjie Li, Yiwei Guo, Xie Chen, Kai Yu

Viaarxiv icon