Xie Chen

Advanced Long-Content Speech Recognition With Factorized Neural Transducer

Mar 20, 2024
Xun Gong, Yu Wu, Jinyu Li, Shujie Liu, Rui Zhao, Xie Chen, Yanmin Qian

An Embarrassingly Simple Approach for LLM with Strong ASR Capacity

Feb 13, 2024
Ziyang Ma, Guanrou Yang, Yifan Yang, Zhifu Gao, Jiaming Wang, Zhihao Du, Fan Yu, Qian Chen, Siqi Zheng, Shiliang Zhang, Xie Chen

BAT: Learning to Reason about Spatial Sounds with Large Language Models

Feb 02, 2024
Zhisheng Zheng, Puyuan Peng, Ziyang Ma, Xie Chen, Eunsol Choi, David Harwath

VALL-T: Decoder-Only Generative Transducer for Robust and Decoding-Controllable Text-to-Speech

Jan 30, 2024
Chenpeng Du, Yiwei Guo, Hankun Wang, Yifan Yang, Zhikang Niu, Shuai Wang, Hui Zhang, Xie Chen, Kai Yu

ELLA-V: Stable Neural Codec Language Modeling with Alignment-guided Sequence Reordering

Jan 14, 2024
Yakun Song, Zhuo Chen, Xiaofei Wang, Ziyang Ma, Xie Chen

EAT: Self-Supervised Pre-Training with Efficient Audio Transformer

Jan 07, 2024
Wenxi Chen, Yuzhe Liang, Ziyang Ma, Zhisheng Zheng, Xie Chen

emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

Dec 23, 2023
Ziyang Ma, Zhisheng Zheng, Jiaxin Ye, Jinchao Li, Zhifu Gao, Shiliang Zhang, Xie Chen

SEF-VC: Speaker Embedding Free Zero-Shot Voice Conversion with Cross Attention

Dec 14, 2023
Junjie Li, Yiwei Guo, Xie Chen, Kai Yu

Expressive TTS Driven by Natural Language Prompts Using Few Human Annotations

Nov 02, 2023
Hanglei Zhang, Yiwei Guo, Sen Liu, Xie Chen, Kai Yu

Acoustic BPE for Speech Generation with Discrete Tokens

Oct 23, 2023
Feiyu Shen, Yiwei Guo, Chenpeng Du, Xie Chen, Kai Yu
