Sanyuan Chen

WavLLM: Towards Robust and Adaptive Speech Large Language Model

Mar 31, 2024
Shujie Hu, Long Zhou, Shujie Liu, Sanyuan Chen, Hongkun Hao, Jing Pan, Xunying Liu, Jinyu Li, Sunit Sivasankaran, Linquan Liu, Furu Wei

SpeechX: Neural Codec Language Model as a Versatile Speech Transformer

Aug 14, 2023
Xiaofei Wang, Manthan Thakker, Zhuo Chen, Naoyuki Kanda, Sefik Emre Eskimez, Sanyuan Chen, Min Tang, Shujie Liu, Jinyu Li, Takuya Yoshioka

Speak Foreign Languages with Your Own Voice: Cross-Lingual Neural Codec Language Modeling

Mar 07, 2023
Ziqiang Zhang, Long Zhou, Chengyi Wang, Sanyuan Chen, Yu Wu, Shujie Liu, Zhuo Chen, Yanqing Liu, Huaming Wang, Jinyu Li, Lei He, Sheng Zhao, Furu Wei

Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers

Jan 05, 2023
Chengyi Wang, Sanyuan Chen, Yu Wu, Ziqiang Zhang, Long Zhou, Shujie Liu, Zhuo Chen, Yanqing Liu, Huaming Wang, Jinyu Li, Lei He, Sheng Zhao, Furu Wei

BEATs: Audio Pre-Training with Acoustic Tokenizers

Dec 18, 2022
Sanyuan Chen, Yu Wu, Chengyi Wang, Shujie Liu, Daniel Tompkins, Zhuo Chen, Furu Wei

TESSP: Text-Enhanced Self-Supervised Speech Pre-training

Nov 24, 2022
Zhuoyuan Yao, Shuo Ren, Sanyuan Chen, Ziyang Ma, Pengcheng Guo, Lei Xie

Exploring WavLM on Speech Enhancement

Nov 18, 2022
Hyungchan Song, Sanyuan Chen, Zhuo Chen, Yu Wu, Takuya Yoshioka, Min Tang, Jong Won Shin, Shujie Liu

SpeechLM: Enhanced Speech Pre-Training with Unpaired Textual Data

Sep 30, 2022
Ziqiang Zhang, Sanyuan Chen, Long Zhou, Yu Wu, Shuo Ren, Shujie Liu, Zhuoyuan Yao, Xun Gong, Lirong Dai, Jinyu Li, Furu Wei

Supervision-Guided Codebooks for Masked Prediction in Speech Pre-training

Jun 21, 2022
Chengyi Wang, Yiming Wang, Yu Wu, Sanyuan Chen, Jinyu Li, Shujie Liu, Furu Wei

Ultra Fast Speech Separation Model with Teacher Student Learning

Apr 27, 2022
Sanyuan Chen, Yu Wu, Zhuo Chen, Jian Wu, Takuya Yoshioka, Shujie Liu, Jinyu Li, Xiangzhan Yu
