Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

VioLA: Unified Codec Language Models for Speech Recognition, Synthesis, and Translation


May 25, 2023
Tianrui Wang, Long Zhou, Ziqiang Zhang, Yu Wu, Shujie Liu, Yashesh Gaur, Zhuo Chen, Jinyu Li, Furu Wei

Add code

* Working in progress 

   Access Paper or Ask Questions

ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text Translation


May 24, 2023
Chenyang Le, Yao Qian, Long Zhou, Shujie Liu, Michael Zeng, Xuedong Huang

Add code


   Access Paper or Ask Questions

Code-Switching Text Generation and Injection in Mandarin-English ASR


Mar 20, 2023
Haibin Yu, Yuxuan Hu, Yao Qian, Ma Jin, Linquan Liu, Shujie Liu, Yu Shi, Yanmin Qian, Edward Lin, Michael Zeng

Add code

* Accepted by ICASSP 2023 

   Access Paper or Ask Questions

Target Sound Extraction with Variable Cross-modality Clues


Mar 15, 2023
Chenda Li, Yao Qian, Zhuo Chen, Dongmei Wang, Takuya Yoshioka, Shujie Liu, Yanmin Qian, Michael Zeng

Add code

* Accepted by ICASSP 2023 

   Access Paper or Ask Questions

Speak Foreign Languages with Your Own Voice: Cross-Lingual Neural Codec Language Modeling


Mar 07, 2023
Ziqiang Zhang, Long Zhou, Chengyi Wang, Sanyuan Chen, Yu Wu, Shujie Liu, Zhuo Chen, Yanqing Liu, Huaming Wang, Jinyu Li, Lei He, Sheng Zhao, Furu Wei

Add code

* We encourage readers to listen to the audio samples on our demo page: \url{https://aka.ms/vallex

   Access Paper or Ask Questions

Building High-accuracy Multilingual ASR with Gated Language Experts and Curriculum Training


Mar 01, 2023
Eric Sun, Jinyu Li, Yuxuan Hu, Yimeng Zhu, Long Zhou, Jian Xue, Peidong Wang, Linquan Liu, Shujie Liu, Edward Lin, Yifan Gong

Add code


   Access Paper or Ask Questions

Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers


Jan 05, 2023
Chengyi Wang, Sanyuan Chen, Yu Wu, Ziqiang Zhang, Long Zhou, Shujie Liu, Zhuo Chen, Yanqing Liu, Huaming Wang, Jinyu Li, Lei He, Sheng Zhao, Furu Wei

Add code

* Working in progress 

   Access Paper or Ask Questions

BEATs: Audio Pre-Training with Acoustic Tokenizers


Dec 18, 2022
Sanyuan Chen, Yu Wu, Chengyi Wang, Shujie Liu, Daniel Tompkins, Zhuo Chen, Furu Wei

Add code


   Access Paper or Ask Questions

VATLM: Visual-Audio-Text Pre-Training with Unified Masked Prediction for Speech Representation Learning


Nov 21, 2022
Qiushi Zhu, Long Zhou, Ziqiang Zhang, Shujie Liu, Binxing Jiao, Jie Zhang, Lirong Dai, Daxin Jiang, Jinyu Li, Furu Wei

Add code

* 10 pages 

   Access Paper or Ask Questions

Exploring WavLM on Speech Enhancement


Nov 18, 2022
Hyungchan Song, Sanyuan Chen, Zhuo Chen, Yu Wu, Takuya Yoshioka, Min Tang, Jong Won Shin, Shujie Liu

Add code

* Accepted by IEEE SLT 2022 

   Access Paper or Ask Questions

1
2
3
4
5
6
7
>>