Yu Wu

WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing


Oct 29, 2021
Sanyuan Chen, Chengyi Wang, Zhengyang Chen, Yu Wu, Shujie Liu, Zhuo Chen, Jinyu Li, Naoyuki Kanda, Takuya Yoshioka, Xiong Xiao, Jian Wu, Long Zhou, Shuo Ren, Yanmin Qian, Yao Qian, Jian Wu, Michael Zeng, Furu Wei



Internal Language Model Adaptation with Text-Only Data for End-to-End Speech Recognition


Oct 14, 2021
Zhong Meng, Yashesh Gaur, Naoyuki Kanda, Jinyu Li, Xie Chen, Yu Wu, Yifan Gong

* 5 pages, submitted to ICASSP 2022 


SpeechT5: Unified-Modal Encoder-Decoder Pre-training for Spoken Language Processing


Oct 14, 2021
Junyi Ao, Rui Wang, Long Zhou, Shujie Liu, Shuo Ren, Yu Wu, Tom Ko, Qing Li, Yu Zhang, Zhihua Wei, Yao Qian, Jinyu Li, Furu Wei

* work in process 


Large-scale Self-Supervised Speech Representation Learning for Automatic Speaker Verification


Oct 12, 2021
Zhengyang Chen, Sanyuan Chen, Yu Wu, Yao Qian, Chengyi Wang, Shujie Liu, Yanmin Qian, Michael Zeng

* submitted to ICASSP 2022 


UniSpeech-SAT: Universal Speech Representation Learning with Speaker Aware Pre-Training


Oct 12, 2021
Sanyuan Chen, Yu Wu, Chengyi Wang, Zhengyang Chen, Zhuo Chen, Shujie Liu, Jian Wu, Yao Qian, Furu Wei, Jinyu Li, Xiangzhan Yu

* ICASSP 2022 Submission 


Wav2vec-Switch: Contrastive Learning from Original-noisy Speech Pairs for Robust Speech Recognition


Oct 11, 2021
Yiming Wang, Jinyu Li, Heming Wang, Yao Qian, Chengyi Wang, Yu Wu

* 5 pages, 1 figure, submitted to ICASSP 2022 


Contrastive Video-Language Segmentation


Sep 29, 2021
Chen Liang, Yawei Luo, Yu Wu, Yi Yang



Knowledge Enhanced Fine-Tuning for Better Handling Unseen Entities in Dialogue Generation


Sep 12, 2021
Leyang Cui, Yu Wu, Shujie Liu, Yue Zhang

* EMNLP 2021 


Detecting Speaker Personas from Conversational Texts


Sep 03, 2021
Jia-Chen Gu, Zhen-Hua Ling, Yu Wu, Quan Liu, Zhigang Chen, Xiaodan Zhu

* Accepted by EMNLP 2021 


Flexible Clustered Federated Learning for Client-Level Data Distribution Shift


Aug 22, 2021
Moming Duan, Duo Liu, Xinyuan Ji, Yu Wu, Liang Liang, Xianzhang Chen, Yujuan Tan

* Manuscript under review. arXiv admin note: substantial text overlap with arXiv:2010.06870 


UniSpeech at scale: An Empirical Study of Pre-training Method on Large-Scale Speech Recognition Dataset


Jul 12, 2021
Chengyi Wang, Yu Wu, Shujie Liu, Jinyu Li, Yao Qian, Kenichi Kumatani, Furu Wei



Investigation of Practical Aspects of Single Channel Speech Separation for ASR


Jul 05, 2021
Jian Wu, Zhuo Chen, Sanyuan Chen, Yu Wu, Takuya Yoshioka, Naoyuki Kanda, Shujie Liu, Jinyu Li

* Accepted by Interspeech 2021 


Saying the Unseen: Video Descriptions via Dialog Agents


Jun 26, 2021
Ye Zhu, Yu Wu, Yi Yang, Yan Yan

* Accepted as a regular paper at TPAMI 2021 


Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition


Jun 04, 2021
Zhong Meng, Yu Wu, Naoyuki Kanda, Liang Lu, Xie Chen, Guoli Ye, Eric Sun, Jinyu Li, Yifan Gong

* Interspeech 2021, Brno, Czech Republic 
* 5 pages, Interspeech 2021 


Template-Based Named Entity Recognition Using BART


Jun 03, 2021
Leyang Cui, Yu Wu, Jian Liu, Sen Yang, Yue Zhang



Rethinking Cross-modal Interaction from a Top-down Perspective for Referring Video Object Segmentation


Jun 02, 2021
Chen Liang, Yu Wu, Tianfei Zhou, Wenguan Wang, Zongxin Yang, Yunchao Wei, Yi Yang



Large-Scale Pre-Training of End-to-End Multi-Talker ASR for Meeting Transcription with Single Distant Microphone


Apr 12, 2021
Naoyuki Kanda, Guoli Ye, Yu Wu, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka

* Submitted to INTERSPEECH 2021 


ClawCraneNet: Leveraging Object-level Relation for Text-based Video Segmentation


Mar 19, 2021
Chen Liang, Yu Wu, Yawei Luo, Yi Yang



Learning Audio-Visual Correlations from Variational Cross-Modal Generation


Feb 14, 2021
Ye Zhu, Yu Wu, Hugo Latapie, Yi Yang, Yan Yan

* Accepted to ICASSP 2021 


UniSpeech: Unified Speech Representation Learning with Labeled and Unlabeled Data


Jan 19, 2021
Chengyi Wang, Yu Wu, Yao Qian, Kenichi Kumatani, Shujie Liu, Furu Wei, Michael Zeng, Xuedong Huang



Learning to Anticipate Egocentric Actions by Imagination


Jan 19, 2021
Yu Wu, Linchao Zhu, Xiaohan Wang, Yi Yang, Fei Wu

* IEEE Transactions on Image Processing, vol. 30, pp. 1143-1152, 2021 
* Accepted to IEEE Transactions on Image Processing (TIP) 


Don't shoot butterfly with rifles: Multi-channel Continuous Speech Separation with Early Exit Transformer


Oct 23, 2020
Sanyuan Chen, Yu Wu, Zhuo Chen, Takuya Yoshioka, Shujie Liu, Jinyu Li



Developing Real-time Streaming Transformer Transducer for Speech Recognition on Large-scale Dataset


Oct 22, 2020
Xie Chen, Yu Wu, Zhenghao Wang, Shujie Liu, Jinyu Li

* 5 pages 


Describing Unseen Videos via Multi-Modal Cooperative Dialog Agents


Aug 22, 2020
Ye Zhu, Yu Wu, Yi Yang, Yan Yan

* ECCV2020 


Describing Unseen Videos via Multi-Modal Cooperative Dialog Agents


Aug 18, 2020
Ye Zhu, Yu Wu, Yi Yang, Yan Yan

* ECCV2020 
