Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Xiaofei Wang

Personalized Speech Enhancement: New Models and Comprehensive Evaluation


Oct 18, 2021
Sefik Emre Eskimez, Takuya Yoshioka, Huaming Wang, Xiaofei Wang, Zhuo Chen, Xuedong Huang


  Access Paper or Ask Questions

All-neural beamformer for continuous speech separation


Oct 13, 2021
Zhuohuang Zhang, Takuya Yoshioka, Naoyuki Kanda, Zhuo Chen, Xiaofei Wang, Dongmei Wang, Sefik Emre Eskimez

* 5 pages, 3 figures, 2 tables 

  Access Paper or Ask Questions

VarArray: Array-Geometry-Agnostic Continuous Speech Separation


Oct 12, 2021
Takuya Yoshioka, Xiaofei Wang, Dongmei Wang, Min Tang, Zirun Zhu, Zhuo Chen, Naoyuki Kanda

* 5 pages, 1 figure, 3 tables, submitted to ICASSP 2022 

  Access Paper or Ask Questions

Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers using End-to-End Speaker-Attributed ASR


Oct 07, 2021
Naoyuki Kanda, Xiong Xiao, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka

* Submitted to ICASSP 2022 

  Access Paper or Ask Questions

Skill Preferences: Learning to Extract and Execute Robotic Skills from Human Feedback


Aug 11, 2021
Xiaofei Wang, Kimin Lee, Kourosh Hakhamaneshi, Pieter Abbeel, Michael Laskin

* 8 pages,6 figures. for associated code and video, see http://sites.google.com/view/skill-pref 

  Access Paper or Ask Questions

A Comparative Study of Modular and Joint Approaches for Speaker-Attributed ASR on Monaural Long-Form Audio


Jul 06, 2021
Naoyuki Kanda, Xiong Xiao, Jian Wu, Tianyan Zhou, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka

* Submitted to ASRU 2021 

  Access Paper or Ask Questions

Human Listening and Live Captioning: Multi-Task Training for Speech Enhancement


Jun 05, 2021
Sefik Emre Eskimez, Xiaofei Wang, Min Tang, Hemin Yang, Zirun Zhu, Zhuo Chen, Huaming Wang, Takuya Yoshioka

* Accepted to INTERSPEECH2021 

  Access Paper or Ask Questions

Large-Scale Pre-Training of End-to-End Multi-Talker ASR for Meeting Transcription with Single Distant Microphone


Apr 12, 2021
Naoyuki Kanda, Guoli Ye, Yu Wu, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka

* Submitted to INTERSPEECH 2021 

  Access Paper or Ask Questions

End-to-End Speaker-Attributed ASR with Transformer


Apr 05, 2021
Naoyuki Kanda, Guoli Ye, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka

* Submitted to INTERSPEECH 2021 

  Access Paper or Ask Questions

Continuous Speech Separation with Ad Hoc Microphone Arrays


Mar 03, 2021
Dongmei Wang, Takuya Yoshioka, Zhuo Chen, Xiaofei Wang, Tianyan Zhou, Zhong Meng


  Access Paper or Ask Questions

Tailored Learning-Based Scheduling for Kubernetes-Oriented Edge-Cloud System


Jan 17, 2021
Yiwen Han, Shihao Shen, Xiaofei Wang, Shiqiang Wang, Victor C. M. Leung

* IEEE INFOCOM 2021 

  Access Paper or Ask Questions

EC-SAGINs: Edge Computing-enhanced Space-Air-Ground Integrated Networks for Internet of Vehicles


Jan 15, 2021
Shuai Yu, Xiaowen Gong, Qian Shi, Xiaofei Wang, Xu Chen

* The paper is accepted by IEEE IoTJ, Jan. 2021 

  Access Paper or Ask Questions

Reinforcement Learning with Latent Flow


Jan 06, 2021
Wenling Shang, Xiaofei Wang, Aravind Srinivas, Aravind Rajeswaran, Yang Gao, Pieter Abbeel, Michael Laskin


  Access Paper or Ask Questions

Hypothesis Stitcher for End-to-End Speaker-attributed ASR on Long-form Multi-talker Recordings


Jan 06, 2021
Xuankai Chang, Naoyuki Kanda, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Takuya Yoshioka

* Submitted to ICASSP 2021 

  Access Paper or Ask Questions

Minimum Bayes Risk Training for End-to-End Speaker-Attributed ASR


Nov 03, 2020
Naoyuki Kanda, Zhong Meng, Liang Lu, Yashesh Gaur, Xiaofei Wang, Zhuo Chen, Takuya Yoshioka

* Submitted to ICASSP 2021. arXiv admin note: text overlap with arXiv:2006.10930, arXiv:2008.04546 

  Access Paper or Ask Questions

OCT-GAN: Single Step Shadow and Noise Removal from Optical Coherence Tomography Images of the Human Optic Nerve Head


Oct 06, 2020
Haris Cheong, Sripad Krishna Devalla, Thanadet Chuangsuwanich, Tin A. Tun, Xiaofei Wang, Tin Aung, Leopold Schmetterer, Martin L. Buist, Craig Boote, Alexandre H. Thiéry, Michaël J. A. Girard

* 20 pages, 7 figures 

  Access Paper or Ask Questions

Investigation of End-To-End Speaker-Attributed ASR for Continuous Multi-Talker Recordings


Aug 11, 2020
Naoyuki Kanda, Xuankai Chang, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka


  Access Paper or Ask Questions

Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of Any Number of Speakers


Jun 19, 2020
Naoyuki Kanda, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Tianyan Zhou, Takuya Yoshioka

* Submitted to INTERSPEECH 2020 

  Access Paper or Ask Questions

Federated Learning for 6G Communications: Challenges, Methods, and Future Directions


Jun 04, 2020
Yi Liu, Xingliang Yuan, Zehui Xiong, Jiawen Kang, Xiaofei Wang, Dusit Niyato


  Access Paper or Ask Questions

Serialized Output Training for End-to-End Overlapped Speech Recognition


Mar 28, 2020
Naoyuki Kanda, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Takuya Yoshioka

* Submitted to INTERSPEECH 2020 

  Access Paper or Ask Questions

Global Adaptive Generative Adjustment


Nov 06, 2019
Bin Wang, Xiaofei Wang, Jianhua Guo


  Access Paper or Ask Questions

A practical two-stage training strategy for multi-stream end-to-end speech recognition


Oct 23, 2019
Ruizhi Li, Gregory Sell, Xiaofei Wang, Shinji Watanabe, Hynek Hermansky

* submitted to ICASSP 2019 

  Access Paper or Ask Questions

DeshadowGAN: A Deep Learning Approach to Remove Shadows from Optical Coherence Tomography Images


Oct 07, 2019
Haris Cheong, Sripad Krishna Devalla, Tan Hung Pham, Zhang Liang, Tin Aung Tun, Xiaofei Wang, Shamira Perera, Leopold Schmetterer, Aung Tin, Craig Boote, Alexandre H. Thiery, Michael J. A. Girard


  Access Paper or Ask Questions

A Comparative Study on Transformer vs RNN in Speech Applications


Sep 28, 2019
Shigeki Karita, Nanxin Chen, Tomoki Hayashi, Takaaki Hori, Hirofumi Inaguma, Ziyan Jiang, Masao Someki, Nelson Enrique Yalta Soplin, Ryuichi Yamamoto, Xiaofei Wang, Shinji Watanabe, Takenori Yoshimura, Wangyou Zhang

* IEEE Automatic Speech Recognition and Understanding Workshop 2019 
* Accepted at ASRU 2019 

  Access Paper or Ask Questions

Convergence of Edge Computing and Deep Learning: A Comprehensive Survey


Jul 19, 2019
Yiwen Han, Xiaofei Wang, Victor C. M. Leung, Dusit Niyato, Xueqiang Yan, Xu Chen

* This paper has been submitted to IEEE Communications Surveys and Tutorials for possible publication 

  Access Paper or Ask Questions

Multi-Stream End-to-End Speech Recognition


Jun 17, 2019
Ruizhi Li, Xiaofei Wang, Sri Harish Mallidi, Shinji Watanabe, Takaaki Hori, Hynek Hermansky

* submitted to IEEE TASLP. arXiv admin note: substantial text overlap with arXiv:1811.04897, arXiv:1811.04903 

  Access Paper or Ask Questions

An Investigation of End-to-End Multichannel Speech Recognition for Reverberant and Mismatch Conditions


Apr 28, 2019
Aswin Shanmugam Subramanian, Xiaofei Wang, Shinji Watanabe, Toru Taniguchi, Dung Tran, Yuya Fujita


  Access Paper or Ask Questions