End-to-End Multi-speaker Speech Recognition with Transformer

Feb 13, 2020
Xuankai Chang, Wangyou Zhang, Yanmin Qian, Jonathan Le Roux, Shinji Watanabe

* To appear in ICASSP 2020 

MIMO-SPEECH: End-to-End Multi-Channel Multi-Speaker Speech Recognition

Oct 16, 2019
Xuankai Chang, Wangyou Zhang, Yanmin Qian, Jonathan Le Roux, Shinji Watanabe

* Accepted at ASRU 2019 

Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition

Jun 18, 2019
Xu Xiang, Shuai Wang, Houjun Huang, Yanmin Qian, Kai Yu

* Not accepted by INTERSPEECH 2019

End-to-End Monaural Multi-speaker ASR System without Pretraining

Nov 05, 2018
Xuankai Chang, Yanmin Qian, Kai Yu, Shinji Watanabe

* Submitted to ICASSP 2019

Sequence Discriminative Training for Deep Learning based Acoustic Keyword Spotting

Aug 02, 2018
Zhehuai Chen, Yanmin Qian, Kai Yu

* Speech Communication, vol. 102, pp. 100-111, 2018
* Accepted by Speech Communication, 08/02/2018

Single-Channel Multi-talker Speech Recognition with Permutation Invariant Training

Jul 19, 2017
Yanmin Qian, Xuankai Chang, Dong Yu

* 11 pages, 6 figures, submitted to IEEE/ACM Transactions on Audio, Speech, and Language Processing. arXiv admin note: text overlap with arXiv:1704.01985

Recognizing Multi-talker Speech with Permutation Invariant Training

Jun 19, 2017
Dong Yu, Xuankai Chang, Yanmin Qian

* 5 pages, 6 figures, Interspeech 2017

Very Deep Convolutional Neural Networks for Robust Speech Recognition

Oct 02, 2016
Yanmin Qian, Philip C Woodland

* Accepted by SLT 2016

