Get our free extension to see links to code for papers anywhere online!

 Add to Chrome

 Add to Firefox

CatalyzeX Code Finder - Browser extension linking code for ML papers across the web! | Product Hunt Embed
End-to-End Multi-speaker Speech Recognition with Transformer

Feb 13, 2020
Xuankai Chang, Wangyou Zhang, Yanmin Qian, Jonathan Le Roux, Shinji Watanabe

* To appear in ICASSP 2020 

  Access Model/Code and Paper
MIMO-SPEECH: End-to-End Multi-Channel Multi-Speaker Speech Recognition

Oct 16, 2019
Xuankai Chang, Wangyou Zhang, Yanmin Qian, Jonathan Le Roux, Shinji Watanabe

* Accepted at ASRU 2019 

  Access Model/Code and Paper
Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition

Jun 18, 2019
Xu Xiang, Shuai Wang, Houjun Huang, Yanmin Qian, Kai Yu

* not accepted by INTERSPEECH 2019 

  Access Model/Code and Paper
End-to-End Monaural Multi-speaker ASR System without Pretraining

Nov 05, 2018
Xuankai Chang, Yanmin Qian, Kai Yu, Shinji Watanabe

* submitted to ICASSP2019 

  Access Model/Code and Paper
Sequence Discriminative Training for Deep Learning based Acoustic Keyword Spotting

Aug 02, 2018
Zhehuai Chen, Yanmin Qian, Kai Yu

* Speech Communication, vol. 102, 100-111, 2018 
* accepted by Speech Communication, 08/02/2018 

  Access Model/Code and Paper
Single-Channel Multi-talker Speech Recognition with Permutation Invariant Training

Jul 19, 2017
Yanmin Qian, Xuankai Chang, Dong Yu

* 11 pages, 6 figures, Submitted to IEEE/ACM Transactions on Audio, Speech and Language Processing. arXiv admin note: text overlap with arXiv:1704.01985 

  Access Model/Code and Paper
Recognizing Multi-talker Speech with Permutation Invariant Training

Jun 19, 2017
Dong Yu, Xuankai Chang, Yanmin Qian

* 5 pages, 6 figures, InterSpeech2017 

  Access Model/Code and Paper
Very Deep Convolutional Neural Networks for Robust Speech Recognition

Oct 02, 2016
Yanmin Qian, Philip C Woodland

* accepted by SLT 2016 

  Access Model/Code and Paper