Get our free extension to see links to code for papers anywhere online!

 Add to Chrome

 Add to Firefox

CatalyzeX Code Finder - Browser extension linking code for ML papers across the web! | Product Hunt Embed
Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of Any Number of Speakers

Jun 19, 2020
Naoyuki Kanda, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Tianyan Zhou, Takuya Yoshioka

* Submitted to INTERSPEECH 2020 

  Access Model/Code and Paper
CHiME-6 Challenge:Tackling Multispeaker Speech Recognition for Unsegmented Recordings

May 02, 2020
Shinji Watanabe, Michael Mandel, Jon Barker, Emmanuel Vincent, Ashish Arora, Xuankai Chang, Sanjeev Khudanpur, Vimal Manohar, Daniel Povey, Desh Raj, David Snyder, Aswin Shanmugam Subramanian, Jan Trmal, Bar Ben Yair, Christoph Boeddeker, Zhaoheng Ni, Yusuke Fujita, Shota Horiguchi, Naoyuki Kanda, Takuya Yoshioka, Neville Ryant


  Access Model/Code and Paper
Neural Speech Separation Using Spatially Distributed Microphones

Apr 28, 2020
Dongmei Wang, Zhuo Chen, Takuya Yoshioka

* 5 pages, 2 figures, Interspeech2020 

  Access Model/Code and Paper
Serialized Output Training for End-to-End Overlapped Speech Recognition

Mar 28, 2020
Naoyuki Kanda, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Takuya Yoshioka

* Submitted to INTERSPEECH 2020 

  Access Model/Code and Paper
Continuous speech separation: dataset and analysis

Jan 30, 2020
Zhuo Chen, Takuya Yoshioka, Liang Lu, Tianyan Zhou, Zhong Meng, Yi Luo, Jian Wu, Jinyu Li


  Access Model/Code and Paper
Advances in Online Audio-Visual Meeting Transcription

Dec 10, 2019
Takuya Yoshioka, Igor Abramovski, Cem Aksoylar, Zhuo Chen, Moshe David, Dimitrios Dimitriadis, Yifan Gong, Ilya Gurvich, Xuedong Huang, Yan Huang, Aviv Hurvitz, Li Jiang, Sharon Koubi, Eyal Krupka, Ido Leichter, Changliang Liu, Partha Parthasarathy, Alon Vinnikov, Lingfeng Wu, Xiong Xiao, Wayne Xiong, Huaming Wang, Zhenghao Wang, Jun Zhang, Yong Zhao, Tianyan Zhou

* To appear in Proc. IEEE ASRU Workshop 2019 

  Access Model/Code and Paper
End-to-end Microphone Permutation and Number Invariant Multi-channel Speech Separation

Nov 26, 2019
Yi Luo, Zhuo Chen, Nima Mesgarani, Takuya Yoshioka


  Access Model/Code and Paper
Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation

Oct 14, 2019
Yi Luo, Zhuo Chen, Takuya Yoshioka


  Access Model/Code and Paper
DOVER: A Method for Combining Diarization Outputs

Sep 17, 2019
Andreas Stolcke, Takuya Yoshioka

* To appear in Proc. IEEE ASRU Workshop 2019 

  Access Model/Code and Paper
Meeting Transcription Using Virtual Microphone Arrays

May 03, 2019
Takuya Yoshioka, Zhuo Chen, Dimitrios Dimitriadis, William Hinthorn, Xuedong Huang, Andreas Stolcke, Michael Zeng


  Access Model/Code and Paper
Low-Latency Speaker-Independent Continuous Speech Separation

Apr 13, 2019
Takuya Yoshioka, Zhuo Chen, Changliang Liu, Xiong Xiao, Hakan Erdogan, Dimitrios Dimitriadis


  Access Model/Code and Paper
Recognizing Overlapped Speech in Meetings: A Multichannel Separation Approach Using Neural Networks

Oct 08, 2018
Takuya Yoshioka, Hakan Erdogan, Zhuo Chen, Xiong Xiao, Fil Alleva

* Proc. Interspeech 2018, 3038-3042 

  Access Model/Code and Paper