Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Takuya Yoshioka

A Comparative Study of Modular and Joint Approaches for Speaker-Attributed ASR on Monaural Long-Form Audio


Jul 06, 2021
Naoyuki Kanda, Xiong Xiao, Jian Wu, Tianyan Zhou, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka

* Submitted to ASRU 2021 

  Access Paper or Ask Questions

Investigation of Practical Aspects of Single Channel Speech Separation for ASR


Jul 05, 2021
Jian Wu, Zhuo Chen, Sanyuan Chen, Yu Wu, Takuya Yoshioka, Naoyuki Kanda, Shujie Liu, Jinyu Li

* Accepted by Interspeech 2021 

  Access Paper or Ask Questions

Human Listening and Live Captioning: Multi-Task Training for Speech Enhancement


Jun 05, 2021
Sefik Emre Eskimez, Xiaofei Wang, Min Tang, Hemin Yang, Zirun Zhu, Zhuo Chen, Huaming Wang, Takuya Yoshioka

* Accepted to INTERSPEECH2021 

  Access Paper or Ask Questions

Large-Scale Pre-Training of End-to-End Multi-Talker ASR for Meeting Transcription with Single Distant Microphone


Apr 12, 2021
Naoyuki Kanda, Guoli Ye, Yu Wu, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka

* Submitted to INTERSPEECH 2021 

  Access Paper or Ask Questions

End-to-End Speaker-Attributed ASR with Transformer


Apr 05, 2021
Naoyuki Kanda, Guoli Ye, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka

* Submitted to INTERSPEECH 2021 

  Access Paper or Ask Questions

Continuous Speech Separation with Ad Hoc Microphone Arrays


Mar 03, 2021
Dongmei Wang, Takuya Yoshioka, Zhuo Chen, Xiaofei Wang, Tianyan Zhou, Zhong Meng


  Access Paper or Ask Questions

Hypothesis Stitcher for End-to-End Speaker-attributed ASR on Long-form Multi-talker Recordings


Jan 06, 2021
Xuankai Chang, Naoyuki Kanda, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Takuya Yoshioka

* Submitted to ICASSP 2021 

  Access Paper or Ask Questions

Minimum Bayes Risk Training for End-to-End Speaker-Attributed ASR


Nov 03, 2020
Naoyuki Kanda, Zhong Meng, Liang Lu, Yashesh Gaur, Xiaofei Wang, Zhuo Chen, Takuya Yoshioka

* Submitted to ICASSP 2021. arXiv admin note: text overlap with arXiv:2006.10930, arXiv:2008.04546 

  Access Paper or Ask Questions

Don't shoot butterfly with rifles: Multi-channel Continuous Speech Separation with Early Exit Transformer


Oct 23, 2020
Sanyuan Chen, Yu Wu, Zhuo Chen, Takuya Yoshioka, Shujie Liu, Jinyu Li


  Access Paper or Ask Questions

Investigation of End-To-End Speaker-Attributed ASR for Continuous Multi-Talker Recordings


Aug 11, 2020
Naoyuki Kanda, Xuankai Chang, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka


  Access Paper or Ask Questions

Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of Any Number of Speakers


Jun 19, 2020
Naoyuki Kanda, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Tianyan Zhou, Takuya Yoshioka

* Submitted to INTERSPEECH 2020 

  Access Paper or Ask Questions

CHiME-6 Challenge:Tackling Multispeaker Speech Recognition for Unsegmented Recordings


May 02, 2020
Shinji Watanabe, Michael Mandel, Jon Barker, Emmanuel Vincent, Ashish Arora, Xuankai Chang, Sanjeev Khudanpur, Vimal Manohar, Daniel Povey, Desh Raj, David Snyder, Aswin Shanmugam Subramanian, Jan Trmal, Bar Ben Yair, Christoph Boeddeker, Zhaoheng Ni, Yusuke Fujita, Shota Horiguchi, Naoyuki Kanda, Takuya Yoshioka, Neville Ryant


  Access Paper or Ask Questions

Neural Speech Separation Using Spatially Distributed Microphones


Apr 28, 2020
Dongmei Wang, Zhuo Chen, Takuya Yoshioka

* 5 pages, 2 figures, Interspeech2020 

  Access Paper or Ask Questions

Serialized Output Training for End-to-End Overlapped Speech Recognition


Mar 28, 2020
Naoyuki Kanda, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Takuya Yoshioka

* Submitted to INTERSPEECH 2020 

  Access Paper or Ask Questions

Continuous speech separation: dataset and analysis


Jan 30, 2020
Zhuo Chen, Takuya Yoshioka, Liang Lu, Tianyan Zhou, Zhong Meng, Yi Luo, Jian Wu, Jinyu Li


  Access Paper or Ask Questions

Advances in Online Audio-Visual Meeting Transcription


Dec 10, 2019
Takuya Yoshioka, Igor Abramovski, Cem Aksoylar, Zhuo Chen, Moshe David, Dimitrios Dimitriadis, Yifan Gong, Ilya Gurvich, Xuedong Huang, Yan Huang, Aviv Hurvitz, Li Jiang, Sharon Koubi, Eyal Krupka, Ido Leichter, Changliang Liu, Partha Parthasarathy, Alon Vinnikov, Lingfeng Wu, Xiong Xiao, Wayne Xiong, Huaming Wang, Zhenghao Wang, Jun Zhang, Yong Zhao, Tianyan Zhou

* To appear in Proc. IEEE ASRU Workshop 2019 

  Access Paper or Ask Questions

End-to-end Microphone Permutation and Number Invariant Multi-channel Speech Separation


Nov 26, 2019
Yi Luo, Zhuo Chen, Nima Mesgarani, Takuya Yoshioka


  Access Paper or Ask Questions

Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation


Oct 14, 2019
Yi Luo, Zhuo Chen, Takuya Yoshioka


  Access Paper or Ask Questions

DOVER: A Method for Combining Diarization Outputs


Sep 17, 2019
Andreas Stolcke, Takuya Yoshioka

* To appear in Proc. IEEE ASRU Workshop 2019 

  Access Paper or Ask Questions

Meeting Transcription Using Virtual Microphone Arrays


May 03, 2019
Takuya Yoshioka, Zhuo Chen, Dimitrios Dimitriadis, William Hinthorn, Xuedong Huang, Andreas Stolcke, Michael Zeng


  Access Paper or Ask Questions

Low-Latency Speaker-Independent Continuous Speech Separation


Apr 13, 2019
Takuya Yoshioka, Zhuo Chen, Changliang Liu, Xiong Xiao, Hakan Erdogan, Dimitrios Dimitriadis


  Access Paper or Ask Questions

Recognizing Overlapped Speech in Meetings: A Multichannel Separation Approach Using Neural Networks


Oct 08, 2018
Takuya Yoshioka, Hakan Erdogan, Zhuo Chen, Xiong Xiao, Fil Alleva

* Proc. Interspeech 2018, 3038-3042 

  Access Paper or Ask Questions