Alert button
Picture for Takuya Yoshioka

Takuya Yoshioka

Alert button

Minimum Bayes Risk Training for End-to-End Speaker-Attributed ASR

Add code
Bookmark button
Alert button
Nov 03, 2020
Naoyuki Kanda, Zhong Meng, Liang Lu, Yashesh Gaur, Xiaofei Wang, Zhuo Chen, Takuya Yoshioka

Figure 1 for Minimum Bayes Risk Training for End-to-End Speaker-Attributed ASR
Figure 2 for Minimum Bayes Risk Training for End-to-End Speaker-Attributed ASR
Figure 3 for Minimum Bayes Risk Training for End-to-End Speaker-Attributed ASR
Viaarxiv icon

Don't shoot butterfly with rifles: Multi-channel Continuous Speech Separation with Early Exit Transformer

Add code
Bookmark button
Alert button
Oct 23, 2020
Sanyuan Chen, Yu Wu, Zhuo Chen, Takuya Yoshioka, Shujie Liu, Jinyu Li

Figure 1 for Don't shoot butterfly with rifles: Multi-channel Continuous Speech Separation with Early Exit Transformer
Figure 2 for Don't shoot butterfly with rifles: Multi-channel Continuous Speech Separation with Early Exit Transformer
Figure 3 for Don't shoot butterfly with rifles: Multi-channel Continuous Speech Separation with Early Exit Transformer
Viaarxiv icon

Investigation of End-To-End Speaker-Attributed ASR for Continuous Multi-Talker Recordings

Add code
Bookmark button
Alert button
Aug 11, 2020
Naoyuki Kanda, Xuankai Chang, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka

Figure 1 for Investigation of End-To-End Speaker-Attributed ASR for Continuous Multi-Talker Recordings
Figure 2 for Investigation of End-To-End Speaker-Attributed ASR for Continuous Multi-Talker Recordings
Figure 3 for Investigation of End-To-End Speaker-Attributed ASR for Continuous Multi-Talker Recordings
Figure 4 for Investigation of End-To-End Speaker-Attributed ASR for Continuous Multi-Talker Recordings
Viaarxiv icon

Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of Any Number of Speakers

Add code
Bookmark button
Alert button
Jun 19, 2020
Naoyuki Kanda, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Tianyan Zhou, Takuya Yoshioka

Figure 1 for Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of Any Number of Speakers
Figure 2 for Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of Any Number of Speakers
Figure 3 for Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of Any Number of Speakers
Figure 4 for Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of Any Number of Speakers
Viaarxiv icon

CHiME-6 Challenge:Tackling Multispeaker Speech Recognition for Unsegmented Recordings

Add code
Bookmark button
Alert button
May 02, 2020
Shinji Watanabe, Michael Mandel, Jon Barker, Emmanuel Vincent, Ashish Arora, Xuankai Chang, Sanjeev Khudanpur, Vimal Manohar, Daniel Povey, Desh Raj, David Snyder, Aswin Shanmugam Subramanian, Jan Trmal, Bar Ben Yair, Christoph Boeddeker, Zhaoheng Ni, Yusuke Fujita, Shota Horiguchi, Naoyuki Kanda, Takuya Yoshioka, Neville Ryant

Figure 1 for CHiME-6 Challenge:Tackling Multispeaker Speech Recognition for Unsegmented Recordings
Figure 2 for CHiME-6 Challenge:Tackling Multispeaker Speech Recognition for Unsegmented Recordings
Figure 3 for CHiME-6 Challenge:Tackling Multispeaker Speech Recognition for Unsegmented Recordings
Figure 4 for CHiME-6 Challenge:Tackling Multispeaker Speech Recognition for Unsegmented Recordings
Viaarxiv icon

Neural Speech Separation Using Spatially Distributed Microphones

Add code
Bookmark button
Alert button
Apr 28, 2020
Dongmei Wang, Zhuo Chen, Takuya Yoshioka

Figure 1 for Neural Speech Separation Using Spatially Distributed Microphones
Figure 2 for Neural Speech Separation Using Spatially Distributed Microphones
Figure 3 for Neural Speech Separation Using Spatially Distributed Microphones
Viaarxiv icon

Serialized Output Training for End-to-End Overlapped Speech Recognition

Add code
Bookmark button
Alert button
Mar 28, 2020
Naoyuki Kanda, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Takuya Yoshioka

Figure 1 for Serialized Output Training for End-to-End Overlapped Speech Recognition
Figure 2 for Serialized Output Training for End-to-End Overlapped Speech Recognition
Figure 3 for Serialized Output Training for End-to-End Overlapped Speech Recognition
Figure 4 for Serialized Output Training for End-to-End Overlapped Speech Recognition
Viaarxiv icon

Continuous speech separation: dataset and analysis

Add code
Bookmark button
Alert button
Jan 30, 2020
Zhuo Chen, Takuya Yoshioka, Liang Lu, Tianyan Zhou, Zhong Meng, Yi Luo, Jian Wu, Jinyu Li

Figure 1 for Continuous speech separation: dataset and analysis
Figure 2 for Continuous speech separation: dataset and analysis
Figure 3 for Continuous speech separation: dataset and analysis
Figure 4 for Continuous speech separation: dataset and analysis
Viaarxiv icon

Advances in Online Audio-Visual Meeting Transcription

Add code
Bookmark button
Alert button
Dec 10, 2019
Takuya Yoshioka, Igor Abramovski, Cem Aksoylar, Zhuo Chen, Moshe David, Dimitrios Dimitriadis, Yifan Gong, Ilya Gurvich, Xuedong Huang, Yan Huang, Aviv Hurvitz, Li Jiang, Sharon Koubi, Eyal Krupka, Ido Leichter, Changliang Liu, Partha Parthasarathy, Alon Vinnikov, Lingfeng Wu, Xiong Xiao, Wayne Xiong, Huaming Wang, Zhenghao Wang, Jun Zhang, Yong Zhao, Tianyan Zhou

Figure 1 for Advances in Online Audio-Visual Meeting Transcription
Figure 2 for Advances in Online Audio-Visual Meeting Transcription
Figure 3 for Advances in Online Audio-Visual Meeting Transcription
Figure 4 for Advances in Online Audio-Visual Meeting Transcription
Viaarxiv icon

End-to-end Microphone Permutation and Number Invariant Multi-channel Speech Separation

Add code
Bookmark button
Alert button
Nov 26, 2019
Yi Luo, Zhuo Chen, Nima Mesgarani, Takuya Yoshioka

Figure 1 for End-to-end Microphone Permutation and Number Invariant Multi-channel Speech Separation
Figure 2 for End-to-end Microphone Permutation and Number Invariant Multi-channel Speech Separation
Figure 3 for End-to-end Microphone Permutation and Number Invariant Multi-channel Speech Separation
Viaarxiv icon