Alert button
Picture for Naoyuki Kanda

Naoyuki Kanda

Alert button

A Review of Speaker Diarization: Recent Advances with Deep Learning

Add code
Bookmark button
Alert button
Jan 24, 2021
Tae Jin Park, Naoyuki Kanda, Dimitrios Dimitriadis, Kyu J. Han, Shinji Watanabe, Shrikanth Narayanan

Figure 1 for A Review of Speaker Diarization: Recent Advances with Deep Learning
Figure 2 for A Review of Speaker Diarization: Recent Advances with Deep Learning
Figure 3 for A Review of Speaker Diarization: Recent Advances with Deep Learning
Figure 4 for A Review of Speaker Diarization: Recent Advances with Deep Learning
Viaarxiv icon

Hypothesis Stitcher for End-to-End Speaker-attributed ASR on Long-form Multi-talker Recordings

Add code
Bookmark button
Alert button
Jan 06, 2021
Xuankai Chang, Naoyuki Kanda, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Takuya Yoshioka

Figure 1 for Hypothesis Stitcher for End-to-End Speaker-attributed ASR on Long-form Multi-talker Recordings
Figure 2 for Hypothesis Stitcher for End-to-End Speaker-attributed ASR on Long-form Multi-talker Recordings
Viaarxiv icon

Streaming end-to-end multi-talker speech recognition

Add code
Bookmark button
Alert button
Nov 26, 2020
Liang Lu, Naoyuki Kanda, Jinyu Li, Yifan Gong

Figure 1 for Streaming end-to-end multi-talker speech recognition
Figure 2 for Streaming end-to-end multi-talker speech recognition
Figure 3 for Streaming end-to-end multi-talker speech recognition
Figure 4 for Streaming end-to-end multi-talker speech recognition
Viaarxiv icon

Minimum Bayes Risk Training for End-to-End Speaker-Attributed ASR

Add code
Bookmark button
Alert button
Nov 03, 2020
Naoyuki Kanda, Zhong Meng, Liang Lu, Yashesh Gaur, Xiaofei Wang, Zhuo Chen, Takuya Yoshioka

Figure 1 for Minimum Bayes Risk Training for End-to-End Speaker-Attributed ASR
Figure 2 for Minimum Bayes Risk Training for End-to-End Speaker-Attributed ASR
Figure 3 for Minimum Bayes Risk Training for End-to-End Speaker-Attributed ASR
Viaarxiv icon

Internal Language Model Estimation for Domain-Adaptive End-to-End Speech Recognition

Add code
Bookmark button
Alert button
Nov 03, 2020
Zhong Meng, Sarangarajan Parthasarathy, Eric Sun, Yashesh Gaur, Naoyuki Kanda, Liang Lu, Xie Chen, Rui Zhao, Jinyu Li, Yifan Gong

Figure 1 for Internal Language Model Estimation for Domain-Adaptive End-to-End Speech Recognition
Figure 2 for Internal Language Model Estimation for Domain-Adaptive End-to-End Speech Recognition
Figure 3 for Internal Language Model Estimation for Domain-Adaptive End-to-End Speech Recognition
Figure 4 for Internal Language Model Estimation for Domain-Adaptive End-to-End Speech Recognition
Viaarxiv icon

On Minimum Word Error Rate Training of the Hybrid Autoregressive Transducer

Add code
Bookmark button
Alert button
Oct 23, 2020
Liang Lu, Zhong Meng, Naoyuki Kanda, Jinyu Li, Yifan Gong

Figure 1 for On Minimum Word Error Rate Training of the Hybrid Autoregressive Transducer
Figure 2 for On Minimum Word Error Rate Training of the Hybrid Autoregressive Transducer
Figure 3 for On Minimum Word Error Rate Training of the Hybrid Autoregressive Transducer
Figure 4 for On Minimum Word Error Rate Training of the Hybrid Autoregressive Transducer
Viaarxiv icon

Investigation of End-To-End Speaker-Attributed ASR for Continuous Multi-Talker Recordings

Add code
Bookmark button
Alert button
Aug 11, 2020
Naoyuki Kanda, Xuankai Chang, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka

Figure 1 for Investigation of End-To-End Speaker-Attributed ASR for Continuous Multi-Talker Recordings
Figure 2 for Investigation of End-To-End Speaker-Attributed ASR for Continuous Multi-Talker Recordings
Figure 3 for Investigation of End-To-End Speaker-Attributed ASR for Continuous Multi-Talker Recordings
Figure 4 for Investigation of End-To-End Speaker-Attributed ASR for Continuous Multi-Talker Recordings
Viaarxiv icon

Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of Any Number of Speakers

Add code
Bookmark button
Alert button
Jun 19, 2020
Naoyuki Kanda, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Tianyan Zhou, Takuya Yoshioka

Figure 1 for Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of Any Number of Speakers
Figure 2 for Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of Any Number of Speakers
Figure 3 for Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of Any Number of Speakers
Figure 4 for Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of Any Number of Speakers
Viaarxiv icon

CHiME-6 Challenge:Tackling Multispeaker Speech Recognition for Unsegmented Recordings

Add code
Bookmark button
Alert button
May 02, 2020
Shinji Watanabe, Michael Mandel, Jon Barker, Emmanuel Vincent, Ashish Arora, Xuankai Chang, Sanjeev Khudanpur, Vimal Manohar, Daniel Povey, Desh Raj, David Snyder, Aswin Shanmugam Subramanian, Jan Trmal, Bar Ben Yair, Christoph Boeddeker, Zhaoheng Ni, Yusuke Fujita, Shota Horiguchi, Naoyuki Kanda, Takuya Yoshioka, Neville Ryant

Figure 1 for CHiME-6 Challenge:Tackling Multispeaker Speech Recognition for Unsegmented Recordings
Figure 2 for CHiME-6 Challenge:Tackling Multispeaker Speech Recognition for Unsegmented Recordings
Figure 3 for CHiME-6 Challenge:Tackling Multispeaker Speech Recognition for Unsegmented Recordings
Figure 4 for CHiME-6 Challenge:Tackling Multispeaker Speech Recognition for Unsegmented Recordings
Viaarxiv icon

Serialized Output Training for End-to-End Overlapped Speech Recognition

Add code
Bookmark button
Alert button
Mar 28, 2020
Naoyuki Kanda, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Takuya Yoshioka

Figure 1 for Serialized Output Training for End-to-End Overlapped Speech Recognition
Figure 2 for Serialized Output Training for End-to-End Overlapped Speech Recognition
Figure 3 for Serialized Output Training for End-to-End Overlapped Speech Recognition
Figure 4 for Serialized Output Training for End-to-End Overlapped Speech Recognition
Viaarxiv icon