Takuya Yoshioka

Investigation of End-To-End Speaker-Attributed ASR for Continuous Multi-Talker Recordings

Aug 11, 2020

Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of Any Number of Speakers

Jun 19, 2020

CHiME-6 Challenge: Tackling Multispeaker Speech Recognition for Unsegmented Recordings

May 02, 2020

Neural Speech Separation Using Spatially Distributed Microphones

Apr 28, 2020

Serialized Output Training for End-to-End Overlapped Speech Recognition

Mar 28, 2020

Continuous speech separation: dataset and analysis

Jan 30, 2020

Advances in Online Audio-Visual Meeting Transcription

Dec 10, 2019

End-to-end Microphone Permutation and Number Invariant Multi-channel Speech Separation

Nov 26, 2019

Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation

Oct 14, 2019

DOVER: A Method for Combining Diarization Outputs

Sep 17, 2019