Picture for Takuya Yoshioka

Takuya Yoshioka

Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers using End-to-End Speaker-Attributed ASR

Add code
Oct 07, 2021
Figure 1 for Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers using End-to-End Speaker-Attributed ASR
Figure 2 for Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers using End-to-End Speaker-Attributed ASR
Figure 3 for Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers using End-to-End Speaker-Attributed ASR
Viaarxiv icon

A Comparative Study of Modular and Joint Approaches for Speaker-Attributed ASR on Monaural Long-Form Audio

Add code
Jul 06, 2021
Figure 1 for A Comparative Study of Modular and Joint Approaches for Speaker-Attributed ASR on Monaural Long-Form Audio
Figure 2 for A Comparative Study of Modular and Joint Approaches for Speaker-Attributed ASR on Monaural Long-Form Audio
Figure 3 for A Comparative Study of Modular and Joint Approaches for Speaker-Attributed ASR on Monaural Long-Form Audio
Figure 4 for A Comparative Study of Modular and Joint Approaches for Speaker-Attributed ASR on Monaural Long-Form Audio
Viaarxiv icon

Investigation of Practical Aspects of Single Channel Speech Separation for ASR

Add code
Jul 05, 2021
Figure 1 for Investigation of Practical Aspects of Single Channel Speech Separation for ASR
Figure 2 for Investigation of Practical Aspects of Single Channel Speech Separation for ASR
Figure 3 for Investigation of Practical Aspects of Single Channel Speech Separation for ASR
Viaarxiv icon

Human Listening and Live Captioning: Multi-Task Training for Speech Enhancement

Add code
Jun 05, 2021
Figure 1 for Human Listening and Live Captioning: Multi-Task Training for Speech Enhancement
Figure 2 for Human Listening and Live Captioning: Multi-Task Training for Speech Enhancement
Figure 3 for Human Listening and Live Captioning: Multi-Task Training for Speech Enhancement
Viaarxiv icon

Large-Scale Pre-Training of End-to-End Multi-Talker ASR for Meeting Transcription with Single Distant Microphone

Add code
Apr 12, 2021
Figure 1 for Large-Scale Pre-Training of End-to-End Multi-Talker ASR for Meeting Transcription with Single Distant Microphone
Figure 2 for Large-Scale Pre-Training of End-to-End Multi-Talker ASR for Meeting Transcription with Single Distant Microphone
Figure 3 for Large-Scale Pre-Training of End-to-End Multi-Talker ASR for Meeting Transcription with Single Distant Microphone
Viaarxiv icon

End-to-End Speaker-Attributed ASR with Transformer

Add code
Apr 05, 2021
Figure 1 for End-to-End Speaker-Attributed ASR with Transformer
Figure 2 for End-to-End Speaker-Attributed ASR with Transformer
Figure 3 for End-to-End Speaker-Attributed ASR with Transformer
Figure 4 for End-to-End Speaker-Attributed ASR with Transformer
Viaarxiv icon

Continuous Speech Separation with Ad Hoc Microphone Arrays

Add code
Mar 03, 2021
Figure 1 for Continuous Speech Separation with Ad Hoc Microphone Arrays
Figure 2 for Continuous Speech Separation with Ad Hoc Microphone Arrays
Figure 3 for Continuous Speech Separation with Ad Hoc Microphone Arrays
Figure 4 for Continuous Speech Separation with Ad Hoc Microphone Arrays
Viaarxiv icon

Hypothesis Stitcher for End-to-End Speaker-attributed ASR on Long-form Multi-talker Recordings

Add code
Jan 06, 2021
Figure 1 for Hypothesis Stitcher for End-to-End Speaker-attributed ASR on Long-form Multi-talker Recordings
Figure 2 for Hypothesis Stitcher for End-to-End Speaker-attributed ASR on Long-form Multi-talker Recordings
Viaarxiv icon

Minimum Bayes Risk Training for End-to-End Speaker-Attributed ASR

Add code
Nov 03, 2020
Figure 1 for Minimum Bayes Risk Training for End-to-End Speaker-Attributed ASR
Figure 2 for Minimum Bayes Risk Training for End-to-End Speaker-Attributed ASR
Figure 3 for Minimum Bayes Risk Training for End-to-End Speaker-Attributed ASR
Viaarxiv icon

Don't shoot butterfly with rifles: Multi-channel Continuous Speech Separation with Early Exit Transformer

Add code
Oct 23, 2020
Figure 1 for Don't shoot butterfly with rifles: Multi-channel Continuous Speech Separation with Early Exit Transformer
Figure 2 for Don't shoot butterfly with rifles: Multi-channel Continuous Speech Separation with Early Exit Transformer
Figure 3 for Don't shoot butterfly with rifles: Multi-channel Continuous Speech Separation with Early Exit Transformer
Viaarxiv icon