Keisuke Kinoshita

Neural Target Speech Extraction: An Overview

Jan 31, 2023
Via arXiv

On Word Error Rate Definitions and their Efficient Computation for Multi-Speaker Speech Recognition Systems

Nov 29, 2022
Via arXiv

Utterance-by-utterance overlap-aware neural diarization with Graph-PIT

Jul 28, 2022
Via arXiv

Strategies to Improve Robustness of Target Speech Extraction to Enrollment Variations

Jun 16, 2022
Via arXiv

Listen only to me! How well can target speech extraction handle false alarms?

Apr 11, 2022
Via arXiv

SoundBeam: Target sound extraction conditioned on sound-class labels and enrollment clues for increased performance and continuous learning

Apr 08, 2022
Via arXiv

Subjective intelligibility of speech sounds enhanced by ideal ratio mask via crowdsourced remote experiments with effective data screening

Mar 31, 2022
Via arXiv

Tight integration of neural- and clustering-based diarization through deep unfolding of infinite Gaussian mixture model

Feb 14, 2022
Via arXiv

Learning to Enhance or Not: Neural Network-Based Switching of Enhanced and Observed Signals for Overlapping Speech Recognition

Jan 11, 2022
Via arXiv

Switching Independent Vector Analysis and Its Extension to Blind and Spatially Guided Convolutional Beamforming Algorithm

Nov 20, 2021
Via arXiv