Picture for Tomohiro Nakatani

Tomohiro Nakatani

Blind and neural network-guided convolutional beamformer for joint denoising, dereverberation, and source separation

Add code
Aug 04, 2021
Figure 1 for Blind and neural network-guided convolutional beamformer for joint denoising, dereverberation, and source separation
Figure 2 for Blind and neural network-guided convolutional beamformer for joint denoising, dereverberation, and source separation
Viaarxiv icon

Independent Deeply Learned Tensor Analysis for Determined Audio Source Separation

Add code
Jun 10, 2021
Figure 1 for Independent Deeply Learned Tensor Analysis for Determined Audio Source Separation
Figure 2 for Independent Deeply Learned Tensor Analysis for Determined Audio Source Separation
Figure 3 for Independent Deeply Learned Tensor Analysis for Determined Audio Source Separation
Viaarxiv icon

PILOT: Introducing Transformers for Probabilistic Sound Event Localization

Add code
Jun 07, 2021
Figure 1 for PILOT: Introducing Transformers for Probabilistic Sound Event Localization
Figure 2 for PILOT: Introducing Transformers for Probabilistic Sound Event Localization
Figure 3 for PILOT: Introducing Transformers for Probabilistic Sound Event Localization
Figure 4 for PILOT: Introducing Transformers for Probabilistic Sound Event Localization
Viaarxiv icon

Comparison of remote experiments using crowdsourcing and laboratory experiments on speech intelligibility

Add code
Apr 17, 2021
Figure 1 for Comparison of remote experiments using crowdsourcing and laboratory experiments on speech intelligibility
Figure 2 for Comparison of remote experiments using crowdsourcing and laboratory experiments on speech intelligibility
Figure 3 for Comparison of remote experiments using crowdsourcing and laboratory experiments on speech intelligibility
Figure 4 for Comparison of remote experiments using crowdsourcing and laboratory experiments on speech intelligibility
Viaarxiv icon

Exploiting Attention-based Sequence-to-Sequence Architectures for Sound Event Localization

Add code
Feb 28, 2021
Figure 1 for Exploiting Attention-based Sequence-to-Sequence Architectures for Sound Event Localization
Figure 2 for Exploiting Attention-based Sequence-to-Sequence Architectures for Sound Event Localization
Figure 3 for Exploiting Attention-based Sequence-to-Sequence Architectures for Sound Event Localization
Figure 4 for Exploiting Attention-based Sequence-to-Sequence Architectures for Sound Event Localization
Viaarxiv icon

Data Fusion for Audiovisual Speaker Localization: Extending Dynamic Stream Weights to the Spatial Domain

Add code
Feb 24, 2021
Figure 1 for Data Fusion for Audiovisual Speaker Localization: Extending Dynamic Stream Weights to the Spatial Domain
Figure 2 for Data Fusion for Audiovisual Speaker Localization: Extending Dynamic Stream Weights to the Spatial Domain
Figure 3 for Data Fusion for Audiovisual Speaker Localization: Extending Dynamic Stream Weights to the Spatial Domain
Figure 4 for Data Fusion for Audiovisual Speaker Localization: Extending Dynamic Stream Weights to the Spatial Domain
Viaarxiv icon

End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend

Add code
Feb 23, 2021
Figure 1 for End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend
Figure 2 for End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend
Figure 3 for End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend
Viaarxiv icon

Speaker activity driven neural speech extraction

Add code
Feb 09, 2021
Figure 1 for Speaker activity driven neural speech extraction
Figure 2 for Speaker activity driven neural speech extraction
Figure 3 for Speaker activity driven neural speech extraction
Viaarxiv icon

Independent Vector Extraction for Joint Blind Source Separation and Dereverberation

Add code
Feb 09, 2021
Figure 1 for Independent Vector Extraction for Joint Blind Source Separation and Dereverberation
Figure 2 for Independent Vector Extraction for Joint Blind Source Separation and Dereverberation
Figure 3 for Independent Vector Extraction for Joint Blind Source Separation and Dereverberation
Viaarxiv icon

Multimodal Attention Fusion for Target Speaker Extraction

Add code
Feb 02, 2021
Figure 1 for Multimodal Attention Fusion for Target Speaker Extraction
Figure 2 for Multimodal Attention Fusion for Target Speaker Extraction
Figure 3 for Multimodal Attention Fusion for Target Speaker Extraction
Figure 4 for Multimodal Attention Fusion for Target Speaker Extraction
Viaarxiv icon