Picture for Katsutoshi Itoyama

Katsutoshi Itoyama

Department of Systems and Control Engineering, School of Engineering, Tokyo Institute of Technology, Tokyo, Japan, Honda Research Institute Japan Co., Ltd., Saitama, Japan

SLAM-based Joint Calibration of Multiple Asynchronous Microphone Arrays and Sound Source Localization

Add code
May 30, 2024
Viaarxiv icon

From Blurry to Brilliant Detection: YOLOv5-Based Aerial Object Detection with Super Resolution

Jan 26, 2024
Viaarxiv icon

Is the Ideal Ratio Mask Really the Best? -- Exploring the Best Extraction Performance and Optimal Mask of Mask-based Beamformers

Add code
Sep 21, 2023
Figure 1 for Is the Ideal Ratio Mask Really the Best? -- Exploring the Best Extraction Performance and Optimal Mask of Mask-based Beamformers
Figure 2 for Is the Ideal Ratio Mask Really the Best? -- Exploring the Best Extraction Performance and Optimal Mask of Mask-based Beamformers
Figure 3 for Is the Ideal Ratio Mask Really the Best? -- Exploring the Best Extraction Performance and Optimal Mask of Mask-based Beamformers
Figure 4 for Is the Ideal Ratio Mask Really the Best? -- Exploring the Best Extraction Performance and Optimal Mask of Mask-based Beamformers
Viaarxiv icon

Metric-based multimodal meta-learning for human movement identification via footstep recognition

Nov 15, 2021
Figure 1 for Metric-based multimodal meta-learning for human movement identification via footstep recognition
Figure 2 for Metric-based multimodal meta-learning for human movement identification via footstep recognition
Figure 3 for Metric-based multimodal meta-learning for human movement identification via footstep recognition
Figure 4 for Metric-based multimodal meta-learning for human movement identification via footstep recognition
Viaarxiv icon

Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition

Mar 31, 2019
Figure 1 for Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition
Figure 2 for Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition
Figure 3 for Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition
Figure 4 for Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition
Viaarxiv icon

Statistical Speech Enhancement Based on Probabilistic Integration of Variational Autoencoder and Non-Negative Matrix Factorization

Mar 19, 2018
Figure 1 for Statistical Speech Enhancement Based on Probabilistic Integration of Variational Autoencoder and Non-Negative Matrix Factorization
Figure 2 for Statistical Speech Enhancement Based on Probabilistic Integration of Variational Autoencoder and Non-Negative Matrix Factorization
Figure 3 for Statistical Speech Enhancement Based on Probabilistic Integration of Variational Autoencoder and Non-Negative Matrix Factorization
Figure 4 for Statistical Speech Enhancement Based on Probabilistic Integration of Variational Autoencoder and Non-Negative Matrix Factorization
Viaarxiv icon

Generative Statistical Models with Self-Emergent Grammar of Chord Sequences

Mar 02, 2018
Figure 1 for Generative Statistical Models with Self-Emergent Grammar of Chord Sequences
Figure 2 for Generative Statistical Models with Self-Emergent Grammar of Chord Sequences
Figure 3 for Generative Statistical Models with Self-Emergent Grammar of Chord Sequences
Figure 4 for Generative Statistical Models with Self-Emergent Grammar of Chord Sequences
Viaarxiv icon