Masahiro Yasuda

Guided Masked Self-Distillation Modeling for Distributed Multimedia Sensor Event Analysis

Apr 12, 2024
Masahiro Yasuda, Noboru Harada, Yasunori Ohishi, Shoichiro Saito, Akira Nakayama, Nobutaka Ono

6DoF SELD: Sound Event Localization and Detection Using Microphones and Motion Tracking Sensors on self-motioning human

Mar 04, 2024
Masahiro Yasuda, Shoichiro Saito, Akira Nakayama, Noboru Harada

First-shot anomaly sound detection for machine condition monitoring: A domain generalization baseline

Mar 01, 2023
Noboru Harada, Daisuke Niizumi, Yasunori Ohishi, Daiki Takeuchi, Masahiro Yasuda

Multi-view and Multi-modal Event Detection Utilizing Transformer-based Multi-sensor fusion

Feb 18, 2022
Masahiro Yasuda, Yasunori Ohishi, Shoichiro Saito, Noboru Harada

Echo-aware Adaptation of Sound Event Localization and Detection in Unknown Environments

Feb 18, 2022
Masahiro Yasuda, Yasunori Ohishi, Shoichiro Saito

Wearable SELD dataset: Dataset for sound event localization and detection using wearable devices around head

Feb 17, 2022
Kento Nagatomo, Masahiro Yasuda, Kohei Yatabe, Shoichiro Saito, Yasuhiro Oikawa

APPLADE: Adjustable Plug-and-play Audio Declipper Combining DNN with Sparse Optimization

Feb 16, 2022
Tomoro Tanaka, Kohei Yatabe, Masahiro Yasuda, Yasuhiro Oikawa

ToyADMOS2: Another dataset of miniature-machine operating sounds for anomalous sound detection under domain shift conditions

Jun 04, 2021
Noboru Harada, Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Masahiro Yasuda, Shoichiro Saito

Audio Captioning using Pre-Trained Large-Scale Language Model Guided by Audio-based Similar Caption Retrieval

Dec 14, 2020
Yuma Koizumi, Yasunori Ohishi, Daisuke Niizumi, Daiki Takeuchi, Masahiro Yasuda

A Transformer-based Audio Captioning Model with Keyword Estimation

Jul 01, 2020
Yuma Koizumi, Ryo Masumura, Kyosuke Nishida, Masahiro Yasuda, Shoichiro Saito
