Picture for Keisuke Imoto

Keisuke Imoto

Description and Discussion on DCASE 2024 Challenge Task 2: First-Shot Unsupervised Anomalous Sound Detection for Machine Condition Monitoring

Add code
Jun 11, 2024
Viaarxiv icon

M2D-CLAP: Masked Modeling Duo Meets CLAP for Learning General-purpose Audio-Language Representation

Add code
Jun 04, 2024
Figure 1 for M2D-CLAP: Masked Modeling Duo Meets CLAP for Learning General-purpose Audio-Language Representation
Figure 2 for M2D-CLAP: Masked Modeling Duo Meets CLAP for Learning General-purpose Audio-Language Representation
Figure 3 for M2D-CLAP: Masked Modeling Duo Meets CLAP for Learning General-purpose Audio-Language Representation
Figure 4 for M2D-CLAP: Masked Modeling Duo Meets CLAP for Learning General-purpose Audio-Language Representation
Viaarxiv icon

Correlation of Fréchet Audio Distance With Human Perception of Environmental Audio Is Embedding Dependant

Add code
Mar 26, 2024
Figure 1 for Correlation of Fréchet Audio Distance With Human Perception of Environmental Audio Is Embedding Dependant
Figure 2 for Correlation of Fréchet Audio Distance With Human Perception of Environmental Audio Is Embedding Dependant
Figure 3 for Correlation of Fréchet Audio Distance With Human Perception of Environmental Audio Is Embedding Dependant
Figure 4 for Correlation of Fréchet Audio Distance With Human Perception of Environmental Audio Is Embedding Dependant
Viaarxiv icon

Discriminative Neighborhood Smoothing for Generative Anomalous Sound Detection

Add code
Mar 18, 2024
Figure 1 for Discriminative Neighborhood Smoothing for Generative Anomalous Sound Detection
Figure 2 for Discriminative Neighborhood Smoothing for Generative Anomalous Sound Detection
Figure 3 for Discriminative Neighborhood Smoothing for Generative Anomalous Sound Detection
Figure 4 for Discriminative Neighborhood Smoothing for Generative Anomalous Sound Detection
Viaarxiv icon

Refining Knowledge Transfer on Audio-Image Temporal Agreement for Audio-Text Cross Retrieval

Add code
Mar 16, 2024
Figure 1 for Refining Knowledge Transfer on Audio-Image Temporal Agreement for Audio-Text Cross Retrieval
Figure 2 for Refining Knowledge Transfer on Audio-Image Temporal Agreement for Audio-Text Cross Retrieval
Figure 3 for Refining Knowledge Transfer on Audio-Image Temporal Agreement for Audio-Text Cross Retrieval
Figure 4 for Refining Knowledge Transfer on Audio-Image Temporal Agreement for Audio-Text Cross Retrieval
Viaarxiv icon

F1-EV Score: Measuring the Likelihood of Estimating a Good Decision Threshold for Semi-Supervised Anomaly Detection

Add code
Dec 14, 2023
Figure 1 for F1-EV Score: Measuring the Likelihood of Estimating a Good Decision Threshold for Semi-Supervised Anomaly Detection
Figure 2 for F1-EV Score: Measuring the Likelihood of Estimating a Good Decision Threshold for Semi-Supervised Anomaly Detection
Figure 3 for F1-EV Score: Measuring the Likelihood of Estimating a Good Decision Threshold for Semi-Supervised Anomaly Detection
Figure 4 for F1-EV Score: Measuring the Likelihood of Estimating a Good Decision Threshold for Semi-Supervised Anomaly Detection
Viaarxiv icon

CAPTDURE: Captioned Sound Dataset of Single Sources

Add code
May 28, 2023
Figure 1 for CAPTDURE: Captioned Sound Dataset of Single Sources
Figure 2 for CAPTDURE: Captioned Sound Dataset of Single Sources
Figure 3 for CAPTDURE: Captioned Sound Dataset of Single Sources
Figure 4 for CAPTDURE: Captioned Sound Dataset of Single Sources
Viaarxiv icon

Description and Discussion on DCASE 2023 Challenge Task 2: First-Shot Unsupervised Anomalous Sound Detection for Machine Condition Monitoring

Add code
May 13, 2023
Figure 1 for Description and Discussion on DCASE 2023 Challenge Task 2: First-Shot Unsupervised Anomalous Sound Detection for Machine Condition Monitoring
Viaarxiv icon

Environmental sound conversion from vocal imitations and sound event labels

Add code
Apr 29, 2023
Figure 1 for Environmental sound conversion from vocal imitations and sound event labels
Figure 2 for Environmental sound conversion from vocal imitations and sound event labels
Figure 3 for Environmental sound conversion from vocal imitations and sound event labels
Figure 4 for Environmental sound conversion from vocal imitations and sound event labels
Viaarxiv icon

Foley Sound Synthesis at the DCASE 2023 Challenge

Add code
Apr 26, 2023
Figure 1 for Foley Sound Synthesis at the DCASE 2023 Challenge
Viaarxiv icon