Keisuke Imoto

Prosodically Enhanced Foreign Accent Simulation by Discrete Token-based Resynthesis Only with Native Speech Corpora

May 22, 2025
Via arXiv

Discrete Tokens Exhibit Interlanguage Speech Intelligibility Benefit: an Analytical Study Towards Accent-robust ASR Only with Native Speech Data

May 22, 2025
Via arXiv

Formula-Supervised Sound Event Detection: Pre-Training Without Real Data

Apr 06, 2025
Via arXiv

Handling Domain Shifts for Anomalous Sound Detection: A Review of DCASE-Related Work

Mar 13, 2025
Via arXiv

Sound Scene Synthesis at the DCASE 2024 Challenge

Jan 15, 2025
Via arXiv

Trainingless Adaptation of Pretrained Models for Environmental Sound Classification

Dec 23, 2024
Via arXiv

DOA-Aware Audio-Visual Self-Supervised Learning for Sound Event Localization and Detection

Oct 30, 2024
Via arXiv

Challenge on Sound Scene Synthesis: Evaluating Text-to-Audio Generation

Oct 23, 2024
Via arXiv

Construction and Analysis of Impression Caption Dataset for Environmental Sounds

Oct 20, 2024
Via arXiv

LEAD Dataset: How Can Labels for Sound Event Detection Vary Depending on Annotators?

Oct 13, 2024
Via arXiv