Picture for Mark D. Plumbley

Mark D. Plumbley

BioDCASE 2026 Challenge Baseline for Cross-Domain Mosquito Species Classification

Add code
Mar 20, 2026
Viaarxiv icon

Summary of The Inaugural Music Source Restoration Challenge

Add code
Jan 07, 2026
Viaarxiv icon

Region-Specific Audio Tagging for Spatial Sound

Add code
Sep 11, 2025
Viaarxiv icon

Towards Reliable Objective Evaluation Metrics for Generative Singing Voice Separation Models

Add code
Jul 15, 2025
Figure 1 for Towards Reliable Objective Evaluation Metrics for Generative Singing Voice Separation Models
Figure 2 for Towards Reliable Objective Evaluation Metrics for Generative Singing Voice Separation Models
Figure 3 for Towards Reliable Objective Evaluation Metrics for Generative Singing Voice Separation Models
Viaarxiv icon

Music Source Restoration

Add code
May 27, 2025
Viaarxiv icon

Exploring the User Experience of AI-Assisted Sound Searching Systems for Creative Workflows

Add code
Apr 22, 2025
Figure 1 for Exploring the User Experience of AI-Assisted Sound Searching Systems for Creative Workflows
Figure 2 for Exploring the User Experience of AI-Assisted Sound Searching Systems for Creative Workflows
Figure 3 for Exploring the User Experience of AI-Assisted Sound Searching Systems for Creative Workflows
Figure 4 for Exploring the User Experience of AI-Assisted Sound Searching Systems for Creative Workflows
Viaarxiv icon

AudioSetCaps: An Enriched Audio-Caption Dataset using Automated Generation Pipeline with Large Audio and Language Models

Add code
Nov 28, 2024
Figure 1 for AudioSetCaps: An Enriched Audio-Caption Dataset using Automated Generation Pipeline with Large Audio and Language Models
Figure 2 for AudioSetCaps: An Enriched Audio-Caption Dataset using Automated Generation Pipeline with Large Audio and Language Models
Figure 3 for AudioSetCaps: An Enriched Audio-Caption Dataset using Automated Generation Pipeline with Large Audio and Language Models
Figure 4 for AudioSetCaps: An Enriched Audio-Caption Dataset using Automated Generation Pipeline with Large Audio and Language Models
Viaarxiv icon

PSELDNets: Pre-trained Neural Networks on Large-scale Synthetic Datasets for Sound Event Localization and Detection

Add code
Nov 10, 2024
Figure 1 for PSELDNets: Pre-trained Neural Networks on Large-scale Synthetic Datasets for Sound Event Localization and Detection
Figure 2 for PSELDNets: Pre-trained Neural Networks on Large-scale Synthetic Datasets for Sound Event Localization and Detection
Figure 3 for PSELDNets: Pre-trained Neural Networks on Large-scale Synthetic Datasets for Sound Event Localization and Detection
Figure 4 for PSELDNets: Pre-trained Neural Networks on Large-scale Synthetic Datasets for Sound Event Localization and Detection
Viaarxiv icon

A decade of DCASE: Achievements, practices, evaluations and future challenges

Add code
Oct 07, 2024
Viaarxiv icon

The Sounds of Home: A Speech-Removed Residential Audio Dataset for Sound Event Detection

Add code
Sep 17, 2024
Figure 1 for The Sounds of Home: A Speech-Removed Residential Audio Dataset for Sound Event Detection
Figure 2 for The Sounds of Home: A Speech-Removed Residential Audio Dataset for Sound Event Detection
Figure 3 for The Sounds of Home: A Speech-Removed Residential Audio Dataset for Sound Event Detection
Figure 4 for The Sounds of Home: A Speech-Removed Residential Audio Dataset for Sound Event Detection
Viaarxiv icon