Dorien Herremans

Singapore University of Technology and Design

MelodySim: Measuring Melody-aware Music Similarity for Plagiarism Detection

May 27, 2025
Via arXiv

Text2midi-InferAlign: Improving Symbolic Music Generation with Inference-Time Alignment

May 19, 2025
Via arXiv

JamendoMaxCaps: A Large Scale Music-caption Dataset with Imputed Metadata

Feb 11, 2025
Via arXiv

ImprovNet: Generating Controllable Musical Improvisations with Iterative Corruption Refinement

Feb 06, 2025
Via arXiv

Towards Unified Music Emotion Recognition across Dimensional and Categorical Models

Feb 06, 2025
Via arXiv

Text2midi: Generating Symbolic Music from Captions

Dec 21, 2024
Via arXiv

MIRFLEX: Music Information Retrieval Feature Library for Extraction

Nov 01, 2024
Via arXiv

DART: Disentanglement of Accent and Speaker Representation in Multispeaker Text-to-Speech

Oct 17, 2024
Via arXiv

Leveraging LLM Embeddings for Cross Dataset Label Alignment and Zero Shot Music Emotion Prediction

Oct 15, 2024
Via arXiv

Prevailing Research Areas for Music AI in the Era of Foundation Models

Sep 14, 2024
Via arXiv