Picture for Dorien Herremans

Dorien Herremans

Singapore University of Technology and Design

SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning

Add code
Jun 18, 2025
Viaarxiv icon

MelodySim: Measuring Melody-aware Music Similarity for Plagiarism Detection

Add code
May 27, 2025
Viaarxiv icon

Text2midi-InferAlign: Improving Symbolic Music Generation with Inference-Time Alignment

Add code
May 19, 2025
Viaarxiv icon

JamendoMaxCaps: A Large Scale Music-caption Dataset with Imputed Metadata

Add code
Feb 11, 2025
Viaarxiv icon

Towards Unified Music Emotion Recognition across Dimensional and Categorical Models

Add code
Feb 06, 2025
Viaarxiv icon

ImprovNet: Generating Controllable Musical Improvisations with Iterative Corruption Refinement

Add code
Feb 06, 2025
Viaarxiv icon

Text2midi: Generating Symbolic Music from Captions

Add code
Dec 21, 2024
Viaarxiv icon

MIRFLEX: Music Information Retrieval Feature Library for Extraction

Add code
Nov 01, 2024
Figure 1 for MIRFLEX: Music Information Retrieval Feature Library for Extraction
Figure 2 for MIRFLEX: Music Information Retrieval Feature Library for Extraction
Figure 3 for MIRFLEX: Music Information Retrieval Feature Library for Extraction
Figure 4 for MIRFLEX: Music Information Retrieval Feature Library for Extraction
Viaarxiv icon

DART: Disentanglement of Accent and Speaker Representation in Multispeaker Text-to-Speech

Add code
Oct 17, 2024
Figure 1 for DART: Disentanglement of Accent and Speaker Representation in Multispeaker Text-to-Speech
Figure 2 for DART: Disentanglement of Accent and Speaker Representation in Multispeaker Text-to-Speech
Figure 3 for DART: Disentanglement of Accent and Speaker Representation in Multispeaker Text-to-Speech
Figure 4 for DART: Disentanglement of Accent and Speaker Representation in Multispeaker Text-to-Speech
Viaarxiv icon

Leveraging LLM Embeddings for Cross Dataset Label Alignment and Zero Shot Music Emotion Prediction

Add code
Oct 15, 2024
Figure 1 for Leveraging LLM Embeddings for Cross Dataset Label Alignment and Zero Shot Music Emotion Prediction
Figure 2 for Leveraging LLM Embeddings for Cross Dataset Label Alignment and Zero Shot Music Emotion Prediction
Figure 3 for Leveraging LLM Embeddings for Cross Dataset Label Alignment and Zero Shot Music Emotion Prediction
Figure 4 for Leveraging LLM Embeddings for Cross Dataset Label Alignment and Zero Shot Music Emotion Prediction
Viaarxiv icon