Picture for Dorien Herremans

Dorien Herremans

Singapore University of Technology and Design

JAM: A Tiny Flow-based Song Generator with Fine-grained Controllability and Aesthetic Alignment

Add code
Jul 28, 2025
Viaarxiv icon

SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning

Add code
Jun 18, 2025
Viaarxiv icon

MelodySim: Measuring Melody-aware Music Similarity for Plagiarism Detection

Add code
May 27, 2025
Viaarxiv icon

Text2midi-InferAlign: Improving Symbolic Music Generation with Inference-Time Alignment

Add code
May 19, 2025
Viaarxiv icon

JamendoMaxCaps: A Large Scale Music-caption Dataset with Imputed Metadata

Add code
Feb 11, 2025
Viaarxiv icon

ImprovNet: Generating Controllable Musical Improvisations with Iterative Corruption Refinement

Add code
Feb 06, 2025
Viaarxiv icon

Towards Unified Music Emotion Recognition across Dimensional and Categorical Models

Add code
Feb 06, 2025
Viaarxiv icon

Text2midi: Generating Symbolic Music from Captions

Add code
Dec 21, 2024
Viaarxiv icon

MIRFLEX: Music Information Retrieval Feature Library for Extraction

Add code
Nov 01, 2024
Figure 1 for MIRFLEX: Music Information Retrieval Feature Library for Extraction
Figure 2 for MIRFLEX: Music Information Retrieval Feature Library for Extraction
Figure 3 for MIRFLEX: Music Information Retrieval Feature Library for Extraction
Figure 4 for MIRFLEX: Music Information Retrieval Feature Library for Extraction
Viaarxiv icon

DART: Disentanglement of Accent and Speaker Representation in Multispeaker Text-to-Speech

Add code
Oct 17, 2024
Figure 1 for DART: Disentanglement of Accent and Speaker Representation in Multispeaker Text-to-Speech
Figure 2 for DART: Disentanglement of Accent and Speaker Representation in Multispeaker Text-to-Speech
Figure 3 for DART: Disentanglement of Accent and Speaker Representation in Multispeaker Text-to-Speech
Figure 4 for DART: Disentanglement of Accent and Speaker Representation in Multispeaker Text-to-Speech
Viaarxiv icon