Music Generation


Music generation is the task of producing music or music-like sounds with a model or algorithm.

SmoothSinger: A Conditional Diffusion Model for Singing Voice Synthesis with Multi-Resolution Architecture

Jun 26, 2025
Via arXiv

Benchmarking Music Generation Models and Metrics via Human Preference Studies

Jun 23, 2025
Via arXiv

Let Your Video Listen to Your Music!

Jun 23, 2025
Via arXiv

LEGATO: Large-scale End-to-end Generalizable Approach to Typeset OMR

Jun 23, 2025
Via arXiv

Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generation

Jun 24, 2025
Via arXiv

SLEEPING-DISCO 9M: A large-scale pre-training dataset for generative music modeling

Jun 17, 2025
Via arXiv

Diff-TONE: Timestep Optimization for iNstrument Editing in Text-to-Music Diffusion Models

Jun 18, 2025
Via arXiv

SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning

Jun 18, 2025
Via arXiv

Adaptive Accompaniment with ReaLchords

Jun 17, 2025
Via arXiv

Personalizable Long-Context Symbolic Music Infilling with MIDI-RWKV

Jun 16, 2025
Via arXiv