music generation


Music generation is the task of generating music or music-like sounds from a model or algorithm.

DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment

Add code
Jul 03, 2025
Viaarxiv icon

User-guided Generative Source Separation

Add code
Jul 02, 2025
Viaarxiv icon

Benchmarking Music Generation Models and Metrics via Human Preference Studies

Add code
Jun 23, 2025
Viaarxiv icon

SmoothSinger: A Conditional Diffusion Model for Singing Voice Synthesis with Multi-Resolution Architecture

Add code
Jun 26, 2025
Viaarxiv icon

Let Your Video Listen to Your Music!

Add code
Jun 23, 2025
Viaarxiv icon

LEGATO: Large-scale End-to-end Generalizable Approach to Typeset OMR

Add code
Jun 23, 2025
Viaarxiv icon

Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generation

Add code
Jun 24, 2025
Viaarxiv icon

SLEEPING-DISCO 9M: A large-scale pre-training dataset for generative music modeling

Add code
Jun 17, 2025
Viaarxiv icon

Diff-TONE: Timestep Optimization for iNstrument Editing in Text-to-Music Diffusion Models

Add code
Jun 18, 2025
Viaarxiv icon

SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning

Add code
Jun 18, 2025
Viaarxiv icon