Music Generation


Music generation is the task of producing music or music-like audio with a model or algorithm.

Audio Flamingo Next: Next-Generation Open Audio-Language Models for Speech, Sound, and Music

Apr 13, 2026

Towards Real-Time Human-AI Musical Co-Performance: Accompaniment Generation with Latent Diffusion Models and MAX/MSP

Apr 08, 2026

From Image to Music Language: A Two-Stage Structure Decoding Approach for Complex Polyphonic OMR

Apr 22, 2026

HalluAudio: A Comprehensive Benchmark for Hallucination Detection in Large Audio-Language Models

Apr 21, 2026

Audio-Omni: Extending Multi-modal Understanding to Versatile Audio Generation and Editing

Apr 12, 2026

Multimodal Large Language Models for Multi-Subject In-Context Image Generation

Apr 08, 2026

Diff-VS: Efficient Audio-Aware Diffusion U-Net for Vocals Separation

Apr 01, 2026

TokenDance: Token-to-Token Music-to-Dance Generation with Bidirectional Mamba

Mar 28, 2026

Hidden Biases in Conditioning Autoregressive Models

Apr 09, 2026

AT-ADD: All-Type Audio Deepfake Detection Challenge Evaluation Plan

Apr 09, 2026