music generation


Music generation is the task of generating music or music-like sounds from a model or algorithm.

Video Object Segmentation-Aware Audio Generation

Add code
Sep 30, 2025
Figure 1 for Video Object Segmentation-Aware Audio Generation
Figure 2 for Video Object Segmentation-Aware Audio Generation
Figure 3 for Video Object Segmentation-Aware Audio Generation
Figure 4 for Video Object Segmentation-Aware Audio Generation
Viaarxiv icon

ACMID: Automatic Curation of Musical Instrument Dataset for 7-Stem Music Source Separation

Add code
Oct 09, 2025
Viaarxiv icon

Amadeus: Autoregressive Model with Bidirectional Attribute Modelling for Symbolic Music

Add code
Aug 28, 2025
Viaarxiv icon

High-Quality Sound Separation Across Diverse Categories via Visually-Guided Generative Modeling

Add code
Sep 26, 2025
Viaarxiv icon

Contrastive timbre representations for musical instrument and synthesizer retrieval

Add code
Sep 16, 2025
Figure 1 for Contrastive timbre representations for musical instrument and synthesizer retrieval
Figure 2 for Contrastive timbre representations for musical instrument and synthesizer retrieval
Figure 3 for Contrastive timbre representations for musical instrument and synthesizer retrieval
Figure 4 for Contrastive timbre representations for musical instrument and synthesizer retrieval
Viaarxiv icon

Back to Ear: Perceptually Driven High Fidelity Music Reconstruction

Add code
Sep 18, 2025
Viaarxiv icon

MDD: A Dataset for Text-and-Music Conditioned Duet Dance Generation

Add code
Aug 23, 2025
Viaarxiv icon

Opening Musical Creativity? Embedded Ideologies in Generative-AI Music Systems

Add code
Aug 12, 2025
Figure 1 for Opening Musical Creativity? Embedded Ideologies in Generative-AI Music Systems
Figure 2 for Opening Musical Creativity? Embedded Ideologies in Generative-AI Music Systems
Viaarxiv icon

The Rhythm In Anything: Audio-Prompted Drums Generation with Masked Language Modeling

Add code
Sep 19, 2025
Figure 1 for The Rhythm In Anything: Audio-Prompted Drums Generation with Masked Language Modeling
Figure 2 for The Rhythm In Anything: Audio-Prompted Drums Generation with Masked Language Modeling
Figure 3 for The Rhythm In Anything: Audio-Prompted Drums Generation with Masked Language Modeling
Figure 4 for The Rhythm In Anything: Audio-Prompted Drums Generation with Masked Language Modeling
Viaarxiv icon

MelCap: A Unified Single-Codebook Neural Codec for High-Fidelity Audio Compression

Add code
Oct 02, 2025
Viaarxiv icon