music generation


Music generation is the task of generating music or music-like sounds from a model or algorithm.

Jamendo-QA: A Large-Scale Music Question Answering Dataset

Add code
Sep 19, 2025
Viaarxiv icon

Video Object Segmentation-Aware Audio Generation

Add code
Sep 30, 2025
Figure 1 for Video Object Segmentation-Aware Audio Generation
Figure 2 for Video Object Segmentation-Aware Audio Generation
Figure 3 for Video Object Segmentation-Aware Audio Generation
Figure 4 for Video Object Segmentation-Aware Audio Generation
Viaarxiv icon

Pairwise and Attribute-Aware Decision Tree-Based Preference Elicitation for Cold-Start Recommendation

Add code
Oct 31, 2025
Figure 1 for Pairwise and Attribute-Aware Decision Tree-Based Preference Elicitation for Cold-Start Recommendation
Figure 2 for Pairwise and Attribute-Aware Decision Tree-Based Preference Elicitation for Cold-Start Recommendation
Figure 3 for Pairwise and Attribute-Aware Decision Tree-Based Preference Elicitation for Cold-Start Recommendation
Viaarxiv icon

High-Quality Sound Separation Across Diverse Categories via Visually-Guided Generative Modeling

Add code
Sep 26, 2025
Viaarxiv icon

From Language to Locomotion: Retargeting-free Humanoid Control via Motion Latent Guidance

Add code
Oct 16, 2025
Viaarxiv icon

The Rhythm In Anything: Audio-Prompted Drums Generation with Masked Language Modeling

Add code
Sep 19, 2025
Figure 1 for The Rhythm In Anything: Audio-Prompted Drums Generation with Masked Language Modeling
Figure 2 for The Rhythm In Anything: Audio-Prompted Drums Generation with Masked Language Modeling
Figure 3 for The Rhythm In Anything: Audio-Prompted Drums Generation with Masked Language Modeling
Figure 4 for The Rhythm In Anything: Audio-Prompted Drums Generation with Masked Language Modeling
Viaarxiv icon

TISDiSS: A Training-Time and Inference-Time Scalable Framework for Discriminative Source Separation

Add code
Sep 19, 2025
Viaarxiv icon

ACMID: Automatic Curation of Musical Instrument Dataset for 7-Stem Music Source Separation

Add code
Oct 09, 2025
Viaarxiv icon

MelCap: A Unified Single-Codebook Neural Codec for High-Fidelity Audio Compression

Add code
Oct 02, 2025
Viaarxiv icon

VoiceAssistant-Eval: Benchmarking AI Assistants across Listening, Speaking, and Viewing

Add code
Sep 26, 2025
Viaarxiv icon