music generation


Music generation is the task of generating music or music-like sounds from a model or algorithm.

Diff-V2M: A Hierarchical Conditional Diffusion Model with Explicit Rhythmic Modeling for Video-to-Music Generation

Add code
Nov 12, 2025
Figure 1 for Diff-V2M: A Hierarchical Conditional Diffusion Model with Explicit Rhythmic Modeling for Video-to-Music Generation
Figure 2 for Diff-V2M: A Hierarchical Conditional Diffusion Model with Explicit Rhythmic Modeling for Video-to-Music Generation
Figure 3 for Diff-V2M: A Hierarchical Conditional Diffusion Model with Explicit Rhythmic Modeling for Video-to-Music Generation
Figure 4 for Diff-V2M: A Hierarchical Conditional Diffusion Model with Explicit Rhythmic Modeling for Video-to-Music Generation
Viaarxiv icon

On the Joint Minimization of Regularization Loss Functions in Deep Variational Bayesian Methods for Attribute-Controlled Symbolic Music Generation

Add code
Nov 10, 2025
Viaarxiv icon

Melodia: Training-Free Music Editing Guided by Attention Probing in Diffusion Models

Add code
Nov 18, 2025
Figure 1 for Melodia: Training-Free Music Editing Guided by Attention Probing in Diffusion Models
Figure 2 for Melodia: Training-Free Music Editing Guided by Attention Probing in Diffusion Models
Figure 3 for Melodia: Training-Free Music Editing Guided by Attention Probing in Diffusion Models
Figure 4 for Melodia: Training-Free Music Editing Guided by Attention Probing in Diffusion Models
Viaarxiv icon

Conditional Diffusion as Latent Constraints for Controllable Symbolic Music Generation

Add code
Nov 10, 2025
Figure 1 for Conditional Diffusion as Latent Constraints for Controllable Symbolic Music Generation
Figure 2 for Conditional Diffusion as Latent Constraints for Controllable Symbolic Music Generation
Figure 3 for Conditional Diffusion as Latent Constraints for Controllable Symbolic Music Generation
Figure 4 for Conditional Diffusion as Latent Constraints for Controllable Symbolic Music Generation
Viaarxiv icon

MIDI-LLM: Adapting Large Language Models for Text-to-MIDI Music Generation

Add code
Nov 06, 2025
Viaarxiv icon

Bridging the Copyright Gap: Do Large Vision-Language Models Recognize and Respect Copyrighted Content?

Add code
Dec 26, 2025
Viaarxiv icon

Aliasing-Free Neural Audio Synthesis

Add code
Dec 23, 2025
Figure 1 for Aliasing-Free Neural Audio Synthesis
Figure 2 for Aliasing-Free Neural Audio Synthesis
Figure 3 for Aliasing-Free Neural Audio Synthesis
Figure 4 for Aliasing-Free Neural Audio Synthesis
Viaarxiv icon

Emovectors: assessing emotional content in jazz improvisations for creativity evaluation

Add code
Dec 09, 2025
Viaarxiv icon

Persian Musical Instruments Classification Using Polyphonic Data Augmentation

Add code
Nov 07, 2025
Viaarxiv icon

Efficient Optimization of Hierarchical Identifiers for Generative Recommendation

Add code
Dec 20, 2025
Viaarxiv icon