music generation


Music generation is the task of generating music or music-like sounds from a model or algorithm.

U-SAM: An audio language Model for Unified Speech, Audio, and Music Understanding

Add code
May 20, 2025
Viaarxiv icon

JaccDiv: A Metric and Benchmark for Quantifying Diversity of Generated Marketing Text in the Music Industry

Add code
Apr 29, 2025
Figure 1 for JaccDiv: A Metric and Benchmark for Quantifying Diversity of Generated Marketing Text in the Music Industry
Figure 2 for JaccDiv: A Metric and Benchmark for Quantifying Diversity of Generated Marketing Text in the Music Industry
Figure 3 for JaccDiv: A Metric and Benchmark for Quantifying Diversity of Generated Marketing Text in the Music Industry
Figure 4 for JaccDiv: A Metric and Benchmark for Quantifying Diversity of Generated Marketing Text in the Music Industry
Viaarxiv icon

Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generation

Add code
Jun 24, 2025
Figure 1 for Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generation
Figure 2 for Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generation
Figure 3 for Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generation
Figure 4 for Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generation
Viaarxiv icon

Evaluating Human-AI Interaction via Usability, User Experience and Acceptance Measures for MMM-C: A Creative AI System for Music Composition

Add code
Apr 18, 2025
Figure 1 for Evaluating Human-AI Interaction via Usability, User Experience and Acceptance Measures for MMM-C: A Creative AI System for Music Composition
Figure 2 for Evaluating Human-AI Interaction via Usability, User Experience and Acceptance Measures for MMM-C: A Creative AI System for Music Composition
Figure 3 for Evaluating Human-AI Interaction via Usability, User Experience and Acceptance Measures for MMM-C: A Creative AI System for Music Composition
Figure 4 for Evaluating Human-AI Interaction via Usability, User Experience and Acceptance Measures for MMM-C: A Creative AI System for Music Composition
Viaarxiv icon

Semantics-Aware Human Motion Generation from Audio Instructions

Add code
May 29, 2025
Viaarxiv icon

Can Large Language Models Predict Audio Effects Parameters from Natural Language?

Add code
May 27, 2025
Figure 1 for Can Large Language Models Predict Audio Effects Parameters from Natural Language?
Figure 2 for Can Large Language Models Predict Audio Effects Parameters from Natural Language?
Figure 3 for Can Large Language Models Predict Audio Effects Parameters from Natural Language?
Figure 4 for Can Large Language Models Predict Audio Effects Parameters from Natural Language?
Viaarxiv icon

MelodySim: Measuring Melody-aware Music Similarity for Plagiarism Detection

Add code
May 27, 2025
Viaarxiv icon

STAGE: Stemmed Accompaniment Generation through Prefix-Based Conditioning

Add code
Apr 09, 2025
Viaarxiv icon

ELGAR: Expressive Cello Performance Motion Generation for Audio Rendition

Add code
May 07, 2025
Viaarxiv icon

SongEval: A Benchmark Dataset for Song Aesthetics Evaluation

Add code
May 16, 2025
Viaarxiv icon