music generation


Music generation is the task of generating music or music-like sounds from a model or algorithm.

DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment

Add code
Jul 03, 2025
Viaarxiv icon

Detecting Musical Deepfakes

Add code
May 03, 2025
Viaarxiv icon

Do Music Preferences Reflect Cultural Values? A Cross-National Analysis Using Music Embedding and World Values Survey

Add code
Jun 16, 2025
Viaarxiv icon

U-SAM: An audio language Model for Unified Speech, Audio, and Music Understanding

Add code
May 20, 2025
Viaarxiv icon

JaccDiv: A Metric and Benchmark for Quantifying Diversity of Generated Marketing Text in the Music Industry

Add code
Apr 29, 2025
Figure 1 for JaccDiv: A Metric and Benchmark for Quantifying Diversity of Generated Marketing Text in the Music Industry
Figure 2 for JaccDiv: A Metric and Benchmark for Quantifying Diversity of Generated Marketing Text in the Music Industry
Figure 3 for JaccDiv: A Metric and Benchmark for Quantifying Diversity of Generated Marketing Text in the Music Industry
Figure 4 for JaccDiv: A Metric and Benchmark for Quantifying Diversity of Generated Marketing Text in the Music Industry
Viaarxiv icon

Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generation

Add code
Jun 24, 2025
Figure 1 for Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generation
Figure 2 for Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generation
Figure 3 for Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generation
Figure 4 for Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generation
Viaarxiv icon

Evaluating Human-AI Interaction via Usability, User Experience and Acceptance Measures for MMM-C: A Creative AI System for Music Composition

Add code
Apr 18, 2025
Figure 1 for Evaluating Human-AI Interaction via Usability, User Experience and Acceptance Measures for MMM-C: A Creative AI System for Music Composition
Figure 2 for Evaluating Human-AI Interaction via Usability, User Experience and Acceptance Measures for MMM-C: A Creative AI System for Music Composition
Figure 3 for Evaluating Human-AI Interaction via Usability, User Experience and Acceptance Measures for MMM-C: A Creative AI System for Music Composition
Figure 4 for Evaluating Human-AI Interaction via Usability, User Experience and Acceptance Measures for MMM-C: A Creative AI System for Music Composition
Viaarxiv icon

Semantics-Aware Human Motion Generation from Audio Instructions

Add code
May 29, 2025
Viaarxiv icon

Can Large Language Models Predict Audio Effects Parameters from Natural Language?

Add code
May 27, 2025
Figure 1 for Can Large Language Models Predict Audio Effects Parameters from Natural Language?
Figure 2 for Can Large Language Models Predict Audio Effects Parameters from Natural Language?
Figure 3 for Can Large Language Models Predict Audio Effects Parameters from Natural Language?
Figure 4 for Can Large Language Models Predict Audio Effects Parameters from Natural Language?
Viaarxiv icon

MelodySim: Measuring Melody-aware Music Similarity for Plagiarism Detection

Add code
May 27, 2025
Viaarxiv icon