music generation


Music generation is the task of generating music or music-like sounds from a model or algorithm.

Réduire le bruit grâce à la réalité augmentée sonore -- Auditory Concealer

Add code
Apr 08, 2025
Viaarxiv icon

Unsupervised Estimation of Nonlinear Audio Effects: Comparing Diffusion-Based and Adversarial approaches

Add code
Apr 07, 2025
Viaarxiv icon

MM-StoryAgent: Immersive Narrated Storybook Video Generation with a Multi-Agent Paradigm across Text, Image and Audio

Add code
Mar 07, 2025
Figure 1 for MM-StoryAgent: Immersive Narrated Storybook Video Generation with a Multi-Agent Paradigm across Text, Image and Audio
Figure 2 for MM-StoryAgent: Immersive Narrated Storybook Video Generation with a Multi-Agent Paradigm across Text, Image and Audio
Figure 3 for MM-StoryAgent: Immersive Narrated Storybook Video Generation with a Multi-Agent Paradigm across Text, Image and Audio
Figure 4 for MM-StoryAgent: Immersive Narrated Storybook Video Generation with a Multi-Agent Paradigm across Text, Image and Audio
Viaarxiv icon

HarmonySet: A Comprehensive Dataset for Understanding Video-Music Semantic Alignment and Temporal Synchronization

Add code
Mar 04, 2025
Figure 1 for HarmonySet: A Comprehensive Dataset for Understanding Video-Music Semantic Alignment and Temporal Synchronization
Figure 2 for HarmonySet: A Comprehensive Dataset for Understanding Video-Music Semantic Alignment and Temporal Synchronization
Figure 3 for HarmonySet: A Comprehensive Dataset for Understanding Video-Music Semantic Alignment and Temporal Synchronization
Figure 4 for HarmonySet: A Comprehensive Dataset for Understanding Video-Music Semantic Alignment and Temporal Synchronization
Viaarxiv icon

High-Fidelity Music Vocoder using Neural Audio Codecs

Add code
Feb 18, 2025
Figure 1 for High-Fidelity Music Vocoder using Neural Audio Codecs
Figure 2 for High-Fidelity Music Vocoder using Neural Audio Codecs
Figure 3 for High-Fidelity Music Vocoder using Neural Audio Codecs
Figure 4 for High-Fidelity Music Vocoder using Neural Audio Codecs
Viaarxiv icon

SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation

Add code
Feb 18, 2025
Viaarxiv icon

Pitch Contour Exploration Across Audio Domains: A Vision-Based Transfer Learning Approach

Add code
Mar 24, 2025
Viaarxiv icon

Motion Anything: Any to Motion Generation

Add code
Mar 10, 2025
Viaarxiv icon

M2D2: Exploring General-purpose Audio-Language Representations Beyond CLAP

Add code
Mar 28, 2025
Viaarxiv icon

Audio signal interpolation using optimal transportation of spectrograms

Add code
Feb 21, 2025
Viaarxiv icon