music generation


Music generation is the task of generating music or music-like sounds from a model or algorithm.

RenderBox: Expressive Performance Rendering with Text Control

Add code
Feb 11, 2025
Viaarxiv icon

HarmonySet: A Comprehensive Dataset for Understanding Video-Music Semantic Alignment and Temporal Synchronization

Add code
Mar 04, 2025
Figure 1 for HarmonySet: A Comprehensive Dataset for Understanding Video-Music Semantic Alignment and Temporal Synchronization
Figure 2 for HarmonySet: A Comprehensive Dataset for Understanding Video-Music Semantic Alignment and Temporal Synchronization
Figure 3 for HarmonySet: A Comprehensive Dataset for Understanding Video-Music Semantic Alignment and Temporal Synchronization
Figure 4 for HarmonySet: A Comprehensive Dataset for Understanding Video-Music Semantic Alignment and Temporal Synchronization
Viaarxiv icon

Pitch Contour Exploration Across Audio Domains: A Vision-Based Transfer Learning Approach

Add code
Mar 24, 2025
Viaarxiv icon

High-Fidelity Music Vocoder using Neural Audio Codecs

Add code
Feb 18, 2025
Figure 1 for High-Fidelity Music Vocoder using Neural Audio Codecs
Figure 2 for High-Fidelity Music Vocoder using Neural Audio Codecs
Figure 3 for High-Fidelity Music Vocoder using Neural Audio Codecs
Figure 4 for High-Fidelity Music Vocoder using Neural Audio Codecs
Viaarxiv icon

Motion Anything: Any to Motion Generation

Add code
Mar 10, 2025
Viaarxiv icon

SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation

Add code
Feb 18, 2025
Viaarxiv icon

M2D2: Exploring General-purpose Audio-Language Representations Beyond CLAP

Add code
Mar 28, 2025
Viaarxiv icon

Audio signal interpolation using optimal transportation of spectrograms

Add code
Feb 21, 2025
Viaarxiv icon

Meta Audiobox Aesthetics: Unified Automatic Quality Assessment for Speech, Music, and Sound

Add code
Feb 07, 2025
Figure 1 for Meta Audiobox Aesthetics: Unified Automatic Quality Assessment for Speech, Music, and Sound
Figure 2 for Meta Audiobox Aesthetics: Unified Automatic Quality Assessment for Speech, Music, and Sound
Figure 3 for Meta Audiobox Aesthetics: Unified Automatic Quality Assessment for Speech, Music, and Sound
Figure 4 for Meta Audiobox Aesthetics: Unified Automatic Quality Assessment for Speech, Music, and Sound
Viaarxiv icon

Antenna Position Optimization for Movable Antenna-Empowered Near-Field Sensing

Add code
Feb 05, 2025
Figure 1 for Antenna Position Optimization for Movable Antenna-Empowered Near-Field Sensing
Figure 2 for Antenna Position Optimization for Movable Antenna-Empowered Near-Field Sensing
Figure 3 for Antenna Position Optimization for Movable Antenna-Empowered Near-Field Sensing
Figure 4 for Antenna Position Optimization for Movable Antenna-Empowered Near-Field Sensing
Viaarxiv icon