Music generation


Music generation is the task of producing music or music-like sounds with a model or algorithm.

Every Image Listens, Every Image Dances: Music-Driven Image Animation

Jan 30, 2025

HarmonySet: A Comprehensive Dataset for Understanding Video-Music Semantic Alignment and Temporal Synchronization

Mar 04, 2025

RenderBox: Expressive Performance Rendering with Text Control

Feb 11, 2025

Pitch Contour Exploration Across Audio Domains: A Vision-Based Transfer Learning Approach

Mar 24, 2025

Overview of the Amphion Toolkit (v0.2)

Jan 26, 2025

Motion Anything: Any to Motion Generation

Mar 10, 2025

High-Fidelity Music Vocoder using Neural Audio Codecs

Feb 18, 2025

SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation

Feb 18, 2025

M2D2: Exploring General-purpose Audio-Language Representations Beyond CLAP

Mar 28, 2025

Music2Latent2: Audio Compression with Summary Embeddings and Autoregressive Decoding

Jan 29, 2025