music generation


Music generation is the task of generating music or music-like sounds from a model or algorithm.

SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation

Add code
Feb 18, 2025
Viaarxiv icon

Unrolled Creative Adversarial Network For Generating Novel Musical Pieces

Add code
Dec 31, 2024
Figure 1 for Unrolled Creative Adversarial Network For Generating Novel Musical Pieces
Figure 2 for Unrolled Creative Adversarial Network For Generating Novel Musical Pieces
Figure 3 for Unrolled Creative Adversarial Network For Generating Novel Musical Pieces
Figure 4 for Unrolled Creative Adversarial Network For Generating Novel Musical Pieces
Viaarxiv icon

Overview of the Amphion Toolkit (v0.2)

Add code
Jan 26, 2025
Figure 1 for Overview of the Amphion Toolkit (v0.2)
Figure 2 for Overview of the Amphion Toolkit (v0.2)
Figure 3 for Overview of the Amphion Toolkit (v0.2)
Figure 4 for Overview of the Amphion Toolkit (v0.2)
Viaarxiv icon

Text2Playlist: Generating Personalized Playlists from Text on Deezer

Add code
Jan 10, 2025
Viaarxiv icon

Music2Latent2: Audio Compression with Summary Embeddings and Autoregressive Decoding

Add code
Jan 29, 2025
Viaarxiv icon

Can Impressions of Music be Extracted from Thumbnail Images?

Add code
Jan 05, 2025
Figure 1 for Can Impressions of Music be Extracted from Thumbnail Images?
Figure 2 for Can Impressions of Music be Extracted from Thumbnail Images?
Figure 3 for Can Impressions of Music be Extracted from Thumbnail Images?
Figure 4 for Can Impressions of Music be Extracted from Thumbnail Images?
Viaarxiv icon

Multimodal Music Generation with Explicit Bridges and Retrieval Augmentation

Add code
Dec 12, 2024
Viaarxiv icon

Deepfake Detection of Singing Voices With Whisper Encodings

Add code
Jan 31, 2025
Figure 1 for Deepfake Detection of Singing Voices With Whisper Encodings
Figure 2 for Deepfake Detection of Singing Voices With Whisper Encodings
Figure 3 for Deepfake Detection of Singing Voices With Whisper Encodings
Figure 4 for Deepfake Detection of Singing Voices With Whisper Encodings
Viaarxiv icon

Text2midi: Generating Symbolic Music from Captions

Add code
Dec 21, 2024
Viaarxiv icon

Audio signal interpolation using optimal transportation of spectrograms

Add code
Feb 21, 2025
Viaarxiv icon