music generation


Music generation is the task of generating music or music-like sounds from a model or algorithm.

Watch and Listen: Understanding Audio-Visual-Speech Moments with Multimodal LLM

Add code
May 23, 2025
Viaarxiv icon

DiffRhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion

Add code
Mar 03, 2025
Viaarxiv icon

WiCAL: Accurate Wi-Fi-Based 3D Localization Enabled by Collaborative Antenna Arrays

Add code
May 27, 2025
Viaarxiv icon

DGFM: Full Body Dance Generation Driven by Music Foundation Models

Add code
Feb 27, 2025
Viaarxiv icon

MusicInfuser: Making Video Diffusion Listen and Dance

Add code
Mar 18, 2025
Viaarxiv icon

DRAGON: Distributional Rewards Optimize Diffusion Generative Models

Add code
Apr 21, 2025
Viaarxiv icon

TCSinger 2: Customizable Multilingual Zero-shot Singing Voice Synthesis

Add code
May 20, 2025
Figure 1 for TCSinger 2: Customizable Multilingual Zero-shot Singing Voice Synthesis
Figure 2 for TCSinger 2: Customizable Multilingual Zero-shot Singing Voice Synthesis
Figure 3 for TCSinger 2: Customizable Multilingual Zero-shot Singing Voice Synthesis
Figure 4 for TCSinger 2: Customizable Multilingual Zero-shot Singing Voice Synthesis
Viaarxiv icon

DanceMosaic: High-Fidelity Dance Generation with Multimodal Editability

Add code
Apr 06, 2025
Viaarxiv icon

SoK: How Robust is Audio Watermarking in Generative AI models?

Add code
Mar 27, 2025
Figure 1 for SoK: How Robust is Audio Watermarking in Generative AI models?
Figure 2 for SoK: How Robust is Audio Watermarking in Generative AI models?
Figure 3 for SoK: How Robust is Audio Watermarking in Generative AI models?
Figure 4 for SoK: How Robust is Audio Watermarking in Generative AI models?
Viaarxiv icon

Aligning Text-to-Music Evaluation with Human Preferences

Add code
Mar 20, 2025
Viaarxiv icon