music generation


Music generation is the task of generating music or music-like sounds from a model or algorithm.

Twenty-Five Years of MIR Research: Achievements, Practices, Evaluations, and Future Challenges

Add code
Nov 10, 2025
Viaarxiv icon

Music Flamingo: Scaling Music Understanding in Audio Language Models

Add code
Nov 13, 2025
Viaarxiv icon

AI as intermediary in modern-day ritual: An immersive, interactive production of the roller disco musical Xanadu at UCLA

Add code
Nov 09, 2025
Viaarxiv icon

SyMuPe: Affective and Controllable Symbolic Music Performance

Add code
Nov 05, 2025
Viaarxiv icon

Steering Autoregressive Music Generation with Recursive Feature Machines

Add code
Oct 21, 2025
Viaarxiv icon

Preference-Based Learning in Audio Applications: A Systematic Analysis

Add code
Nov 17, 2025
Figure 1 for Preference-Based Learning in Audio Applications: A Systematic Analysis
Figure 2 for Preference-Based Learning in Audio Applications: A Systematic Analysis
Figure 3 for Preference-Based Learning in Audio Applications: A Systematic Analysis
Figure 4 for Preference-Based Learning in Audio Applications: A Systematic Analysis
Viaarxiv icon

FoleyBench: A Benchmark For Video-to-Audio Models

Add code
Nov 17, 2025
Figure 1 for FoleyBench: A Benchmark For Video-to-Audio Models
Figure 2 for FoleyBench: A Benchmark For Video-to-Audio Models
Figure 3 for FoleyBench: A Benchmark For Video-to-Audio Models
Figure 4 for FoleyBench: A Benchmark For Video-to-Audio Models
Viaarxiv icon

Attribution-by-design: Ensuring Inference-Time Provenance in Generative Music Systems

Add code
Oct 09, 2025
Viaarxiv icon

Do Joint Language-Audio Embeddings Encode Perceptual Timbre Semantics?

Add code
Oct 16, 2025
Viaarxiv icon

Expressive Range Characterization of Open Text-to-Audio Models

Add code
Oct 31, 2025
Figure 1 for Expressive Range Characterization of Open Text-to-Audio Models
Figure 2 for Expressive Range Characterization of Open Text-to-Audio Models
Figure 3 for Expressive Range Characterization of Open Text-to-Audio Models
Figure 4 for Expressive Range Characterization of Open Text-to-Audio Models
Viaarxiv icon