music


Song Aesthetics Evaluation with Multi-Stem Attention and Hierarchical Uncertainty Modeling

Add code
Jan 18, 2026
Viaarxiv icon

Do Neural Codecs Generalize? A Controlled Study Across Unseen Languages and Non-Speech Tasks

Add code
Jan 18, 2026
Viaarxiv icon

VidTune: Creating Video Soundtracks with Generative Music and Contextual Thumbnails

Add code
Jan 17, 2026
Viaarxiv icon

MuseAgent-1: Interactive Grounded Multimodal Understanding of Music Scores and Performance Audio

Add code
Jan 17, 2026
Viaarxiv icon

Lightweight Self-Supervised Detection of Fundamental Frequency and Accurate Probability of Voicing in Monophonic Music

Add code
Jan 16, 2026
Viaarxiv icon

Scalable Music Cover Retrieval Using Lyrics-Aligned Audio Embeddings

Add code
Jan 16, 2026
Viaarxiv icon

SLAM-LLM: A Modular, Open-Source Multimodal Large Language Model Framework and Best Practice for Speech, Language, Audio and Music Processing

Add code
Jan 14, 2026
Viaarxiv icon

Weakly Supervised Tabla Stroke Transcription via TI-SDRM: A Rhythm-Aware Lattice Rescoring Framework

Add code
Jan 13, 2026
Viaarxiv icon

FusID: Modality-Fused Semantic IDs for Generative Music Recommendation

Add code
Jan 13, 2026
Viaarxiv icon

Heterogeneous computing platform for real-time robotics

Add code
Jan 13, 2026
Viaarxiv icon