Alert button

"music": models, code, and papers
Alert button

SMITIN: Self-Monitored Inference-Time INtervention for Generative Music Transformers

Apr 02, 2024
Junghyun Koo, Gordon Wichern, Francois G. Germain, Sameer Khurana, Jonathan Le Roux

Viaarxiv icon

Audio Dialogues: Dialogues dataset for audio and music understanding

Add code
Bookmark button
Alert button
Apr 11, 2024
Arushi Goel, Zhifeng Kong, Rafael Valle, Bryan Catanzaro

Viaarxiv icon

A Novel Audio Representation for Music Genre Identification in MIR

Apr 01, 2024
Navin Kamuni, Mayank Jindal, Arpita Soni, Sukender Reddy Mallreddy, Sharath Chandra Macha

Viaarxiv icon

Practical End-to-End Optical Music Recognition for Pianoform Music

Mar 20, 2024
Jiří Mayer, Milan Straka, Jan Hajič jr., Pavel Pecina

Figure 1 for Practical End-to-End Optical Music Recognition for Pianoform Music
Figure 2 for Practical End-to-End Optical Music Recognition for Pianoform Music
Figure 3 for Practical End-to-End Optical Music Recognition for Pianoform Music
Figure 4 for Practical End-to-End Optical Music Recognition for Pianoform Music
Viaarxiv icon

Jointly Recognizing Speech and Singing Voices Based on Multi-Task Audio Source Separation

Apr 17, 2024
Ye Bai, Chenxing Li, Hao Li, Yuanyuan Zhao, Xiaorui Wang

Viaarxiv icon

A Diffusion-Based Generative Equalizer for Music Restoration

Mar 27, 2024
Eloi Moliner, Maija Turunen, Filip Elvander, Vesa Välimäki

Viaarxiv icon

Shotit: compute-efficient image-to-video search engine for the cloud

Apr 18, 2024
Leslie Wong

Viaarxiv icon

Look, Listen, and Answer: Overcoming Biases for Audio-Visual Question Answering

Apr 18, 2024
Jie Ma, Min Hu, Pinghui Wang, Wangchun Sun, Lingyun Song, Hongbin Pei, Jun Liu, Youtian Du

Viaarxiv icon

DanceCamera3D: 3D Camera Movement Synthesis with Music and Dance

Add code
Bookmark button
Alert button
Mar 20, 2024
Zixuan Wang, Jia Jia, Shikun Sun, Haozhe Wu, Rong Han, Zhenyu Li, Di Tang, Jiaqing Zhou, Jiebo Luo

Figure 1 for DanceCamera3D: 3D Camera Movement Synthesis with Music and Dance
Figure 2 for DanceCamera3D: 3D Camera Movement Synthesis with Music and Dance
Figure 3 for DanceCamera3D: 3D Camera Movement Synthesis with Music and Dance
Figure 4 for DanceCamera3D: 3D Camera Movement Synthesis with Music and Dance
Viaarxiv icon

Music to Dance as Language Translation using Sequence Models

Add code
Bookmark button
Alert button
Mar 22, 2024
André Correia, Luís A. Alexandre

Viaarxiv icon