"music": models, code, and papers

SongComposer: A Large Language Model for Lyric and Melody Composition in Song Generation

Feb 27, 2024
Shuangrui Ding, Zihan Liu, Xiaoyi Dong, Pan Zhang, Rui Qian, Conghui He, Dahua Lin, Jiaqi Wang

Via arXiv

OntoChat: a Framework for Conversational Ontology Engineering using Language Models

Mar 09, 2024
Bohui Zhang, Valentina Anita Carriero, Katrin Schreiberhuber, Stefani Tsaneva, Lucía Sánchez González, Jongmo Kim, Jacopo de Berardinis

Via arXiv

Dance-to-Music Generation with Encoder-based Textual Inversion of Diffusion Models

Jan 31, 2024
Sifei Li, Weiming Dong, Yuxin Zhang, Fan Tang, Chongyang Ma, Oliver Deussen, Tong-Yee Lee, Changsheng Xu

Via arXiv

Answering Diverse Questions via Text Attached with Key Audio-Visual Clues

Mar 11, 2024
Qilang Ye, Zitong Yu, Xin Liu

Via arXiv

Intelligent Director: An Automatic Framework for Dynamic Visual Composition using ChatGPT

Feb 24, 2024
Sixiao Zheng, Jingyang Huo, Yu Wang, Yanwei Fu

Via arXiv

ByteComposer: a Human-like Melody Composition Method based on Language Model Agent

Feb 24, 2024
Xia Liang, Jiaju Lin, Xinjian Du

Via arXiv

Room Impulse Response Estimation using Optimal Transport: Simulation-Informed Inference

Mar 06, 2024
David Sundström, Anton Björkman, Andreas Jakobsson, Filip Elvander

Via arXiv

Remixing Music for Hearing Aids Using Ensemble of Fine-Tuned Source Separators

Feb 01, 2024
Matthew Daly

Via arXiv

Arbitrary Discrete Fourier Analysis and Its Application in Replayed Speech Detection

Mar 02, 2024
Shih-Kuang Lee

Via arXiv

Toward Fully Self-Supervised Multi-Pitch Estimation

Feb 23, 2024
Frank Cwitkowitz, Zhiyao Duan

Via arXiv