Alert button
Picture for Emmanouil Benetos

Emmanouil Benetos

Alert button

Generalized Multi-Source Inference for Text Conditioned Music Diffusion Models

Mar 18, 2024
Emilian Postolache, Giorgio Mariani, Luca Cosmo, Emmanouil Benetos, Emanuele Rodolà

Viaarxiv icon

WavCraft: Audio Editing and Generation with Natural Language Prompts

Mar 15, 2024
Jinhua Liang, Huan Zhang, Haohe Liu, Yin Cao, Qiuqiang Kong, Xubo Liu, Wenwu Wang, Mark D. Plumbley, Huy Phan, Emmanouil Benetos

Viaarxiv icon

ChatMusician: Understanding and Generating Music Intrinsically with LLM

Feb 25, 2024
Ruibin Yuan, Hanfeng Lin, Yi Wang, Zeyue Tian, Shangda Wu, Tianhao Shen, Ge Zhang, Yuhang Wu, Cong Liu, Ziya Zhou, Ziyang Ma, Liumeng Xue, Ziyu Wang, Qin Liu, Tianyu Zheng, Yizhi Li, Yinghao Ma, Yiming Liang, Xiaowei Chi, Ruibo Liu, Zili Wang, Pengfei Li, Jingcheng Wu, Chenghua Lin, Qifeng Liu, Tao Jiang, Wenhao Huang, Wenhu Chen, Emmanouil Benetos, Jie Fu, Gus Xia, Roger Dannenberg, Wei Xue, Shiyin Kang, Yike Guo

Viaarxiv icon

A Data-Driven Analysis of Robust Automatic Piano Transcription

Feb 02, 2024
Drew Edwards, Simon Dixon, Emmanouil Benetos, Akira Maezawa, Yuta Kusaka

Viaarxiv icon

Acoustic Prompt Tuning: Empowering Large Language Models with Audition Capabilities

Nov 30, 2023
Jinhua Liang, Xubo Liu, Wenwu Wang, Mark D. Plumbley, Huy Phan, Emmanouil Benetos

Viaarxiv icon

The Song Describer Dataset: a Corpus of Audio Captions for Music-and-Language Evaluation

Nov 22, 2023
Ilaria Manco, Benno Weck, SeungHeon Doh, Minz Won, Yixiao Zhang, Dmitry Bogdanov, Yusong Wu, Ke Chen, Philip Tovstogan, Emmanouil Benetos, Elio Quinton, György Fazekas, Juhan Nam

Viaarxiv icon

ATGNN: Audio Tagging Graph Neural Network

Nov 02, 2023
Shubhr Singh, Christian J. Steinmetz, Emmanouil Benetos, Huy Phan, Dan Stowell

Figure 1 for ATGNN: Audio Tagging Graph Neural Network
Figure 2 for ATGNN: Audio Tagging Graph Neural Network
Figure 3 for ATGNN: Audio Tagging Graph Neural Network
Figure 4 for ATGNN: Audio Tagging Graph Neural Network
Viaarxiv icon

MERTech: Instrument Playing Technique Detection Using Self-Supervised Pretrained Model With Multi-Task Finetuning

Oct 15, 2023
Dichucheng Li, Yinghao Ma, Weixing Wei, Qiuqiang Kong, Yulun Wu, Mingjin Che, Fan Xia, Emmanouil Benetos, Wei Li

Viaarxiv icon

MusiLingo: Bridging Music and Text with Pre-trained Language Models for Music Captioning and Query Response

Sep 15, 2023
Zihao Deng, Yinghao Ma, Yudong Liu, Rongchen Guo, Ge Zhang, Wenhu Chen, Wenhao Huang, Emmanouil Benetos

Figure 1 for MusiLingo: Bridging Music and Text with Pre-trained Language Models for Music Captioning and Query Response
Figure 2 for MusiLingo: Bridging Music and Text with Pre-trained Language Models for Music Captioning and Query Response
Figure 3 for MusiLingo: Bridging Music and Text with Pre-trained Language Models for Music Captioning and Query Response
Viaarxiv icon