Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

Deep Conditional Representation Learning for Drum Sample Retrieval by Vocalisation



Alejandro Delgado , Charalampos Saitis , Emmanouil Benetos , Mark Sandler

* Submitted to Interspeech 2022 (under review) 

   Access Paper or Ask Questions

Exploring Transformer's potential on automatic piano transcription



Longshen Ou , Ziyi Guo , Emmanouil Benetos , Jiqing Han , Ye Wang

* Accepted by ICASSP 2022 

   Access Paper or Ask Questions

A Comparison of Deep Learning MOS Predictors for Speech Synthesis Quality



Alessandro Ragano , Emmanouil Benetos , Michael Chinen , Helard B. Martinez , Chandan K. A. Reddy , Jan Skoglund , Andrew Hines

* Submitted to INTERSPEECH 2022 

   Access Paper or Ask Questions

Improving Lyrics Alignment through Joint Pitch Detection



Jiawen Huang , Emmanouil Benetos , Sebastian Ewert

* To appear in Proc. ICASSP 2022 

   Access Paper or Ask Questions

Learning music audio representations via weak language supervision



Ilaria Manco , Emmanouil Benetos , Elio Quinton , Gyorgy Fazekas

* 5 pages, 5 figures 

   Access Paper or Ask Questions

An evaluation of data augmentation methods for sound scene geotagging



Helen L. Bear , Veronica Morfi , Emmanouil Benetos

* Presented at Interspeech 2021 

   Access Paper or Ask Questions

Joint Scattering for Automatic Chick Call Recognition



Changhong Wang , Emmanouil Benetos , Shuge Wang , Elisabetta Versace

* 5 pages, submitted to ICASSP 2022 

   Access Paper or Ask Questions

More for Less: Non-Intrusive Speech Quality Assessment with Limited Annotations



Alessandro Ragano , Emmanouil Benetos , Andrew Hines

* Published in 2021 13th International Conference on Quality of Multimedia Experience (QoMEX) 

   Access Paper or Ask Questions

Pitch-Informed Instrument Assignment Using a Deep Convolutional Network with Multiple Kernel Shapes



Carlos Lordelo , Emmanouil Benetos , Simon Dixon , Sven Ahlbäck

* 4 figures, 4 tables and 7 pages. Accepted for publication at ISMIR Conference 2021 

   Access Paper or Ask Questions

MusCaps: Generating Captions for Music Audio



Ilaria Manco , Emmanouil Benetos , Elio Quinton , Gyorgy Fazekas

* Accepted to IJCNN 2021 for the Special Session on Representation Learning for Audio, Speech, and Music Processing 

   Access Paper or Ask Questions

1
2
3
>>