Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Xavier Serra

Melon Playlist Dataset: a public dataset for audio-based playlist generation and music tagging


Jan 30, 2021
Andres Ferraro, Yuntae Kim, Soohyeon Lee, Biho Kim, Namjun Jo, Semi Lim, Suyon Lim, Jungtaek Jang, Sehwan Kim, Xavier Serra, Dmitry Bogdanov

* 2021 IEEE International Conference on Acoustics, Speech and Signal Processing 

  Access Paper or Ask Questions

Unsupervised Contrastive Learning of Sound Event Representations


Nov 15, 2020
Eduardo Fonseca, Diego Ortego, Kevin McGuinness, Noel E. O'Connor, Xavier Serra

* A 4-page version is submitted to ICASSP 2021 

  Access Paper or Ask Questions

Learning Contextual Tag Embeddings for Cross-Modal Alignment of Audio and Tags


Oct 27, 2020
Xavier Favory, Konstantinos Drossos, Tuomas Virtanen, Xavier Serra

* 5 pages, 1 figure 

  Access Paper or Ask Questions

FSD50K: an Open Dataset of Human-Labeled Sound Events


Oct 01, 2020
Eduardo Fonseca, Xavier Favory, Jordi Pons, Frederic Font, Xavier Serra


  Access Paper or Ask Questions

COALA: Co-Aligned Autoencoders for Learning Semantically Enriched Audio Representations


Jul 08, 2020
Xavier Favory, Konstantinos Drossos, Tuomas Virtanen, Xavier Serra

* 8 pages, 1 figure, workshop on Self-supervision in Audio and Speech at the 37th International Conference on Machine Learning (ICML), 2020, Vienna, Austria 

  Access Paper or Ask Questions

Addressing Missing Labels in Large-scale Sound Event Recognition using a Teacher-student Framework with Loss Masking


May 02, 2020
Eduardo Fonseca, Shawn Hershey, Manoj Plakal, Daniel P. W. Ellis, Aren Jansen, R. Channing Moore, Xavier Serra


  Access Paper or Ask Questions

Search Result Clustering in Collaborative Sound Collections


Apr 08, 2020
Xavier Favory, Frederic Font, Xavier Serra

* Proceedings of the 2020 International Conference on Multimedia Retrieval (ICMR 20), June 8-11, 2020, Dublin, Ireland. ACM, NewYork, NY, USA, 8 pages. https://doi.org/10.1145/3372278.3390691 
* 8 pages, 4 figures, ACM ICMR 20 

  Access Paper or Ask Questions

TensorFlow Audio Models in Essentia


Mar 16, 2020
Pablo Alonso-Jim茅nez, Dmitry Bogdanov, Jordi Pons, Xavier Serra


  Access Paper or Ask Questions

Neural Percussive Synthesis Parameterised by High-Level Timbral Features


Nov 25, 2019
Ant贸nio Ramires, Pritish Chandna, Xavier Favory, Emilia G贸mez, Xavier Serra


  Access Paper or Ask Questions

Model-agnostic Approaches to Handling Noisy Labels When Training Sound Event Classifiers


Oct 26, 2019
Eduardo Fonseca, Frederic Font, Xavier Serra

* WASPAA 2019 

  Access Paper or Ask Questions

musicnn: Pre-trained convolutional neural networks for music audio tagging


Sep 14, 2019
Jordi Pons, Xavier Serra

* Accepted to be presented at the Late-Breaking/Demo session of ISMIR 2019 

  Access Paper or Ask Questions

A hybrid parametric-deep learning approach for sound event localization and detection


Aug 27, 2019
Andres Perez-Lopez, Eduardo Fonseca, Xavier Serra

* 5 pages, 5 figures, submitted to DCASE2019 Workshop 

  Access Paper or Ask Questions

Audio tagging with noisy labels and minimal supervision


Jul 14, 2019
Eduardo Fonseca, Manoj Plakal, Frederic Font, Daniel P. W. Ellis, Xavier Serra

* submitted to DCASE2019 Workshop 

  Access Paper or Ask Questions

Toward Interpretable Music Tagging with Self-Attention


Jun 12, 2019
Minz Won, Sanghyuk Chun, Xavier Serra

* 13 pages, 12 figures; code: https://github.com/minzwon/self-attention-music-tagging 

  Access Paper or Ask Questions

Learning Sound Event Classifiers from Web Audio with Noisy Labels


Jan 04, 2019
Eduardo Fonseca, Manoj Plakal, Daniel P. W. Ellis, Frederic Font, Xavier Favory, Xavier Serra


  Access Paper or Ask Questions

Facilitating the Manual Annotation of Sounds When Using Large Taxonomies


Nov 21, 2018
Xavier Favory, Eduardo Fonseca, Frederic Font, Xavier Serra

* Proceedings of the 23rd Conference of Open Innovations Association FRUCT, Bologna, Italy. 2018. ISSN 2305-7254, ISBN 978-952-68653-6-2, FRUCT Oy, e-ISSN 2343-0737 (license CC BY-ND) 
* 5 pages, 5 figures, IEEE FRUCT International Workshop on Semantic Audio and the Internet of Things 

  Access Paper or Ask Questions

Training neural audio classifiers with few data


Nov 03, 2018
Jordi Pons, Joan Serr脿, Xavier Serra

* Code: https://github.com/jordipons/neural-classifiers-with-few-audio/ 

  Access Paper or Ask Questions

End-to-end music source separation: is it possible in the waveform domain?


Oct 29, 2018
Francesc Llu铆s, Jordi Pons, Xavier Serra

* Code: https://github.com/francesclluis/source-separation-wavenet and Demo: http://jordipons.me/apps/end-to-end-music-source-separation/ 

  Access Paper or Ask Questions

General-purpose Tagging of Freesound Audio with AudioSet Labels: Task Description, Dataset, and Baseline


Oct 07, 2018
Eduardo Fonseca, Manoj Plakal, Frederic Font, Daniel P. W. Ellis, Xavier Favory, Jordi Pons, Xavier Serra

* Camera ready for DCASE Workshop 2018 

  Access Paper or Ask Questions

Natural Language Processing for Music Knowledge Discovery


Jul 06, 2018
Sergio Oramas, Luis Espinosa-Anke, Francisco G贸mez, Xavier Serra

* Journal of New Music Research (2018) 

  Access Paper or Ask Questions

A Simple Fusion of Deep and Shallow Learning for Acoustic Scene Classification


Jun 27, 2018
Eduardo Fonseca, Rong Gong, Xavier Serra

* accepted to SMC 2018; updated Figure 7, results unchanged 

  Access Paper or Ask Questions

End-to-end learning for music audio tagging at scale


Jun 15, 2018
Jordi Pons, Oriol Nieto, Matthew Prockup, Erik Schmidt, Andreas Ehmann, Xavier Serra

* Presented at the Workshop on Machine Learning for Audio Signal Processing (ML4Audio) at NIPS 2017, and in proceedings of the 19th International Society for Music Information Retrieval Conference (ISMIR2018). Code: https://github.com/jordipons/music-audio-tagging-at-scale-models. Demo: http://www.jordipons.me/apps/music-audio-tagging-at-scale-demo/ 

  Access Paper or Ask Questions

Assessing the impact of machine intelligence on human behaviour: an interdisciplinary endeavour


Jun 07, 2018
Emilia G贸mez, Carlos Castillo, Vicky Charisi, Ver贸nica Dahl, Gustavo Deco, Blagoj Delipetrev, Nicole Dewandre, Miguel 脕ngel Gonz谩lez-Ballester, Fabien Gouyon, Jos茅 Hern谩ndez-Orallo, Perfecto Herrera, Anders Jonsson, Ansgar Koene, Martha Larson, Ram贸n L贸pez de M谩ntaras, Bertin Martens, Marius Miron, Rub茅n Moreno-Bote, Nuria Oliver, Antonio Puertas Gallardo, Heike Schweitzer, Nuria Sebastian, Xavier Serra, Joan Serr脿, Song眉l Tolan, Karina Vold

* Proceedings of 1st HUMAINT (Human Behaviour and Machine Intelligence) workshop, Barcelona, Spain, March 5-6, 2018, edited by European Commission, Seville, 2018, JRC111773 https://ec.europa.eu/jrc/communities/community/humaint/document/assessing-impact-machine-intelligence-human-behaviour-interdisciplinary. arXiv admin note: text overlap with arXiv:1409.3097 by other authors 

  Access Paper or Ask Questions

Transfer Learning of Artist Group Factors to Musical Genre Classification


May 05, 2018
Jaehun Kim, Minz Won, Xavier Serra, Cynthia C. S. Liem

* The Web Conference 2018 

  Access Paper or Ask Questions

A Deep Multimodal Approach for Cold-start Music Recommendation


Jul 24, 2017
Sergio Oramas, Oriol Nieto, Mohamed Sordo, Xavier Serra

* In Proceedings of the 2nd Workshop on Deep Learning for Recommender Systems (DLRS 2017), collocated with RecSys 2017 

  Access Paper or Ask Questions

Metrical-accent Aware Vocal Onset Detection in Polyphonic Audio


Jul 19, 2017
Georgi Dzhambazov, Andre Holzapfel, Ajay Srinivasamurthy, Xavier Serra

* International Society for Music Information Retrieval Conferece (ISMIR 2017) 

  Access Paper or Ask Questions

Characterization and exploitation of community structure in cover song networks


Sep 12, 2011
Joan Serr脿, Massimiliano Zanin, Perfecto Herrera, Xavier Serra

* Pattern Recognition Letters 33(9): 1032-1041, 2012 

  Access Paper or Ask Questions