Get our free extension to see links to code for papers anywhere online!

 Add to Chrome

 Add to Firefox

CatalyzeX Code Finder - Browser extension linking code for ML papers across the web! | Product Hunt Embed
Learning Contextual Tag Embeddings for Cross-Modal Alignment of Audio and Tags

Oct 27, 2020
Xavier Favory, Konstantinos Drossos, Tuomas Virtanen, Xavier Serra

* 5 pages, 1 figure 

  Access Paper or Ask Questions

FSD50K: an Open Dataset of Human-Labeled Sound Events

Oct 01, 2020
Eduardo Fonseca, Xavier Favory, Jordi Pons, Frederic Font, Xavier Serra


  Access Paper or Ask Questions

COALA: Co-Aligned Autoencoders for Learning Semantically Enriched Audio Representations

Jul 08, 2020
Xavier Favory, Konstantinos Drossos, Tuomas Virtanen, Xavier Serra

* 8 pages, 1 figure, workshop on Self-supervision in Audio and Speech at the 37th International Conference on Machine Learning (ICML), 2020, Vienna, Austria 

  Access Paper or Ask Questions

Search Result Clustering in Collaborative Sound Collections

Apr 08, 2020
Xavier Favory, Frederic Font, Xavier Serra

* Proceedings of the 2020 International Conference on Multimedia Retrieval (ICMR 20), June 8-11, 2020, Dublin, Ireland. ACM, NewYork, NY, USA, 8 pages. https://doi.org/10.1145/3372278.3390691 
* 8 pages, 4 figures, ACM ICMR 20 

  Access Paper or Ask Questions

Neural Percussive Synthesis Parameterised by High-Level Timbral Features

Nov 25, 2019
Ant贸nio Ramires, Pritish Chandna, Xavier Favory, Emilia G贸mez, Xavier Serra


  Access Paper or Ask Questions

Learning Sound Event Classifiers from Web Audio with Noisy Labels

Jan 04, 2019
Eduardo Fonseca, Manoj Plakal, Daniel P. W. Ellis, Frederic Font, Xavier Favory, Xavier Serra


  Access Paper or Ask Questions

Facilitating the Manual Annotation of Sounds When Using Large Taxonomies

Nov 21, 2018
Xavier Favory, Eduardo Fonseca, Frederic Font, Xavier Serra

* Proceedings of the 23rd Conference of Open Innovations Association FRUCT, Bologna, Italy. 2018. ISSN 2305-7254, ISBN 978-952-68653-6-2, FRUCT Oy, e-ISSN 2343-0737 (license CC BY-ND) 
* 5 pages, 5 figures, IEEE FRUCT International Workshop on Semantic Audio and the Internet of Things 

  Access Paper or Ask Questions

General-purpose Tagging of Freesound Audio with AudioSet Labels: Task Description, Dataset, and Baseline

Oct 07, 2018
Eduardo Fonseca, Manoj Plakal, Frederic Font, Daniel P. W. Ellis, Xavier Favory, Jordi Pons, Xavier Serra

* Camera ready for DCASE Workshop 2018 

  Access Paper or Ask Questions