Alert button

Disambiguating Music Artists at Scale with Audio Metric Learning

Oct 03, 2018
Jimena Royo-Letelier, Romain Hennequin, Viet-Anh Tran, Manuel Moussallam

Figure 1 for Disambiguating Music Artists at Scale with Audio Metric Learning
Figure 2 for Disambiguating Music Artists at Scale with Audio Metric Learning
Figure 3 for Disambiguating Music Artists at Scale with Audio Metric Learning
Figure 4 for Disambiguating Music Artists at Scale with Audio Metric Learning

Share this with someone who'll enjoy it:

We address the problem of disambiguating large scale catalogs through the definition of an unknown artist clustering task. We explore the use of metric learning techniques to learn artist embeddings directly from audio, and using a dedicated homonym artists dataset, we compare our method with a recent approach that learn similar embeddings using artist classifiers. While both systems have the ability to disambiguate unknown artists relying exclusively on audio, we show that our system is more suitable in the case when enough audio data is available for each artist in the train dataset. We also propose a new negative sampling method for metric learning that takes advantage of side information such as music genre during the learning phase and shows promising results for the artist clustering task.

* published in ISMIR 2018  
View paper onarxiv icon

Share this with someone who'll enjoy it: