Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jimena Royo-Letelier

Improving Collaborative Metric Learning with Efficient Negative Sampling

Sep 24, 2019

Viet-Anh Tran, Romain Hennequin, Jimena Royo-Letelier, Manuel Moussallam

Figure 1 for Improving Collaborative Metric Learning with Efficient Negative Sampling

Figure 2 for Improving Collaborative Metric Learning with Efficient Negative Sampling

Abstract:Distance metric learning based on triplet loss has been applied with success in a wide range of applications such as face recognition, image retrieval, speaker change detection and recently recommendation with the CML model. However, as we show in this article, CML requires large batches to work reasonably well because of a too simplistic uniform negative sampling strategy for selecting triplets. Due to memory limitations, this makes it difficult to scale in high-dimensional scenarios. To alleviate this problem, we propose here a 2-stage negative sampling strategy which finds triplets that are highly informative for learning. Our strategy allows CML to work effectively in terms of accuracy and popularity bias, even when the batch size is an order of magnitude smaller than what would be needed with the default uniform sampling. We demonstrate the suitability of the proposed strategy for recommendation and exhibit consistent positive results across various datasets.

* SIGIR 2019

Via

Access Paper or Ask Questions

Disambiguating Music Artists at Scale with Audio Metric Learning

Oct 03, 2018

Jimena Royo-Letelier, Romain Hennequin, Viet-Anh Tran, Manuel Moussallam

Figure 1 for Disambiguating Music Artists at Scale with Audio Metric Learning

Figure 2 for Disambiguating Music Artists at Scale with Audio Metric Learning

Figure 3 for Disambiguating Music Artists at Scale with Audio Metric Learning

Figure 4 for Disambiguating Music Artists at Scale with Audio Metric Learning

Abstract:We address the problem of disambiguating large scale catalogs through the definition of an unknown artist clustering task. We explore the use of metric learning techniques to learn artist embeddings directly from audio, and using a dedicated homonym artists dataset, we compare our method with a recent approach that learn similar embeddings using artist classifiers. While both systems have the ability to disambiguate unknown artists relying exclusively on audio, we show that our system is more suitable in the case when enough audio data is available for each artist in the train dataset. We also propose a new negative sampling method for metric learning that takes advantage of side information such as music genre during the learning phase and shows promising results for the artist clustering task.

* published in ISMIR 2018

Via

Access Paper or Ask Questions

Music Mood Detection Based On Audio And Lyrics With Deep Neural Net

Sep 19, 2018

Rémi Delbouys, Romain Hennequin, Francesco Piccoli, Jimena Royo-Letelier, Manuel Moussallam

Figure 1 for Music Mood Detection Based On Audio And Lyrics With Deep Neural Net

Figure 2 for Music Mood Detection Based On Audio And Lyrics With Deep Neural Net

Figure 3 for Music Mood Detection Based On Audio And Lyrics With Deep Neural Net

Figure 4 for Music Mood Detection Based On Audio And Lyrics With Deep Neural Net

Abstract:We consider the task of multimodal music mood prediction based on the audio signal and the lyrics of a track. We reproduce the implementation of traditional feature engineering based approaches and propose a new model based on deep learning. We compare the performance of both approaches on a database containing 18,000 tracks with associated valence and arousal values and show that our approach outperforms classical models on the arousal detection task, and that both approaches perform equally on the valence prediction task. We also compare the a posteriori fusion with fusion of modalities optimized simultaneously with each unimodal model, and observe a significant improvement of valence prediction. We release part of our database for comparison purposes.

* Published in ISMIR 2018

Via

Access Paper or Ask Questions

Word2Vec applied to Recommendation: Hyperparameters Matter

Aug 29, 2018

Hugo Caselles-Dupré, Florian Lesaint, Jimena Royo-Letelier

Figure 1 for Word2Vec applied to Recommendation: Hyperparameters Matter

Figure 2 for Word2Vec applied to Recommendation: Hyperparameters Matter

Figure 3 for Word2Vec applied to Recommendation: Hyperparameters Matter

Figure 4 for Word2Vec applied to Recommendation: Hyperparameters Matter

Abstract:Skip-gram with negative sampling, a popular variant of Word2vec originally designed and tuned to create word embeddings for Natural Language Processing, has been used to create item embeddings with successful applications in recommendation. While these fields do not share the same type of data, neither evaluate on the same tasks, recommendation applications tend to use the same already tuned hyperparameters values, even if optimal hyperparameters values are often known to be data and task dependent. We thus investigate the marginal importance of each hyperparameter in a recommendation setting through large hyperparameter grid searches on various datasets. Results reveal that optimizing neglected hyperparameters, namely negative sampling distribution, number of epochs, subsampling parameter and window-size, significantly improves performance on a recommendation task, and can increase it by an order of magnitude. Importantly, we find that optimal hyperparameters configurations for Natural Language Processing tasks and Recommendation tasks are noticeably different.

* This paper is published on the 12th ACM Conference on Recommender Systems, Vancouver, Canada, 2nd-7th October 2018

Via

Access Paper or Ask Questions