Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Leveraging Large Amounts of Weakly Supervised Data for Multi-Language Sentiment Classification

Mar 07, 2017

Jan Deriu, Aurelien Lucchi, Valeria De Luca, Aliaksei Severyn, Simon Müller, Mark Cieliebak, Thomas Hofmann, Martin Jaggi

Figure 1 for Leveraging Large Amounts of Weakly Supervised Data for Multi-Language Sentiment Classification

Figure 2 for Leveraging Large Amounts of Weakly Supervised Data for Multi-Language Sentiment Classification

Figure 3 for Leveraging Large Amounts of Weakly Supervised Data for Multi-Language Sentiment Classification

Figure 4 for Leveraging Large Amounts of Weakly Supervised Data for Multi-Language Sentiment Classification

Share this with someone who'll enjoy it:

Abstract:This paper presents a novel approach for multi-lingual sentiment classification in short texts. This is a challenging task as the amount of training data in languages other than English is very limited. Previously proposed multi-lingual approaches typically require to establish a correspondence to English for which powerful classifiers are already available. In contrast, our method does not require such supervision. We leverage large amounts of weakly-supervised data in various languages to train a multi-layer convolutional network and demonstrate the importance of using pre-training of such networks. We thoroughly evaluate our approach on various multi-lingual datasets, including the recent SemEval-2016 sentiment prediction benchmark (Task 4), where we achieved state-of-the-art performance. We also compare the performance of our model trained individually for each language to a variant trained for all languages at once. We show that the latter model reaches slightly worse - but still acceptable - performance when compared to the single language model, while benefiting from better generalization properties across languages.

* appearing at WWW 2017 - 26th International World Wide Web Conference

View paper on

Share this with someone who'll enjoy it:

Title:Leveraging Large Amounts of Weakly Supervised Data for Multi-Language Sentiment Classification

Paper and Code