Title: Multilingual Hierarchical Attention Networks for Document Classification

Sep 15, 2017

Nikolaos Pappas, Andrei Popescu-Belis

Abstract: Hierarchical attention networks have recently achieved remarkable performance for document classification in a given language. However, when multilingual document collections are considered, training such models separately for each language entails linear parameter growth and lack of cross-language transfer. Learning a single multilingual model with fewer parameters is therefore a challenging but potentially beneficial objective. To this end, we propose multilingual hierarchical attention networks for learning document structures, with shared encoders and/or shared attention mechanisms across languages, using multi-task learning and an aligned semantic space as input. We evaluate the proposed models on multilingual document classification with disjoint label sets, on a large dataset which we provide, with 600k news documents in 8 languages, and 5k labels. The multilingual models outperform monolingual ones in low-resource as well as full-resource settings, and use fewer parameters, thus confirming their computational efficiency and the utility of cross-language transfer.
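The parameter argument in the abstract can be illustrated with a small counting sketch. It is not the authors' code. The function name `han_param_count` and all component sizes are hypothetical, chosen only to show how separate per-language models grow linearly with the number of languages while shared encoders and/or attention mechanisms are counted once. It also assumes each language keeps its own output layer, which seems plausible given the disjoint label sets but is not stated in the abstract.

```python
def han_param_count(num_languages, encoder_params, attention_params,
                    classifier_params, share_encoders=False,
                    share_attention=False):
    """Count parameters for a set of per-language classifiers.

    All sizes are hypothetical inputs. A shared component is counted
    once; an unshared component is counted once per language.
    """
    encoders = encoder_params * (1 if share_encoders else num_languages)
    attention = attention_params * (1 if share_attention else num_languages)
    # Assumption: label sets are disjoint, so every language keeps
    # its own output layer regardless of what else is shared.
    classifiers = classifier_params * num_languages
    return encoders + attention + classifiers


# Hypothetical component sizes for 8 languages.
sizes = dict(num_languages=8, encoder_params=100_000,
             attention_params=10_000, classifier_params=50_000)

monolingual = han_param_count(**sizes)
shared_encoders = han_param_count(**sizes, share_encoders=True)
shared_both = han_param_count(**sizes, share_encoders=True,
                              share_attention=True)

print(monolingual)      # 1280000
print(shared_encoders)  # 580000
print(shared_both)      # 510000
```

Under these made-up sizes, the monolingual total scales with all three components times the number of languages, whereas with full sharing only the output layers scale. The actual savings depend on the real layer dimensions, which the abstract does not give.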
