Title: Baseline Needs More Love: On Simple Word-Embedding-Based Models and Associated Pooling Mechanisms

May 24, 2018

Dinghan Shen, Guoyin Wang, Wenlin Wang, Martin Renqiang Min, Qinliang Su, Yizhe Zhang, Chunyuan Li, Ricardo Henao, Lawrence Carin

Abstract: Many deep learning architectures have been proposed to model the compositionality in text sequences, requiring a substantial number of parameters and expensive computations. However, there has not been a rigorous evaluation regarding the added value of sophisticated compositional functions. In this paper, we conduct a point-by-point comparative study between Simple Word-Embedding-based Models (SWEMs), consisting of parameter-free pooling operations, relative to word-embedding-based RNN/CNN models. Surprisingly, SWEMs exhibit comparable or even superior performance in the majority of cases considered. Based upon this understanding, we propose two additional pooling strategies over learned word embeddings: (i) a max-pooling operation for improved interpretability; and (ii) a hierarchical pooling operation, which preserves spatial (n-gram) information within text sequences. We present experiments on 17 datasets encompassing three tasks: (i) (long) document classification; (ii) text sequence matching; and (iii) short text tasks, including classification and tagging. The source code and datasets can be obtained from https://github.com/dinghanshen/SWEM.
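To make the pooling operations named in the abstract concrete, here is a minimal sketch in plain Python. It is an illustration under stated assumptions, not the authors' implementation (see the linked repository for that). The function names `average_pool`, `max_pool`, and `hierarchical_pool` and the `window` parameter are hypothetical. The abstract does not spell out the hierarchical operation, so the sketch assumes one plausible reading: average the embeddings inside each sliding window of `window` consecutive words, then take an element-wise max over those window averages.

```python
from typing import List

Vector = List[float]


def average_pool(embeddings: List[Vector]) -> Vector:
    """Element-wise mean over all word vectors in a sequence."""
    if not embeddings:
        raise ValueError("need at least one word vector")
    count = len(embeddings)
    # zip(*embeddings) groups the values of each embedding dimension.
    return [sum(dim) / count for dim in zip(*embeddings)]


def max_pool(embeddings: List[Vector]) -> Vector:
    """Element-wise maximum over all word vectors in a sequence."""
    if not embeddings:
        raise ValueError("need at least one word vector")
    return [max(dim) for dim in zip(*embeddings)]


def hierarchical_pool(embeddings: List[Vector], window: int) -> Vector:
    """Assumed reading: local average per n-gram window, then global max.

    Because each window covers consecutive words, some word-order
    (n-gram) information survives the pooling.
    """
    if window < 1:
        raise ValueError("window must be at least 1")
    if len(embeddings) <= window:
        # Sequence shorter than the window: treat it as a single window.
        windows = [embeddings]
    else:
        windows = [
            embeddings[start:start + window]
            for start in range(len(embeddings) - window + 1)
        ]
    local_averages = [average_pool(w) for w in windows]
    return max_pool(local_averages)


# Toy 2-dimensional "embeddings" for a three-word sequence (made-up values).
sentence = [[1.0, 0.0], [0.0, 2.0], [3.0, -1.0]]
print(average_pool(sentence))
print(max_pool(sentence))              # [3.0, 2.0]
print(hierarchical_pool(sentence, 2))  # [1.5, 1.0]
```

None of the three functions has trainable parameters; in a SWEM-style model the only learned quantities would be the word embeddings themselves and whatever classifier consumes the pooled vector.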

* To appear at ACL 2018 (code: https://github.com/dinghanshen/SWEM)
