Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Andrew Runge

Exploring Neural Entity Representations for Semantic Information

Nov 17, 2020

Andrew Runge, Eduard Hovy

Figure 1 for Exploring Neural Entity Representations for Semantic Information

Figure 2 for Exploring Neural Entity Representations for Semantic Information

Figure 3 for Exploring Neural Entity Representations for Semantic Information

Figure 4 for Exploring Neural Entity Representations for Semantic Information

Abstract:Neural methods for embedding entities are typically extrinsically evaluated on downstream tasks and, more recently, intrinsically using probing tasks. Downstream task-based comparisons are often difficult to interpret due to differences in task structure, while probing task evaluations often look at only a few attributes and models. We address both of these issues by evaluating a diverse set of eight neural entity embedding methods on a set of simple probing tasks, demonstrating which methods are able to remember words used to describe entities, learn type, relationship and factual information, and identify how frequently an entity is mentioned. We also compare these methods in a unified framework on two entity linking tasks and discuss how they generalize to different model architectures and datasets.

* Proceedings of the Third BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP 2020, p. 204-216
* 9 pages, 1 figure

Via

Access Paper or Ask Questions

Optimizing Segmentation Granularity for Neural Machine Translation

Oct 19, 2018

Elizabeth Salesky, Andrew Runge, Alex Coda, Jan Niehues, Graham Neubig

Figure 1 for Optimizing Segmentation Granularity for Neural Machine Translation

Figure 2 for Optimizing Segmentation Granularity for Neural Machine Translation

Figure 3 for Optimizing Segmentation Granularity for Neural Machine Translation

Figure 4 for Optimizing Segmentation Granularity for Neural Machine Translation

Abstract:In neural machine translation (NMT), it is has become standard to translate using subword units to allow for an open vocabulary and improve accuracy on infrequent words. Byte-pair encoding (BPE) and its variants are the predominant approach to generating these subwords, as they are unsupervised, resource-free, and empirically effective. However, the granularity of these subword units is a hyperparameter to be tuned for each language and task, using methods such as grid search. Tuning may be done inexhaustively or skipped entirely due to resource constraints, leading to sub-optimal performance. In this paper, we propose a method to automatically tune this parameter using only one training pass. We incrementally introduce new vocabulary online based on the held-out validation loss, beginning with smaller, general subwords and adding larger, more specific units over the course of training. Our method matches the results found with grid search, optimizing segmentation granularity without any additional training time. We also show benefits in training efficiency and performance improvements for rare words due to the way embeddings for larger units are incrementally constructed by combining those from smaller units.

Via

Access Paper or Ask Questions