Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Pavlina Fragkou

Text Segmentation using Named Entity Recognition and Co-reference Resolution in English and Greek Texts

Oct 28, 2016

Pavlina Fragkou

Figure 1 for Text Segmentation using Named Entity Recognition and Co-reference Resolution in English and Greek Texts

Figure 2 for Text Segmentation using Named Entity Recognition and Co-reference Resolution in English and Greek Texts

Figure 3 for Text Segmentation using Named Entity Recognition and Co-reference Resolution in English and Greek Texts

Figure 4 for Text Segmentation using Named Entity Recognition and Co-reference Resolution in English and Greek Texts

Abstract:In this paper we examine the benefit of performing named entity recognition (NER) and co-reference resolution to an English and a Greek corpus used for text segmentation. The aim here is to examine whether the combination of text segmentation and information extraction can be beneficial for the identification of the various topics that appear in a document. NER was performed manually in the English corpus and was compared with the output produced by publicly available annotation tools while, an already existing tool was used for the Greek corpus. Produced annotations from both corpora were manually corrected and enriched to cover four types of named entities. Co-reference resolution i.e., substitution of every reference of the same instance with the same named entity identifier was subsequently performed. The evaluation, using five text segmentation algorithms for the English corpus and four for the Greek corpus leads to the conclusion that, the benefit highly depends on the segment's topic, the number of named entity instances appearing in it, as well as the segment's length.

* 32 pages. arXiv admin note: text overlap with arXiv:1308.0661, arXiv:1204.2847 by other authors

Via

Access Paper or Ask Questions

A Dynamic Programming Algorithm for the Segmentation of Greek Texts

Oct 21, 2003

Pavlina Fragkou

Figure 1 for A Dynamic Programming Algorithm for the Segmentation of Greek Texts

Figure 2 for A Dynamic Programming Algorithm for the Segmentation of Greek Texts

Figure 3 for A Dynamic Programming Algorithm for the Segmentation of Greek Texts

Figure 4 for A Dynamic Programming Algorithm for the Segmentation of Greek Texts

Abstract:In this paper we introduce a dynamic programming algorithm to perform linear text segmentation by global minimization of a segmentation cost function which consists of: (a) within-segment word similarity and (b) prior information about segment length. The evaluation of the segmentation accuracy of the algorithm on a text collection consisting of Greek texts showed that the algorithm achieves high segmentation accuracy and appears to be very innovating and promissing.

* This paper will appear in the Proceedings of the CONSOLE XII Conference (Patras, Greece, 2003)

Via

Access Paper or Ask Questions