
Multi-Paragraph Segmentation of Expository Text

Jun 23, 1994
Marti A. Hearst



This paper describes TextTiling, an algorithm for partitioning expository texts into coherent multi-paragraph discourse units which reflect the subtopic structure of the texts. The algorithm uses domain-independent lexical frequency and distribution information to recognize the interactions of multiple simultaneous themes. Two fully-implemented versions of the algorithm are described and shown to produce segmentation that corresponds well to human judgments of the major subtopic boundaries of thirteen lengthy texts.

* To appear in the ACL '94 Proceedings; 8 pages, PostScript format
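
The abstract says only that the algorithm relies on lexical frequency and distribution information, so the following is a minimal sketch of that general idea and not the paper's implementation. It assumes paragraphs as the unit of comparison, plain cosine similarity between term-frequency vectors, and a hypothetical `threshold` parameter; the function names are illustrative and do not come from the paper.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase word tokens; no stemming or stop-word removal (a simplification)."""
    return re.findall(r"[a-z]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two term-frequency Counters."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def gap_similarities(paragraphs):
    """Lexical similarity across each gap between adjacent paragraphs."""
    counts = [Counter(tokenize(p)) for p in paragraphs]
    return [cosine(counts[i], counts[i + 1]) for i in range(len(counts) - 1)]


def candidate_boundaries(paragraphs, threshold=0.1):
    """Return gap indices i (between paragraph i and i + 1) whose similarity is
    a local minimum and falls below `threshold` (an assumed, hypothetical cutoff)."""
    sims = gap_similarities(paragraphs)
    boundaries = []
    for i, s in enumerate(sims):
        # Treat the missing neighbour at either end as maximally similar.
        left = sims[i - 1] if i > 0 else 1.0
        right = sims[i + 1] if i + 1 < len(sims) else 1.0
        if s <= left and s <= right and s < threshold:
            boundaries.append(i)
    return boundaries


if __name__ == "__main__":
    paragraphs = [
        "cats purr cats sleep",
        "cats chase mice cats",
        "stars orbit galaxies",
        "galaxies hold stars",
    ]
    print(candidate_boundaries(paragraphs))
```

On the four toy paragraphs above, the only gap with no shared vocabulary is the one between the second and third paragraphs, so the sketch reports a single candidate boundary at gap index 1. A faithful implementation would follow the paper's own description of how text is divided into units and how boundaries are scored.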

