Utilizing the World Wide Web as an Encyclopedia: Extracting Term Descriptions from Semi-Structured Texts

Nov 02, 2000
Atsushi Fujii, Tetsuya Ishikawa



In this paper, we propose a method to extract descriptions of technical terms from Web pages in order to utilize the World Wide Web as an encyclopedia. We use linguistic patterns and HTML text structures to extract text fragments containing term descriptions. We also use a language model to discard extraneous descriptions, and a clustering method to summarize resultant descriptions. We show the effectiveness of our method by way of experiments.

* Proceedings of the 38th Annual Meeting of the Association for Computational Linguistics (ACL-2000), pp. 488-495, Oct. 2000
* 8 pages, 2 PostScript figures

