Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Word Sense Disambiguation by Web Mining for Word Co-occurrence Probabilities

Jul 29, 2004

Peter D. Turney

Figure 1 for Word Sense Disambiguation by Web Mining for Word Co-occurrence Probabilities

Figure 2 for Word Sense Disambiguation by Web Mining for Word Co-occurrence Probabilities

Share this with someone who'll enjoy it:

Abstract:This paper describes the National Research Council (NRC) Word Sense Disambiguation (WSD) system, as applied to the English Lexical Sample (ELS) task in Senseval-3. The NRC system approaches WSD as a classical supervised machine learning problem, using familiar tools such as the Weka machine learning software and Brill's rule-based part-of-speech tagger. Head words are represented as feature vectors with several hundred features. Approximately half of the features are syntactic and the other half are semantic. The main novelty in the system is the method for generating the semantic features, based on word \hbox{co-occurrence} probabilities. The probabilities are estimated using the Waterloo MultiText System with a corpus of about one terabyte of unlabeled text, collected by a web crawler.

* Proceedings of the Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text (SENSEVAL-3), (2004), Barcelona, Spain, 239-242 * related work available at http://purl.org/peter.turney/

View paper on

Share this with someone who'll enjoy it:

Title:Word Sense Disambiguation by Web Mining for Word Co-occurrence Probabilities

Paper and Code