Get our free extension to see links to code for papers anywhere online!


A Stochastic Finite-State Word-Segmentation Algorithm for Chinese

Add code

May 05, 1994
Richard Sproat, Chilin Shih, William Gale, Nancy Chang


Share this with someone who'll enjoy it:


We present a stochastic finite-state model for segmenting Chinese text into dictionary entries and productively derived words, and providing pronunciations for these words; the method incorporates a class-based model in its treatment of personal names. We also evaluate the system's performance, taking into account the fact that people often do not agree on a single segmentation.

* in Proceedings of ACL 94 
* To appear in Proceedings of ACL-94 


   Access Paper Source



Share this with someone who'll enjoy it: