ZHANG, Dong (2013). An evolutionary approach to automatic Chinese text segmentation. In: Natural Computation (ICNC), 2013 Ninth International Conference on, Shenyang, China, 23-25 July 2013. 771-776. [Conference or Workshop Item]
Abstract
Textual information written in Chinese now represents a huge knowledge repository. The first step of managing and processing information in written Chinese text is segmentation. A new method for automatic Chinese text segmentation using evolutionary algorithms and Web search statistical data is outlined. This proposed method considers Web text a de facto corpus that updates automatically, thus eliminating the need for statistics training. It treats the segmentation as a process that finds out the best probability of how individual characters are combined into sentences, paragraphs, and articles, thus producing segmentation results that are tailored to the text in question and are independent of segmentation standards.
More Information
Metrics
Altmetric Badge
Dimensions Badge
Share
Actions (login required)
View Item |