论文部分内容阅读
歧义问题是自动分词系统中要解决的主要问题之一。本文介绍一种新的汉语分词方法,它利用所建立的歧义二叉树,得到多种切分可能,通过分析歧义字段的特性,再结合规则处理和统计模型进行汉语分词。
Ambiguity is one of the main problems to be solved in the automatic word segmentation system. This paper introduces a new Chinese word segmentation method, which uses the ambiguous binary tree to establish multiple segmentation possibilities. By analyzing the characteristics of the ambiguous fields, the Chinese word segmentation is combined with the rule processing and statistical models.