Based on the Tsinghua Chinese Treebank, this paper constructs syntax trees with a multi-branch (n-ary) structure. It then carries out a quantitative statistical analysis of the noun phrases in the treebank from four perspectives: internal structure, internal part-of-speech sequence, external syntactic function, and left and right boundary features. The results can supply more comprehensive linguistic knowledge and rules for the automatic recognition of noun phrases, and the approach can serve as a reference for recognizing other phrase structures. Ultimately, the work provides data support for syntactic analysis and semantic analysis in natural language processing.
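The four kinds of noun phrase statistics named in the abstract can be illustrated with a minimal Python sketch. It is not the paper's implementation: the bracketed input format, the tag names (`S`, `NP`, `VP`, `a`, `n`, ...), the example sentence, and the helpers `Node`, `parse`, `leaves`, and `np_statistics` are all assumptions made for illustration, and the real treebank uses its own annotation scheme. The sketch builds an n-ary tree and, for every noun phrase node, counts its child-label sequence (internal structure), the part-of-speech tags it spans, its parent label (a rough stand-in for external syntactic function), and the tags immediately to its left and right (boundary features).

```python
from collections import Counter
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Node:
    """One node of an n-ary syntax tree (hypothetical structure)."""
    label: str                              # phrase tag or POS tag
    word: Optional[str] = None              # set only on leaves
    children: List["Node"] = field(default_factory=list)


def parse(text: str) -> Node:
    """Parse a toy bracketed tree such as "(S (NP (n 树)) (VP (v 倒)))"."""
    tokens = text.replace("(", " ( ").replace(")", " ) ").split()
    pos = 0

    def read() -> Node:
        nonlocal pos
        assert tokens[pos] == "(", "expected an opening bracket"
        pos += 1
        node = Node(tokens[pos])
        pos += 1
        while tokens[pos] != ")":
            if tokens[pos] == "(":
                node.children.append(read())   # any number of children
            else:
                node.word = tokens[pos]        # leaf: the surface word
                pos += 1
        pos += 1                               # consume the closing bracket
        return node

    return read()


def leaves(node: Node) -> List[Node]:
    """Return the leaf nodes under `node`, left to right."""
    if not node.children:
        return [node]
    return [leaf for child in node.children for leaf in leaves(child)]


def np_statistics(root: Node, np_label: str = "NP") -> dict:
    """Count four illustrative feature types for every NP in one tree."""
    sentence = [leaf.label for leaf in leaves(root)]   # POS tag per word
    keys = ("structure", "pos_sequence", "parent", "left", "right")
    stats = {key: Counter() for key in keys}

    def visit(node: Node, parent: Optional[Node], start: int) -> int:
        # `start` is the index of the node's first word; the return value
        # is the index just past its last word.
        if not node.children:
            return start + 1
        end = start
        for child in node.children:
            end = visit(child, node, end)
        if node.label == np_label:
            stats["structure"][" ".join(c.label for c in node.children)] += 1
            stats["pos_sequence"][" ".join(sentence[start:end])] += 1
            stats["parent"][parent.label if parent else "ROOT"] += 1
            stats["left"][sentence[start - 1] if start > 0 else "<s>"] += 1
            stats["right"][sentence[end] if end < len(sentence) else "</s>"] += 1
        return end

    visit(root, None, 0)
    return stats


# Invented example sentence with a three-child (n-ary) noun phrase.
tree = parse("(S (NP (NP (a 大) (n 树)) (c 和) (n 草)) (VP (d 都) (v 绿) (u 了)))")
stats = np_statistics(tree)
for name, counter in stats.items():
    print(name, dict(counter))
```

Summing these counters over every tree in a corpus would give frequency tables of the kind the abstract describes. Using the parent label for external function is a simplification chosen here, since a real treebank may encode function tags directly on the phrase.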