Combining Trigram and Automatic Weight Distribution in Chinese Spelling Error Correction

来源 :计算机科学技术学报 | 被引量 : 0次 | 上传用户:xiameng
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
The researches on spelling correction aiming at detecting errors in texts tendto focus on context-sensitive spelling error correction, which is more difficult than traditionalisolated-word error correction. A novel and efficient algorithm for the system of Chinese spellingerror correction, CInsunSpell, is presented. In this system, the work of correction includes twoparts: checking phase and correcting phase. At the first phase, a Trigram algorithm within onefixed-size window is designed to locate potential errors in local area. The second phase employsa new method of automatically and dynamically distributing weights among the characters inthe confusion set as well as in the Bayesian language model. The tactics used above exhibitsgood performances.
其他文献
Polyether-tailored phosphite modified rhodium complex formed in situ was highly active in the hydroformylation of oleyl alcohol in nonaqueous phosphite/heptane
A simple method is applied to calculating the isotope shifts (ISs) on 5S1/2 → 4D3/2,5/2 transitions of 87,88Sr+. First we have calculated the ISs of lower tran
This paper describes the potential of heterogeneous catalytic ozonization of sulfo-salicylic acid (SSal). It was found that catalytic ozonization in the presenc
A new bibenzyl derivative, 3,4-dihydroxy-4(,5-dimethoxy bibenzyl, was isolated from a orchid Dendrobium moniliforme. The structure elucidation and 1H,13C NMR as
Two new isobutyltartrate monoesters, coelovirin A (1) and B (2), were isolated from the rhizomes of Coeloglossum viride (L.) Hartm. var. bracteatum (Willd.) Ric
Molybdenum(Ⅰ)-compound [Mo2(SC6H11)2(CO)8] 1, crystallizes in monoclinic, space group P21/c with a = 9.5863(9), b = 9.4469(9), c = 13.869(1) (A), β= 99.697(2)
In this paper we propose a novel scheme for scheduling divisible task onparallel processors connected by system interconnection network with arbitrary topology.
We have investigated the low-lying collective states and electromagnetic transitions in 94Mo within the framework of the interacting boson model. The influence
Electrostatic layer-by-layer self-assembly multilayer films were successfully fabricated from C60-ethylenediamineadduct (C60-EDA) and DNA. Under visible light i
This paper presents an automatic mesh generation procedure on a 2D domainbased on a regular background grid. The idea is to devise a robust mesh generation sche