Prediction of eukaryotic gene structures based on multilevel optimization

Source: 科学通报 (Chinese Science Bulletin)
Computational gene structure prediction, which is valuable for finding new genes and understanding the composition of genomes, plays an important role in genome projects. For eukaryotic gene structures, however, the prediction accuracy of existing methods is still limited. This paper presents a method for predicting eukaryotic gene structures based on multilevel optimization. The complicated problem of predicting gene structures in a eukaryotic DNA sequence containing multiple genes is decomposed into a series of sub-problems at several levels of decreasing complexity: the gene level (single-exon gene, multi-exon gene), the element level (exon, intron, etc.), and the feature level (functional site signals, codon usage preference, etc.). On the basis of this decomposition, a multilevel model for predicting complex gene structures is built by a multilevel optimization process: the models for sub-problems at lower complexity levels are first optimized individually and then optimally combined to form models for sub-problems at higher complexity levels. Based on the multilevel model, a dynamic programming algorithm is designed to search for optimal gene structures in DNA sequences, and a new program, GeneKey (1.0), is developed for the prediction of eukaryotic gene structures. Test results on widely used datasets show that the prediction accuracies of GeneKey (1.0) at the nucleotide, exon, and gene levels are all higher than those of the well-known program GENSCAN. A web server for GeneKey (1.0) is available at http://infosci.hust.edu.cn
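The abstract does not give the details of GeneKey's dynamic programming algorithm. To illustrate the general idea of searching for an optimal gene structure by dynamic programming, the following is a minimal sketch of a much simpler, related technique: chaining non-overlapping candidate exons so that their total score is maximal. Everything in it is an assumption made for illustration: the function name `best_exon_chain`, the `(start, end, score)` representation with inclusive integer coordinates, and the scores themselves, which in a real predictor would come from lower-level models such as splice-site signals and codon usage.

```python
from bisect import bisect_right
from typing import List, Tuple

# Hypothetical representation: (start, end, score), integer coordinates, end inclusive.
Exon = Tuple[int, int, float]


def best_exon_chain(candidates: List[Exon]) -> Tuple[float, List[Exon]]:
    """Pick non-overlapping candidate exons with maximal total score.

    Illustrative exon chaining only; this is not GeneKey's algorithm.
    """
    exons = sorted(candidates, key=lambda e: e[1])  # order by end position
    ends = [e[1] for e in exons]
    n = len(exons)

    best = [0.0] * (n + 1)   # best[i]: best total score using the first i exons
    take = [False] * (n + 1)  # take[i]: whether exon i is used in best[i]
    prev = [0] * (n + 1)      # prev[i]: how many earlier exons end before exon i starts

    for i, (start, _end, score) in enumerate(exons, start=1):
        # Count exons among the first i-1 that end strictly before this one starts.
        prev[i] = bisect_right(ends, start - 1, 0, i - 1)
        with_i = best[prev[i]] + score
        if with_i > best[i - 1]:
            best[i] = with_i
            take[i] = True
        else:
            best[i] = best[i - 1]

    # Trace back the chosen exons from the end of the table.
    chain: List[Exon] = []
    i = n
    while i > 0:
        if take[i]:
            chain.append(exons[i - 1])
            i = prev[i]
        else:
            i -= 1
    chain.reverse()
    return best[n], chain


if __name__ == "__main__":
    # Made-up candidates purely for demonstration.
    candidates = [(0, 99, 5.0), (50, 149, 6.0), (150, 249, 4.0), (100, 199, 3.0)]
    total, chain = best_exon_chain(candidates)
    print(total, chain)
```

A full gene-structure model would additionally have to enforce reading-frame consistency across introns, handle both strands, and distinguish single-exon from multi-exon genes; the sketch omits all of these.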