piEnPred:a bi-layered discriminative model for enhancers and their subtypes via novel cascade multi-

来源 :计算机科学前沿 | 被引量 : 0次 | 上传用户:liongliong506
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
Enhancers are short DNA cis-elements that can be bound by proteins(activators)to increase the possibility that transcription of a particular gene will occur.The Enhancers per-form a significant role in the formation of proteins and regulat-ing the gene transcription process.Human diseases such as can-cer,inflammatory bowel disease,Parkinson\'s,addiction,and schizophrenia are due to genetic variation in enhancers.In the current study,we have made an effort by building,a more robust and novel computational a bi-layered model.The representative feature vector was constructed over a linear combination of six features.The optimum Hybrid feature vector was obtained via the Novel Cascade Multi-Level Subset Feature selection(CM-SFS)algorithm.The first layer predicts the enhancer,and the secondary layer carries the prediction of their subtypes.The baseline model obtained 87.88%of accuracy,95.29%of sen-sitivity,80.47%of specificity,0.766 of MCC,and 0.9603 of a roc value on Layer-1.Similarly,the model obtained 68.24%,65.54%,70.95%,0.3654,and 0.7568 as an Accuracy,sensitiv-ity,specificity,MCC,and ROC values on layer-2 respectively.Over an independent dataset on layer-1,the piEnPred secured 80.4%accuracy,82.5%of sensitivity,78.4%of specificity,and 0.6099 as MCC,respectively.Subsequently,the proposed pre-dictor obtained 72.5%of accuracy,70.0%of sensitivity,75%of specificity,and 0.4506 of MCC on layer-2,respectively.The proposed method remarkably performed in contrast to other state-of-the-art predictors.For the convenience of most exper-imental scientists,a user-friendly and publicly freely accessi-ble web server@/bienhancer dot pythonany where dot com/has been developed.
其他文献
针对虚拟发电厂中常规分布式电源、传统热电联产机组的碳排放特性,基于风电机组出力与用户热能需求的互补性,提出在虚拟发电厂中配置电锅炉以实现虚拟发电厂的碳中和.分析了风电机组、微型电热联产机组和电锅炉三者间的出力关系;建立了计及虚拟发电厂碳排放成本、虚拟发电厂运行成本和交易电量成本的虚拟发电厂电热联合运行模型,并利用量子进化算法对其进行了求解;计算了3种典型场景下的机组的出力、交易电量、碳排放总量.仿真算例表明,电锅炉联合风电供热能够显著降低虚拟发电厂的整体碳排放和运行成本.
为提升电力系统短期负荷预测精度,提出粒子群算法优化量子加权门控循环单元神经网络模型.首先,将量子加权神经元融入门控循环单元神经网络中,构建量子加权门控循环单元神经网络预测模型,利用量子信息处理机制,提高该神经网络的非线性逼近能力和泛化能力.然后,使用全局优化能力较强的改进粒子群优化算法对所提出模型的参数进行寻优,构建权重矩阵进行负荷预测.最后,通过实际电网算例进行仿真,仿真结果表明,本文提出的粒子群优化量子加权门控循环单元神经网络预测模型的预测精度较高.
Machine learning(ML)techniques and algorithms have been successfully and widely used in various areas includ-ing software engineering tasks.Like other software projects,bugs are also common in ML projects and libraries.In order to more deeply understand t
Closely related to the safety and stability of power grids,stability analysis has long been a core topic in the elec-tric industry.Conventional approaches employ computational simulation to make the quantitative judgement of the grid sta-bility under dist
Reinforcement learning is about learning agent models that make the best sequential decisions in unknown en-vironments.In an unknown environment,the agent needs to explore the environment while exploiting the collected infor-mation,which usually forms a s
The data stream processing framework processes the stream data based on event-time to ensure that the request can be responded to in real-time.In reality,streaming data usu-ally arrives out-of-order due to factors such as network delay.The data stream pro
Lysine lipoylation plays vital roles in cell metabolism and redox processes.For example,removal of lipoylation will decrease pyruvate dehydrogenase activity and affect the citric acid cycle.Despite the important functions of lysine lipoylation,the mechani
Many key-value stores use RDMA to optimize the messaging and data transmission between application layer and the storage layer,most of which only provide point-wise op-erations.Skiplist-based store can support both point operations and range queries,but i
Node order is one of the most important factors in learning the structure of a Bayesian network(BN)for proba-bilistic reasoning.To improve the BN structure learning,we propose a node order learning algorithm based on the frequently used Bayesian informati
The design of three novel fatty nitrogen mustard-based anticancer agents with fluorophores incor-porated into the alkene structure(CXL 118,CXL121,and CXL122)is described in this report.The results indicated that these compounds are selectively located in