Identification of gene-environment barcodes for complex human diseases

来源 :第五届全国生物信息学与系统生物学学术大会 | 被引量 : 0次 | 上传用户:lijb2009
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
  Background: It is widely recognized that the molecular etiology of complex human diseases is very sophisticated, involving a large number of genes, gene-gene and geneenvironment interactions.Deciphering the underlying high order of multiple susceptible genes or genetic barcodes for complex diseases is always the great hope in biomedical domains, which has important implications in both early molecular diagnosis and personalized medicine.The aim of this study was to assess the potential of high-dimension SNP data (and environmental factors) to be used to generate the multi-factorial patterns for accurately partitioning human populations.Materials and Methods: Two datasets of case-control for two cancers were analyzed.The data for nasopharyngeal carcinoma (NPC) contained 676 SNPs and 12 environmental variables, while the data for coronary heart disease (CHD), provided by The Wellcome Trust Case Control Consortium (WTCCC), contained only genome-wide SNPs.To reduce the following computational burden, Chi-square association analysis of single SNP loci was first performed to reduce the SNP data.Then, genetic algorithm (GA) and probabilistic neural network (PNN) were integrated to be used to identify the multi-dimension patterns of SNPs (and environmental factors).ROC curve was also used to assess their performances for partitioning human populations.Finally, The pathway analysis software, Pathway Assist, was used to explore the biological functions that the genetic barcodes were involved.Results: For NPC data, a barcode composing of 14 SNPs (in 10 genes) and four environmental factors was identified, with an accuracy of 78.49% and Youden index 52.98% for distinguishing between NPC patients and health subjects.And for CHD data, 6 barcodes (with no more than 100 SNPs in each barcode) were identified, with accuracies of all >89% and AUC (area under the ROC curve) of all >0.85.The functional analysis of these barcodes demonstrated that these high-dimension barcodes were of sounding biological significance related to the two diseases.Conclusions: This study suggests that the proposed integrated approach is promising to be used for identifying gene-environment barcodes for complex human diseases .
其他文献
会议
1991年12月15日,被誉为“国之光荣”的上海秦山核电站,经过成千上万名科研人员、建设工人的努力,终于正式并网发电了。从此,中国大陆无核电的历史宣告结束,我国能源建设翻开新的一
过剩经济与开拓潜在市场 当前我国经济局部面临着严重的生产过剩的现象,而且已从消费品市场延伸到基础产品领域。钢铁、煤炭、化工等基础产业均出现严重的生产过剩,已实施了
现今,网络游戏走入人们日常生活,给人们的休闲生活又提供了一种新的选择.网络游戏这一行业逐渐形成规模.但是网络游戏的流行在催生游戏产业迅猛发展的进程中,出现了一个毒害
九景公路是江西省利用亚行贷款的第一条高速公路,项目合同金额19.6亿元人民币,其中亚行贷款1.09亿美元。为保证亚行贷款资金投向及建设速度,九江支局采取了三项措施:方便企业使用资金,特许
【本刊讯】全国房地产及房改工作座谈会于3月1日,在天津召开。此次会议的主要议题是:贯彻落实中央经济工作会议与全国建设工作会议的精神,交流总结一年来各地在住宅建设店地产业
“柳枝”折曲法,借名于小儿骨外伤受压弯曲而不折断的医学术语“柳枝骨折”。在盆景树桩的小枝条上,既用轻刀,又用力压折,使枝条有折曲而不会断离的拿弯方法,谓之“柳枝”式
(2012年7月3日)今天,2012年全国方志期刊工作座谈会在美丽的泉城济南市召开了,这次会议由中国地方志指导小组办公室主办,济南市史志办公室承办。方志期刊工作座谈会每两年召
  Background: As renewable energy, microbial fuel cells (MFCs) are gaining increasing concern and have been developed well for its peculiar advantages.Up to n