4.7 Article

Evaluation of genome-wide association study results through development of ontology fingerprints

期刊

BIOINFORMATICS
卷 25, 期 10, 页码 1314-1320

出版社

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btp158

关键词

-

资金

  1. NCRR NIH HHS [P20 RR017696-076928, 5 P20 RR017696-05, P20 RR017696] Funding Source: Medline
  2. NHGRI NIH HHS [R01 HG000376, HG00376] Funding Source: Medline
  3. NIDDK NIH HHS [R56 DK062370, R01 DK062370, DK62370, U01 DK062370] Funding Source: Medline
  4. NLM NIH HHS [5-T15-LM007438-02, T15 LM007438] Funding Source: Medline

向作者/读者索取更多资源

Motivation: Genome-wide association (GWA) studies may identify multiple variants that are associated with a disease or trait. To narrow down candidates for further validation, quantitatively assessing how identified genes relate to a phenotype of interest is important. Results: We describe an approach to characterize genes or biological concepts (phenotypes, pathways, diseases, etc.) by ontology fingerprint-the set of Gene Ontology (GO) terms that are overrepresented among the PubMed abstracts discussing the gene or biological concept together with the enrichment p-value of these terms generated from a hypergeometric enrichment test. We then quantify the relevance of genes to the trait from a GWA study by calculating similarity scores between their ontology fingerprints using enrichment p-values. We validate this approach by correctly identifying corresponding genes for biological pathways with a 90% average area under the ROC curve (AUC). We applied this approach to rank genes identified through a GWA study that are associated with the lipid concentrations in plasma as well as to prioritize genes within linkage disequilibrium (LD) block. We found that the genes with highest scores were: ABCA1, lipoprotein lipase (LPL) and cholesterol ester transfer protein, plasma for high-density lipoprotein; low-density lipoprotein receptor, APOE and APOB for low-density lipoprotein; and LPL, APOA1 and APOB for triglyceride. In addition, we identified genes relevant to lipid metabolism from the literature even in cases where such knowledge was not reflected in current annotation of these genes. These results demonstrate that ontology fingerprints can be used effectively to prioritize genes from GWA studies for experimental validation.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据