4.7 Article

An extensive analysis of disease-gene associations using network integration and fast kernel-based gene prioritization methods

期刊

ARTIFICIAL INTELLIGENCE IN MEDICINE
卷 61, 期 2, 页码 63-78

出版社

ELSEVIER
DOI: 10.1016/j.artmed.2014.03.003

关键词

Gene disease prioritization; Network integration; Heterogeneous data fusion; MeSH descriptors; Node label ranking

资金

  1. PASCAL2 Network of Excellence under EC Grant [216886]
  2. PRIN project Automi e linguaggi formali: aspetti matematici e applicativi
  3. Italian Ministry of University
  4. Biotechnology and Biological Sciences Research Council (BBSRC) [BB/K004131/1]
  5. Biotechnology and Biological Sciences Research Council [BB/K004131/1, BB/F00964X/1] Funding Source: researchfish
  6. BBSRC [BB/K004131/1, BB/F00964X/1] Funding Source: UKRI

向作者/读者索取更多资源

Objective: In the context of network medicine, gene prioritization methods represent one of the main tools to discover candidate disease genes by exploiting the large amount of data covering different types of functional relationships between genes. Several works proposed to integrate multiple sources of data to improve disease gene prioritization, but to our knowledge no systematic studies focused on the quantitative evaluation of the impact of network integration on gene prioritization. In this paper, we aim at providing an extensive analysis of gene-disease associations not limited to genetic disorders, and a systematic comparison of different network integration methods for gene prioritization. Materials and methods: We collected nine different functional networks representing different functional relationships between genes, and we combined them through both unweighted and weighted network integration methods. We then prioritized genes with respect to each of the considered 708 medical subject headings (MeSH) diseases by applying classical guilt-by-association, random walk and random walk with restart algorithms, and the recently proposed kernelized score functions. Results: The results obtained with classical random walk algorithms and the best single network achieved an average area under the curve (AUC) across the 708 MeSH diseases of about 0.82, while kernelized score functions and network integration boosted the average AUC to about 0.89. Weighted integration, by exploiting the different informativeness embedded in different functional networks, outperforms unweighted integration at 0.01 significance level, according to the Wilcoxon signed rank sum test. For each MeSH disease we provide the top-ranked unannotated candidate genes, available for further biomedical investigation. Conclusions: Network integration is necessary to boost the performances of gene prioritization methods. Moreover the methods based on kernelized score functions can further enhance disease gene ranking results, by adopting both local and global learning strategies, able to exploit the overall topology of the network. (C) 2014 The Authors. Published by Elsevier B.V.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据