4.7 Article

Benchmarking network-based gene prioritization methods for cerebral small vessel disease

期刊

BRIEFINGS IN BIOINFORMATICS
卷 22, 期 5, 页码 -

出版社

OXFORD UNIV PRESS
DOI: 10.1093/bib/bbab006

关键词

network-based gene prioritization; cerebral small vessel disease; protein-protein interaction; disease gene association; benchmarking

资金

  1. Medical Research Council and Health Data Research UK [MR/S004149/1]
  2. Engineering and Physical Sciences Research Council
  3. Economic and Social Research Council
  4. National Institute for Health Research (England)
  5. Chief Scientist Office of the Scottish Government Health and Social Care Directorates
  6. Health and Social Care Research and Development Division (Welsh Government)
  7. Public Health Agency (Northern Ireland)
  8. British Heart Foundation
  9. Wellcome
  10. Industrial Strategy Challenge [MC_PC_18029]
  11. Wellcome Institutional Translation Partnership Award [PIII054]
  12. Medical Research Foundation
  13. MRC [MC_PC_18029, MR/J000914/1, MR/S004149/1, MR/S004122/1] Funding Source: UKRI

向作者/读者索取更多资源

Different algorithms for prioritizing disease-associated genes were benchmarked for application in cerebral small vessel disease, with random walk with restart on the heterogeneous network (RWRH) showing the best performance. Although RWRH had bias caused by degree centrality, it had overall better performance for cSVD. Current pure network-based gene prioritization algorithms may not discover novel disease-associated genes not linked to known ones.
Network-based gene prioritization algorithms are designed to prioritize disease-associated genes based on known ones using biological networks of protein interactions, gene-disease associations (GDAs) and other relationships between biological entities. Various algorithms have been developed based on different mechanisms, but it is not obvious which algorithm is optimal for a specific disease. To address this issue, we benchmarked multiple algorithms for their application in cerebral small vessel disease (cSVD). We curated protein-gene interactions (PGIs) and GDAs from databases and assembled PGI networks and disease-gene heterogeneous networks. A screening of algorithms resulted in seven representative algorithms to be benchmarked. Performance of algorithms was assessed using both leave-one-out cross-validation (LOOCV) and external validation with MEGASTROKE genome-wide association study (GWAS). We found that random walk with restart on the heterogeneous network (RWRH) showed best LOOCV performance, with median LOOCV rediscovery rank of 185.5 (out of 19 463 genes). The GenePanda algorithm had most GWAS-confirmable genes in top 200 predictions, while RWRH had best ranks for small vessel stroke-associated genes confirmed in GWAS. In conclusion, RWRH has overall better performance for application in cSVD despite its susceptibility to bias caused by degree centrality. Choice of algorithms should be determined before applying to specific disease. Current pure network-based gene prioritization algorithms are unlikely to find novel disease-associated genes that are not associated with known ones. The tools for implementing and benchmarking algorithms have been made available and can be generalized for other diseases.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据