4.5 Article

Min-redundancy and max-relevance multi-view feature selection for predicting ovarian cancer survival using multi-omics data

期刊

BMC MEDICAL GENOMICS
卷 11, 期 -, 页码 -

出版社

BMC
DOI: 10.1186/s12920-018-0388-0

关键词

Multi-omics data integration; Multi-view feature selection; Cancer survival prediction; Machine learning

资金

  1. National Institutes of Health [NCATS UL1 TR000127, NCATS TR002014, NIGMS P50GM115318, NLM R01 NL012535, NLM T32LM012415]
  2. Pennsylvania Department of Health [SAP 4100070267]
  3. National Science Foundation [IIS 1636795]
  4. Edward Frymoyer Endowed Professorship in Information Sciences and Technology at Pennsylvania State University
  5. Sudha Murty Distinguished Visiting Chair in Neurocomputing and Data Science at the Indian Institute of Science
  6. Pennsylvania State University Center for Big Data Analytics and Discovery Informatics (CBDADI)
  7. Institute for Cyberscience
  8. Social Science Research Institute at the university
  9. Huck Institutes of the Life Sciences

向作者/读者索取更多资源

Background: Large-scale collaborative precision medicine initiatives (e.g., The Cancer Genome Atlas (TCGA)) are yielding rich multi-omics data. Integrative analyses of the resulting multi-omics data, such as somatic mutation, copy number alteration (CNA), DNA methylation, miRNA, gene expression, and protein expression, offer tantalizing possibilities for realizing the promise and potential of precision medicine in cancer prevention, diagnosis, and treatment by substantially improving our understanding of underlying mechanisms as well as the discovery of novel biomarkers for different types of cancers. However, such analyses present a number of challenges, including heterogeneity, and high-dimensionality of omics data. Methods: We propose a novel framework for multi-omics data integration using multi-view feature selection. We introduce a novel multi-view feature selection algorithm, MRMR-mv, an adaptation of the well-known Min-Redundancy and Maximum-Relevance (MRMR) single-view feature selection algorithm to the multi-view setting. Results: We report results of experiments using an ovarian cancer multi-omics dataset derived from the TCGA database on the task of predicting ovarian cancer survival. Our results suggest that multi-view models outperform both view-specific models (i.e., models trained and tested using a single type of omics data) and models based on two baseline data fusion methods. Conclusions: Our results demonstrate the potential of multi-view feature selection in integrative analyses and predictive modeling from multi-omics data.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据