4.5 Article

Joint neighborhood entropy-based gene selection method with fisher score for tumor classification

期刊

APPLIED INTELLIGENCE
卷 49, 期 4, 页码 1245-1259

出版社

SPRINGER
DOI: 10.1007/s10489-018-1320-1

关键词

Rough sets; Neighborhood rough sets; Gene selection; Neighborhood entropy; Tumor classification

资金

  1. National Natural Science Foundation of China [61772176, 61402153, 61672332, 61370169, 61472042]
  2. China Postdoctoral Science Foundation [2016M602247]
  3. Plan for Scientific Innovation Talent of Henan Province [184100510003]
  4. Key Project of Science and Technology Department of Henan Province [182102210362]
  5. Young Scholar Program of Henan Province [2017GGJS041]
  6. Key Scientific and Technological Project of Xinxiang City [CXGG17002]
  7. Ph.D. Research Foundation of Henan Normal University [qd15132, qd15129]

向作者/读者索取更多资源

Tumor classification is one of the most vital technologies for cancer diagnosis. Due to the high dimensionality, gene selection (finding a small, closely related gene set to accurately classify tumor) is an important step for improving gene expression data classification performance. Traditional rough set model as a classical attribute reduction method deals with discrete data only. As for the gene expression data containing real-value or noisy data, they are usually employed by a discrete preprocessing, which may result in poor classification accuracy. In this paper, a novel neighborhood rough sets and entropy measure-based gene selection with Fisher score for tumor classification is proposed, which has the ability of dealing with real-value data whilst maintaining the original gene classification information. First, the Fisher score method is employed to eliminate irrelevant genes to significantly reduce computation complexity. Next, some neighborhood entropy-based uncertainty measures are investigated for handling the uncertainty and noisy of gene expression data. Moreover, some of their properties are derived and the relationships among these measures are established. Finally, a joint neighborhood entropy-based gene selection algorithm with the Fisher score is presented to improve the classification performance of gene expression data. The experimental results under an instance and several public gene expression data sets prove that the proposed method is very effective for selecting the most relevant genes with high classification accuracy.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据