4.6 Article

A hybrid method based on ensemble WELM for handling multi class imbalance in cancer microarray data

期刊

NEUROCOMPUTING
卷 266, 期 -, 页码 641-650

出版社

ELSEVIER
DOI: 10.1016/j.neucom.2017.05.066

关键词

Multi class imbalance; Extreme learning machine; High dimension; Feature selection; Ensemble learning

资金

  1. Natural Science Foundation of China [61501128]
  2. school of Medical Information Engineering, Guangdong Pharmaceutical University
  3. Guangdong Provincial Natural fund [2014A030313585, 2016A030310300, 2015A030310483]

向作者/读者索取更多资源

DNA microarray technology provides an efficient way to diagnose cancer. However, microarray gene expression data face the challenges of class imbalance and high dimension. The class imbalance problem usually leads to inaccurate results when using traditional feature selection and classification algorithms. Due to fast learning speed and good classification performance, extreme learning machine (ELM) has become one of the best classification algorithms and weighted ELM has been recently presented to deal with the class imbalance. However, they ignored the negative impact of imbalanced feature set. This paper proposes a hybrid method based on WELM to handle the multi class imbalance problem of cancer microarray data at both feature and algorithmic levels. At feature level, a corrected feature subset is searched for each class using class oriented feature selection method, so that the features correlated with the minority class are explicitly selected. At algorithmic level, WELM is further modified to strengthen the input nodes with high discrimination power, and an ensemble model is trained to improve the generalization. That is, multiple modified WELM models are trained on the datasets characterized by different feature subsets; in order to encourage the ensemble diversity, the models with low dissimilarity are removed and the reserved ones are combined as an ensemble model. The experiments are conducted on eight gene expression datasets with multiple cancer types and classification results show that our method significantly outperforms ELM and several recent works. (C) 2017 Elsevier B.V. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据