4.7 Article

Evolutionary undersampling boosting for imbalanced classification of breast cancer malignancy

期刊

APPLIED SOFT COMPUTING
卷 38, 期 -, 页码 714-726

出版社

ELSEVIER
DOI: 10.1016/j.asoc.2015.08.060

关键词

Machine Learning; Classifier ensemble; Imbalanced classification; Evolutionary algorithms; Clinical decision support; Breast cancer

资金

  1. Polish National Science Center [DEC-2013/09/B/ST6/02264]
  2. Spanish Ministry of Education and Science [TIN2013-40765-P, TIN2011-28488]
  3. Andalusian Research Plan [P10-TIC-6858, P11-TIC-7765]

向作者/读者索取更多资源

In this paper, we propose a complete, fully automatic and efficient clinical decision support system for breast cancer malignancy grading. The estimation of the level of a cancer malignancy is important to assess the degree of its progress and to elaborate a personalized therapy. Our system makes use of both Image Processing and Machine Learning techniques to perform the analysis of biopsy slides. Three different image segmentation methods (fuzzy c-means color segmentation, level set active contours technique and grey-level quantization method) are considered to extract the features used by the proposed classification system. In this classification problem, the highest malignancy grade is the most important to be detected early even though it occurs in the lowest number of cases, and hence the malignancy grading is an imbalanced classification problem. In order to overcome this difficulty, we propose the usage of an efficient ensemble classifier named EUSBoost, which combines a boosting scheme with evolutionary undersampling for producing balanced training sets for each one of the base classifiers in the final ensemble. The usage of the evolutionary approach allows us to select the most significant samples for the classifier learning step (in terms of accuracy and a new diversity term included in the fitness function), thus alleviating the problems produced by the imbalanced scenario in a guided and effective way. Experiments, carried on a large dataset collected by the authors, confirm the high efficiency of the proposed system, shows that level set active contours technique leads to an extraction of features with the highest discriminative power, and prove that EUSBoost is able to outperform state-of-the-art ensemble classifiers in a real-life imbalanced medical problem. (C) 2015 Elsevier B.V. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据