4.7 Article

Ensemble-based hybrid probabilistic sampling for imbalanced data learning in lung nodule CAD

Journal

COMPUTERIZED MEDICAL IMAGING AND GRAPHICS
Volume 38, Issue 3, Pages 137-150

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.compmedimag.2013.12.003

Keywords

Lung nodule detection; False positive reduction; Imbalanced data learning; Ensemble classifier; Re-sampling; Random subspace method

Funding

  1. Alberta Innovates Centre for Machine Learning
  2. National Natural Science Foundation of China [61001047]
  3. China Scholarship Council

Ask authors/readers for more resources

Classification plays a critical role in false positive reduction (FPR) in lung nodule computer aided detection (CAD). The difficulty of FPR lies in the variation of the appearances of the nodules, and the imbalance distribution between the nodule and non-nodule class. Moreover, the presence of inherent complex structures in data distribution, such as within-class imbalance and high-dimensionality are other critical factors of decreasing classification performance. To solve these challenges, we proposed a hybrid probabilistic sampling combined with diverse random subspace ensemble. Experimental results demonstrate the effectiveness of the proposed method in terms of geometric mean (G-mean) and area under the ROC curve (AUC) compared with commonly used methods. (C) 2013 Elsevier Ltd. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available