4.6 Article

Classifier selection and clustering with fuzzy assignment in ensemble model for credit scoring

Journal

NEUROCOMPUTING
Volume 316, Issue -, Pages 210-221

Publisher

ELSEVIER
DOI: 10.1016/j.neucom.2018.07.070

Keywords

Credit scoring; Ensemble learning; Clustering; Genetic algorithm; Fuzzy assignment

Funding

  1. China National Nature Science Foundation [51375429, 51475410]
  2. Zhejiang Nature Science Foundation of China [LY17E050010]

Ask authors/readers for more resources

With the development of statistical methods and machine learning algorithms, credit scoring is no longer a task merely based on experience. From single base classifiers to ensemble classifiers and hybrid models, researches have been focusing on combining classifiers and hybridizing with artificial intelligence algorithms to improve performance of the models. Ensemble classifiers have been proven to have a better predictive accuracy than single classifiers, but the method of ensemble affects performance and is worth studying. This study is based on the ensemble of five of the most widely recognized base classifiers in credit scoring, i.e. logistic regression, support vector machine, neural network, gradient boosting decision tree and random forest. It proposes a new method of selecting classifiers using Genetic Algorithm after they are trained, considering both the accuracy and diversity of the ensemble. Besides, unsupervised clustering is integrated with a fuzzy assignment procedure in the model, to make more use of the data pattern and improve performance. The proposed CF-GA-Ens model is tested on three credit scoring datasets (Australian, German, Japanese) and three performance measures (accuracy, AUC, F-score), and the results show that our classifier selection and clustering procedures have a positive impact on all performance measures. (C) 2018 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available