4.1 Review

Evaluation metrics and dimensional reduction for binary classification algorithms: a case study on bankruptcy prediction

Journal

KNOWLEDGE ENGINEERING REVIEW
Volume 37, Issue -, Pages -

Publisher

CAMBRIDGE UNIV PRESS
DOI: 10.1017/S026988892100014X

Keywords

-

Funding

  1. project INTELFIN: Artificial Intelligence for investment and value creation in SMEs through competitive analysis and business environment, - Ministry of Science, Innovation and Universities [RTC-2017-6536-]
  2. State Agency for Research (AEI)
  3. European Regional Development Fund (ERDF)

Ask authors/readers for more resources

This paper presents a methodology for automating binary classification using the minimum number of attributes. It evaluates the goodness of fit of an algorithm through evaluation metrics and compares performance of different models to obtain the optimal outcome.
This paper presents a methodology that permits to automate binary classification using the minimum possible number of attributes. In this methodology, the success of the binary prediction does not lie in the accuracy of an algorithm but in the evaluation metrics, which give information about the goodness of fit; which is an important factor when the data batch is unbalanced. The proposed methodology assesses the possible biases in identifying one algorithm as the best performer when considering the goodness of fit of an algorithm through evaluation metrics. The dimension of data has been reduced through the cumulative explained variance. Then, the performance of six machine learning classification models has been compared through Matthew correlation coefficient (MCC), area under curve - receiver operating characteristic (ROC-AUC), and area under curve - precision-recall (AUC-PR). The results show graphically and numerically how the evaluation metrics interfere with the most optimal outcome of an algorithm. The algorithms with the best performance in terms of evaluation metrics have been random forest and gradient boosting. In the imbalanced datasets, MCC has provided better prediction results than ROC-AUC or AUC-PR. The proposed methodology is adapted to the case of bankruptcy prediction.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.1
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available