Evaluation metrics and dimensional reduction for binary classification algorithms: a case study on bankruptcy prediction

KNOWLEDGE ENGINEERING REVIEW (2022)

Journal

KNOWLEDGE ENGINEERING REVIEW

Volume 37, Issue -, Pages -

Publisher

CAMBRIDGE UNIV PRESS

DOI: 10.1017/S026988892100014X

Funding

Project INTELFIN: Artificial Intelligence for investment and value creation in SMEs through competitive analysis and business environment, Ministry of Science, Innovation and Universities [RTC-2017-6536-]
State Agency for Research (AEI)
European Regional Development Fund (ERDF)

Abstract

This paper presents a methodology for automating binary classification using the minimum number of attributes. It evaluates the goodness of fit of an algorithm through evaluation metrics and compares performance of different models to obtain the optimal outcome.

This paper presents a methodology for automating binary classification using the minimum possible number of attributes. In this methodology, the success of a binary prediction is judged not by the accuracy of an algorithm but by evaluation metrics that describe its goodness of fit, which is an important factor when the dataset is imbalanced. The proposed methodology assesses the possible biases in identifying one algorithm as the best performer when goodness of fit is measured through evaluation metrics. The dimension of the data is reduced using the cumulative explained variance. The performance of six machine learning classification models is then compared using the Matthews correlation coefficient (MCC), the area under the receiver operating characteristic curve (ROC-AUC), and the area under the precision-recall curve (AUC-PR). The results show, graphically and numerically, how the choice of evaluation metric affects which algorithm appears optimal. The algorithms with the best performance in terms of evaluation metrics were random forest and gradient boosting. On the imbalanced datasets, MCC provided better prediction results than ROC-AUC or AUC-PR. The proposed methodology is applied to the case of bankruptcy prediction.
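Two ideas in the abstract can be illustrated with a minimal sketch: choosing how many components to keep from the cumulative explained variance, and how accuracy can mislead on imbalanced data where MCC does not. This is not the authors' implementation. The function names, the 0.9 variance threshold, the component variances, and the confusion-matrix counts are all hypothetical values chosen for illustration; the abstract does not report them.

```python
import math


def components_for_variance(variances, threshold=0.9):
    """Return the smallest number of leading components whose cumulative
    share of the total variance reaches `threshold` (assumed value)."""
    ordered = sorted(variances, reverse=True)
    total = sum(ordered)
    cumulative = 0.0
    for k, var in enumerate(ordered, start=1):
        cumulative += var
        if cumulative >= threshold * total:
            return k
    return len(ordered)


def accuracy(tp, fp, fn, tn):
    """Fraction of all predictions that are correct."""
    return (tp + tn) / (tp + fp + fn + tn)


def mcc(tp, fp, fn, tn):
    """Matthews correlation coefficient from confusion-matrix counts.
    Returns 0.0 when the denominator is zero (a common convention)."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return (tp * tn - fp * fn) / denom if denom else 0.0


# Hypothetical per-component variances (e.g. eigenvalues of a covariance matrix).
k = components_for_variance([4.0, 2.0, 1.0, 0.5, 0.5], threshold=0.9)

# Hypothetical imbalanced sample: 1000 firms, 50 of them bankrupt (positive class).
# Classifier A labels every firm as healthy; classifier B finds 30 of the 50.
clf_a = dict(tp=0, fp=0, fn=50, tn=950)
clf_b = dict(tp=30, fp=40, fn=20, tn=910)

print("components kept:", k)
for name, cm in (("A", clf_a), ("B", clf_b)):
    print(name, "accuracy:", round(accuracy(**cm), 3), "MCC:", round(mcc(**cm), 3))
```

In this made-up example, classifier A has the higher accuracy even though it never detects a bankruptcy, while MCC ranks classifier B higher. That is the kind of metric-dependent bias the abstract refers to.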
