☆ 4.7 Article

A hierarchical genetic fuzzy system based on genetic programming for addressing classification with highly imbalanced and borderline data-sets

KNOWLEDGE-BASED SYSTEMS (2013)

Journal

KNOWLEDGE-BASED SYSTEMS

Volume 38, Issue -, Pages 85-104

Publisher

ELSEVIER

DOI: 10.1016/j.knosys.2012.08.025

Keywords

Fuzzy rule based classification systems; Hierarchical fuzzy partitions; Genetic rule selection; Tuning; Imbalanced data-sets; Borderline examples

Funding

Spanish Ministry of Science and Technology [TIN2011-28488, TIN2008-06681-C06-02]
Andalusian Research Plan [P10-TIC-6858, TIC-3928]

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

Lots of real world applications appear to be a matter of classification with imbalanced data-sets. This problem arises when the number of instances from one class is quite different to the number of instances from the other class. Traditionally, classification algorithms are unable to correctly deal with this issue as they are biased towards the majority class. Therefore, algorithms tend to misclassify the minority class which usually is the most interesting one for the application that is being sorted out. Among the available learning approaches, fuzzy rule-based classification systems have obtained a good behavior in the scenario of imbalanced data-sets. In this work, we focus on some modifications to further improve the performance of these systems considering the usage of information granulation. Specifically, a positive synergy between data sampling methods and algorithmic modifications is proposed, creating a genetic programming approach that uses linguistic variables in a hierarchical way. These linguistic variables are adapted to the context of the problem with a genetic process that combines rule selection with the adjustment of the lateral position of the labels based on the 2-tuples linguistic model. An experimental study is carried out over highly imbalanced and borderline imbalanced data-sets which is completed by a statistical comparative analysis. The results obtained show that the proposed model outperforms several fuzzy rule based classification systems, including a hierarchical approach and presents a better behavior than the C4.5 decision tree. (c) 2012 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper

Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7

Not enough ratings

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Hierarchical belief rule-based model for imbalanced multi-classification

Guanxiang Hu, Wei He, Chao Sun, Hailong Zhu, Kangle Li, Li Jiang

Summary: Classification tasks are important in machine learning, but the problem of class imbalance can significantly affect classifier performance. This paper proposes a hierarchical belief rule-based system that integrates expert knowledge and utilizes extreme gradient boosting for feature selection to address class imbalance. By transforming multi-classification problems into binary classification problems and making precise predictions, class imbalance is alleviated.

EXPERT SYSTEMS WITH APPLICATIONS (2023)