4.7 Article

A Duplication Analysis-Based Evolutionary Algorithm for Biobjective Feature Selection

期刊

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TEVC.2020.3016049

关键词

Feature extraction; Optimization; Sociology; Statistics; Evolutionary computation; Measurement; Computer science; Classification; duplication analysis; evolutionary algorithm (EA); feature selection; multiobjective optimization

资金

  1. Marsden Fund of New Zealand [VUW1509, VUW1615]
  2. Educational Department Research Project of Fujian Province [JAT190594]
  3. University Research Fund of Putian University [2019002]
  4. Science for Technological Innovation Challenge Fund [E3603/2903]
  5. University Research Fund at Victoria University of Wellington [216378/3764, 223805/3986]
  6. MBIE Data Science SSIF Fund [RTVU1914]
  7. National Natural Science Foundation of China [61876169]

向作者/读者索取更多资源

This paper proposes a duplication analysis-based EA (DAEA) for biobjective feature selection in classification and compares its performance with five state-of-the-art multiobjective EAs (MOEAs) on 20 classification datasets. The empirical results show that DAEA performs the best on most datasets, achieving outstanding optimization performance as well as good classification and generalization results.
Feature selection is a complex optimization problem with important real-world applications. Normally, its main target is to reduce the dimensionality of the dataset and increase the effectiveness of the classification. Owing to the population-inspired characteristics, different evolutionary algorithms (EAs) have been proposed to solve feature selection problems over the past decades. However, the majority of them only consider single-objective optimization while many real-world problems have multiple objectives, which creates a genuine demand for designing more suitable and effective EAs to handle multiobjective feature selection. A multiobjective feature selection problem usually consists of two objectives: one is to minimize the number of selected features and the other is to minimize the error of classification. In this article, we propose a duplication analysis-based EA (DAEA) for biobjective feature selection in classification. In the proposed algorithm, we make improvements on the basic dominance-based EA framework in three aspects: first, the reproduction process is modified to improve the quality of offspring; second, a duplication analysis method is proposed to filter out the redundant solutions; and third, a diversity-based selection method is adopted to further select the reserved solutions. In the experiments, we have compared the proposed algorithm with five state-of-the-art multiobjective EAs (MOEAs) and tested them on 20 classification datasets, using two widely used performance metrics. According to the empirical results, DAEA performs the best on most datasets, indicating that DAEA not only gains outstanding optimization performance but also obtains good classification and generalization results.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据