Article

A multi-objective evolutionary algorithm with interval based initialization and self-adaptive crossover operator for large-scale feature selection in classification

Journal

APPLIED SOFT COMPUTING
Volume 127, Issue -, Pages -

Publisher

ELSEVIER
DOI: 10.1016/j.asoc.2022.109420

Keywords

Classification; Large-scale feature selection; Self-adaptive; Initialization; Evolutionary algorithm; Multi-objective optimization; ReliefF

Funding

  1. National Natural Science Foundation of China [61876089, 61876185, 61902281]
  2. Natural Science Foundation, China [BK20141005]

Feature selection is an important technique in classification that can improve accuracy and reduce dimensionality. This paper proposes a multi-objective evolutionary algorithm, MOEA-ISa, with interval based initialization and self-adaptive crossover operator for large-scale feature selection. Experimental results show that MOEA-ISa outperforms other algorithms and the proposed methods effectively improve its performance.
Feature selection (FS) is an important data pre-processing technique in classification. In most cases, FS can improve classification accuracy and reduce feature dimension, so it can be regarded as a multi-objective optimization problem. Many evolutionary computation (EC) techniques have been applied to FS problems and have achieved good results. However, as data dimension increases, search difficulty also increases greatly, and EC algorithms with insufficient search ability are likely to find only sub-optimal solutions. Moreover, an improper initial population may negatively affect the convergence speed of the algorithm. To address these problems, this paper proposes MOEA-ISa: a multi-objective evolutionary algorithm with interval based initialization and self-adaptive crossover operator for large-scale FS. The proposed interval based initialization limits the number of selected features in each solution, which improves the distribution of the initial population in the target space and reduces the similarity of the initial population in the decision space. The proposed self-adaptive crossover operator determines the number of nonzero genes in offspring according to the similarity of the parents, and it uses the feature weights obtained by the ReliefF method to improve the quality of offspring. In the experiments, the proposed algorithm was compared with six other algorithms on 13 benchmark UCI datasets and two benchmark LIBSVM datasets, and an ablation experiment was performed on MOEA-ISa. The results show that MOEA-ISa performs better than the six other algorithms on large-scale FS problems, and that the proposed interval based initialization and self-adaptive crossover operator effectively improve the performance of MOEA-ISa. The source code of MOEA-ISa is available on GitHub at https://github.com/xueyunuist/MOEA-ISa. (c) 2022 Elsevier B.V. All rights reserved.
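The abstract describes the two proposed components only at a high level, so the following Python sketch is an illustration under stated assumptions and not the authors' implementation (the GitHub repository linked above holds that). The function names `interval_init` and `adaptive_crossover`, the equal-width split of subset sizes, the use of Jaccard similarity, and the rule for picking the offspring size are all hypothetical choices made for this example. The `weights` list stands in for precomputed ReliefF scores; ReliefF itself is not computed here.

```python
import random


def interval_init(pop_size, n_features, n_intervals, max_selected, seed=None):
    """Hypothetical interval-based initialization.

    Assumption: subset sizes 1..max_selected are split into n_intervals
    equal-width ranges, and individuals cycle through those ranges so the
    initial population covers small and large subsets alike.
    """
    if not 1 <= max_selected <= n_features:
        raise ValueError("max_selected must be in [1, n_features]")
    rng = random.Random(seed)
    width = max_selected / n_intervals
    population = []
    for i in range(pop_size):
        k = i % n_intervals                      # interval for this individual
        lo = max(1, int(k * width) + 1)          # smallest subset size allowed
        hi = max(lo, int((k + 1) * width))       # largest subset size allowed
        size = rng.randint(lo, hi)
        chosen = set(rng.sample(range(n_features), size))
        population.append([1 if j in chosen else 0 for j in range(n_features)])
    return population


def adaptive_crossover(p1, p2, weights, rng):
    """Hypothetical self-adaptive crossover.

    Assumption: the number of nonzero genes in the child grows from the
    smaller parent's size toward the size of the parents' union as the
    parents become less similar (Jaccard similarity). The child keeps the
    highest-weighted features from the union.
    """
    n = len(p1)
    s1 = {j for j in range(n) if p1[j]}
    s2 = {j for j in range(n) if p2[j]}
    union = s1 | s2
    if not union:                                # nothing to recombine
        return list(p1)
    similarity = len(s1 & s2) / len(union)
    lo, hi = min(len(s1), len(s2)), len(union)
    target = max(1, round(lo + (1.0 - similarity) * (hi - lo) * rng.random()))
    ranked = sorted(union, key=lambda j: weights[j], reverse=True)
    kept = set(ranked[:target])
    return [1 if j in kept else 0 for j in range(n)]
```

A call such as `interval_init(8, 50, 4, 20, seed=0)` would produce eight binary vectors of length 50 whose subset sizes fall in the ranges 1-5, 6-10, 11-15, and 16-20 in turn. Two of them can then be passed to `adaptive_crossover` together with a weight list of length 50 and a `random.Random` instance.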
