4.6 Article

An Evolutionary Multitasking-Based Feature Selection Method for High-Dimensional Classification

Journal

IEEE TRANSACTIONS ON CYBERNETICS
Volume 52, Issue 7, Pages 7172-7186

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TCYB.2020.3042243

Keywords

Task analysis; Multitasking; Optimization; Search problems; Statistics; Sociology; Knowledge transfer; Evolutionary multitasking; feature selection (FS); high-dimensional classification; particle swarm optimization (PSO)

Funding

  1. National Key Research and Development Program of China [2017YFB1302400]
  2. National Natural Science Foundation of China [61773242, 61803227, 61876169]
  3. Major Agricultural Applied Technological Innovation Projects of Shandong Province [SD2019NJ014]
  4. Intelligent Robot and System Innovation Center Foundation [2019IRS19]
  5. Marsden Fund of New Zealand Government [VUW1509, VUW1615]
  6. Science for Technological Innovation Challenge (SfTI) [E3603/2903]
  7. MBIE Data Science SSIF Fund [RTVU1914]
  8. China Scholarship Council [201806220187]
  9. University Research Fund at Victoria University of Wellington [223805/3986]

Ask authors/readers for more resources

This study proposes a novel PSO-based feature selection method to solve high-dimensional classification problems through information sharing between two related tasks, achieving higher classification accuracy in a faster time than existing methods.
Feature selection (FS) is an important data preprocessing technique in data mining and machine learning, which aims to select a small subset of information features to increase the performance and reduce the dimensionality. Particle swarm optimization (PSO) has been successfully applied to FS due to being efficient and easy to implement. However, most of the existing PSO-based FS methods face the problems of trapping into local optima and computationally expensive high-dimensional data. Multifactorial optimization (MFO), as an effective evolutionary multitasking paradigm, has been widely used for solving complex problems through implicit knowledge transfer between related tasks. Inspired by MFO, this study proposes a novel PSO-based FS method to solve high-dimensional classification via information sharing between two related tasks generated from a dataset. To be specific, two related tasks about the target concept are established by evaluating the importance of features. A new crossover operator, called assortative mating, is applied to share information between these two related tasks. In addition, two mechanisms, which are variable-range strategy and subset updating mechanism, are also developed to reduce the search space and maintain the diversity of the population, respectively. The results show that the proposed FS method can achieve higher classification accuracy with a smaller feature subset in a reasonable time than the state-of-the-art FS methods on the examined high-dimensional classification problems.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available