4.7 Article

Adaptive Data Structure Regularized Multiclass Discriminative Feature Selection

期刊

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TNNLS.2021.3071603

关键词

Data structures; Optimization; Noise measurement; Manifolds; Laplace equations; Feature extraction; Training data; Regularized discriminant analysis; semisupervised feature selection (SSFS); sparse data representation

资金

  1. National Natural Science Foundation of China [61922064, U2033210, 61772373]
  2. Zhejiang Provincial Natural Science Foundation [LR17F030001, LQ19F020005]
  3. Project of Science and Technology Plans of Wenzhou City [C20170008]

向作者/读者索取更多资源

Feature selection (FS) is an important approach to dimensionality reduction that aims to identify the most informative subset of input features. This article proposes a novel FS framework that combines data structure learning and FS to select the most discriminative and informative features. A new semisupervised FS (SSFS) method derived from this framework is demonstrated to be effective through experiments on real-world data sets.
Feature selection (FS), which aims to identify the most informative subset of input features, is an important approach to dimensionality reduction. In this article, a novel FS framework is proposed for both unsupervised and semisupervised scenarios. To make efficient use of data distribution to evaluate features, the framework combines data structure learning (as referred to as data distribution modeling) and FS in a unified formulation such that the data structure learning improves the results of FS and vice versa. Moreover, two types of data structures, namely the soft and hard data structures, are learned and used in the proposed FS framework. The soft data structure refers to the pairwise weights among data samples, and the hard data structure refers to the estimated labels obtained from clustering or semisupervised classification. Both of these data structures are naturally formulated as regularization terms in the proposed framework. In the optimization process, the soft and hard data structures are learned from data represented by the selected features, and then, the most informative features are reselected by referring to the data structures. In this way, the framework uses the interactions between data structure learning and FS to select the most discriminative and informative features. Following the proposed framework, a new semisupervised FS (SSFS) method is derived and studied in depth. Experiments on real-world data sets demonstrate the effectiveness of the proposed method.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据