4.4 Article

SKICA: A feature extraction algorithm based on supervised ICA with kernel for anomaly detection

Journal

JOURNAL OF INTELLIGENT & FUZZY SYSTEMS
Volume 36, Issue 1, Pages 761-773

Publisher

IOS PRESS
DOI: 10.3233/JIFS-17749

Keywords

Feature extraction; anomaly detection; independent component analysis (ICA); supervised; kernel method

Funding

  1. National Natural Science Foundation of China [61272399, 61572090]
  2. Chongqing Research Program of Basic Research and Frontier Technology [cstc2015jcyjBX0014, cstc2016jcyjA0304]
  3. Scientific and Technological Research Program of Chongqing Municipal Education Commission [KJ1600521]

Ask authors/readers for more resources

Feature extraction is an important preprocessing step in many research areas. For anomaly detection, the purpose of feature extraction lies in not only extracting the most important features hidden in the datasets, but also discriminating different classes of samples. The latter is usually referred to as discriminative ability. The data collected from production systems usually do not follow Gaussian distribution. They may correspond to nonlinear mixture of independent components. In order to cope with non-Gaussian data and implement nonlinear feature extraction, this article proposes a feature extraction algorithm based on Supervised Independent Component Analysis with Kernel (termed SKICA). SKICA first adopts Kernel Principle Component Analysis (KPCA) to whiten the datasets. Further, by virtue of the within-cluster scatter matrix derived from Linear Discriminate Analysis (LDA), SKICA extends Independent Component Analysis (ICA) to supervised situation by introducing within-cluster information into solving independent components. The latter improvement makes SKICA obtain the independent components more beneficial to separating different classes of samples. In order to quantitatively measure discriminative ability of the feature extraction algorithms involved in experiments, this article defines three kinds of average square distance. This article conducts experiments on artificial datasets, Cloud datasets, and KDD Cup datasets to evaluate the effectiveness of SKICA. The experimental results show that SKICA outperforms several popular supervised feature extraction algorithms, including LDA, LDA with kernel (KDA), and supervised ICA (SICA).

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.4
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available