☆ 4.5 Article

An Efficient Intrusion Detection Method Based on LightGBM and Autoencoder

SYMMETRY-BASEL (2020)

Journal

SYMMETRY-BASEL

Volume 12, Issue 9, Pages -

Publisher

MDPI

DOI: 10.3390/sym12091458

Keywords

intrusion detection; LightGBM; feature selection; autoencoder; classification; deep learning

Funding

National Natural Science Foundation of China (NSFC) [61433012]
Innovation Environment Construction Special Project of Xinjiang Uygur Autonomous Region [PT1811]

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

Due to the insidious characteristics of network intrusion behaviors, developing an efficient intrusion detection system is still a big challenge, especially in the era of big data where the number of traffic and the dimension of each traffic feature are high. Because of the shortcomings of traditional common machine learning algorithms in network intrusion detection, such as insufficient accuracy, a network intrusion detection system based on LightGBM and autoencoder (AE) is proposed. The LightGBM-AE model proposed in this paper includes three steps: data preprocessing, feature selection, and classification. The LightGBM-AE model adopts the LightGBM algorithm for feature selection, and then uses an autoencoder for training and detection. When a set of data containing network intrusion behaviors are inputted into an autoencoder, there is a large reconstruction error between the original input data and the reconstructed data obtained by the autoencoder, which provides a basis for intrusion detection. According to the reconstruction error, an appropriate threshold is set to distinguish symmetrically between normal behavior and attack behavior. The experiment is carried out on the NSL-KDD dataset and implemented using Pytorch. In addition to autoencoder, variational autoencoder (VAE) and denoising autoencoder (DAE) are also used for intrusion detection and are compared with existing machine learning algorithms such as Decision Tree, Random Forest, KNN, GBDT, and XGBoost. The evaluation is carried out through classification evaluation indexes such as accuracy, precision, recall, F1-score. The experimental results show that the method can efficiently separate the attack behavior from normal behavior according to the reconstruction error. Compared with other methods, the effectiveness and superiority of this method are verified.

Authors

I am an author on this paper

Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5

Not enough ratings

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

A hybrid Intrusion Detection System based on Sparse autoencoder and Deep Neural Network

K. Narayana Rao, K. Venkata Rao, P. V. G. D. Prasad Reddy

Summary: The study found that machine learning has shown good results in intrusion detection systems. The two-stage hybrid methodology proposed by the authors significantly improves the detection of attacks, especially achieving excellent accuracy and detection rates on the UNSW-NB15 dataset.

COMPUTER COMMUNICATIONS (2021)