4.5 Article

Gradient Boosting Feature Selection With Machine Learning Classifiers for Intrusion Detection on Power Grids

Journal

IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT
Volume 18, Issue 1, Pages 1104-1116

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TNSM.2020.3032618

Keywords

Power grids; Feature extraction; Boosting; Intrusion detection; Cyberattack; SCADA systems; power grids; random forest; gradient boosting; feature selection; cyber security; network intrusions

Funding

  1. Natural Sciences and Engineering Research Council (NSERC), Canada through a Collaborative Research Grant

Ask authors/readers for more resources

Smart grids rely on SCADA systems for monitoring and controlling electrical networks, but face cybersecurity risks. We propose an IDS framework for smart grids that uses feature engineering and machine learning classifiers to improve detection rate and execution speed.
Smart grids rely on SCADA (Supervisory Control and Data Acquisition) systems to monitor and control complex electrical networks in order to provide reliable energy to homes and industries. However, the increased inter-connectivity and remote accessibility of SCADA systems expose them to cyber attacks. As a consequence, developing effective security mechanisms is a priority in order to protect the network from internal and external attacks. We propose an integrated framework for an Intrusion Detection System (IDS) for smart grids which combines feature engineering-based preprocessing with machine learning classifiers. Whilst most of the machine learning techniques fine-tune the hyper-parameters to improve the detection rate, our approach focuses on selecting the most promising features of the dataset using Gradient Boosting Feature Selection (GBFS) before applying the classification algorithm, a combination which improves not only the detection rate but also the execution speed. GBFS uses the Weighted Feature Importance (WFI) extraction technique to reduce the complexity of classifiers. We implement and evaluate various decision-tree based machine learning techniques after obtaining the most promising features of the power grid dataset through a GBFS module, and show that this approach optimizes the False Positive Rate (FPR) and the execution time.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available