4.6 Article

Combining Unsupervised Approaches for Near Real-Time Network Traffic Anomaly Detection

期刊

APPLIED SCIENCES-BASEL
卷 12, 期 3, 页码 -

出版社

MDPI
DOI: 10.3390/app12031759

关键词

unsupervised machine learning; anomaly detection; near real-time; network traffic; explainable artificial intelligence; SHAP

资金

  1. Fondo Europeo di Sviluppo Regionale Puglia Programma Operativo Regionale (POR) Puglia 2014-2020-Axis I-Specific Objective 1a-Action 1.1 (Research and Development)-Project Title: CyberSecurity and Security Operation Center (SOC) Product Suite by BV TECH S.p [CUP/CIG B93G18000040007]
  2. [2014-2020-Axis I-Specific Objective 1a-Action 1.1]

向作者/读者索取更多资源

The 0-day attack is a cyber-attack that exploits unpublished vulnerabilities. Detecting and predicting such attacks is crucial for smart enterprises and technology-dependent systems. Unsupervised machine learning methods are effective in identifying anomalies in real-time. The addition of Isolation Forest improves accuracy and inference time. The study also uses SHAP to identify important features for classifying attack events. Experiments were conducted on multiple datasets.
The 0-day attack is a cyber-attack based on vulnerabilities that have not yet been published. The detection of anomalous traffic generated by such attacks is vital, as it can represent a critical problem, both in a technical and economic sense, for a smart enterprise as for any system largely dependent on technology. To predict this kind of attack, one solution can be to use unsupervised machine learning approaches, as they guarantee the detection of anomalies regardless of their prior knowledge. It is also essential to identify the anomalous and unknown behaviors that occur within a network in near real-time. Three different approaches have been proposed and benchmarked in exactly the same condition: Deep Autoencoding with GMM and Isolation Forest, Deep Autoencoder with Isolation Forest, and Memory Augmented Deep Autoencoder with Isolation Forest. These approaches are thus the result of combining different unsupervised algorithms. The results show that the addition of the Isolation Forest improves the accuracy values and increases the inference time, although this increase does not represent a relevant problematic factor. This paper also explains the features that the various models consider most important for classifying an event as an attack using the explainable artificial intelligence methodology called Shapley Additive Explanations (SHAP). Experiments were conducted on KDD99, NSL-KDD, and CIC-IDS2017 datasets.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据