4.6 Article

Learning From Experts' Experience: Toward Automated Cyber Security Data Triage

Journal

IEEE SYSTEMS JOURNAL
Volume 13, Issue 1, Pages 603-614

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/JSYST.2018.2828832

Keywords

Automated system; cyber security analysis; data triage; knowledge elicitation; security operations center

Funding

  1. ARO [W911NF-15-1-0576, W911NF-13-1-0421]
  2. NSF [CNS-1422594]
  3. IUK Grant-in-Aid of Faculty Research and Summer Faculty Fellowship
  4. Directorate for Computer & Information Science & Engineering
  5. Division of Computer and Network Systems [1422594] Funding Source: National Science Foundation

Security operations centers (SOCs) employ various cyber defense measures to monitor network events. Beyond these measures, SOCs still rely on human analysts to make sense of the collected data for incident detection and response. However, because network data arrives and accumulates rapidly, analysts are usually overwhelmed by tedious, repetitive data triage tasks and can hardly concentrate on the in-depth analysis needed to produce timely, high-quality incident reports. This paper aims to reduce analysts' workloads by developing data triage automatons. We have developed a computer-aided tracing method that captures analysts' operations while they perform a task. This paper proposes a graph-based trace mining approach that constructs useful data triage patterns from the operation traces. Finite state machines can then be constructed from the rules derived from these patterns to automate data triage. A human-in-the-loop case study was conducted to evaluate the approach: 30 professional analysts were recruited and asked to complete a cyber-analysis task while their operations were traced. State machines were constructed from the traces, and both the effectiveness of constructing them and their performance were evaluated. The results show that automated data triage leveraging analysts' traces is feasible. The state machines can process a large amount of data within minutes. Comparing automated data triage against the ground truth, we found that a satisfactory false positive rate can be achieved.
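
To make the idea of a rule-driven state machine concrete, here is a minimal Python sketch. It is not the authors' implementation: the event fields (`type`, `src`), the three-step rule, and the `TriageFSM` class are all hypothetical, chosen only to show how an ordered rule could be replayed over a stream of events to flag related ones.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

# Hypothetical event record, e.g. {"type": "scan", "src": "10.0.0.5"}.
Event = Dict[str, str]


@dataclass
class TriageFSM:
    """One predicate per transition; the state is the number of steps matched so far."""
    steps: List[Callable[[Event], bool]]

    def run(self, events: List[Event]) -> List[List[Event]]:
        progress: Dict[str, List[Event]] = {}  # per-source partial matches
        flagged: List[List[Event]] = []
        for ev in events:
            src = ev["src"]
            matched = progress.setdefault(src, [])
            # Advance only if the event satisfies the next expected step.
            if self.steps[len(matched)](ev):
                matched.append(ev)
                if len(matched) == len(self.steps):
                    flagged.append(matched)  # accepting state reached
                    progress[src] = []       # reset this source
        return flagged


# Illustrative rule (an assumption, not taken from the paper):
# a scan, then a failed login, then a successful login from the same source.
rule = [
    lambda e: e["type"] == "scan",
    lambda e: e["type"] == "login_fail",
    lambda e: e["type"] == "login_ok",
]

events = [
    {"type": "scan", "src": "10.0.0.5"},
    {"type": "login_ok", "src": "10.0.0.9"},
    {"type": "login_fail", "src": "10.0.0.5"},
    {"type": "login_ok", "src": "10.0.0.5"},
]

flagged = TriageFSM(steps=rule).run(events)
```

In this sketch, events that do not match the next expected step are simply skipped. A real triage rule would likely also need time windows and richer constraints between events.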
