☆ 4.7 Article

CoBAn: A context based model for data leakage prevention

INFORMATION SCIENCES (2014)

Journal

INFORMATION SCIENCES

Volume 262, Issue -, Pages 137-158

Publisher

ELSEVIER SCIENCE INC

DOI: 10.1016/j.ins.2013.10.005

Keywords

Information leakage; Security; Context

Funding

ISF of the Israeli Ministry of Science Technology [1116/12, 150378-0-0553]

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

A new context-based model (CoBAn) for accidental and intentional data leakage prevention (DLP) is proposed. Existing methods attempt to prevent data leakage by either looking for specific keywords and phrases or by using various statistical methods. Keyword-based methods are not sufficiently accurate since they ignore the context of the keyword, while statistical methods ignore the content of the analyzed text. The context-based approach we propose leverages the advantages of both these approaches. The new model consists of two phases: training and detection. During the training phase, clusters of documents are generated and a graph representation of the confidential content of each cluster is created. This representation consists of key terms and the context in which they need to appear in order to be considered confidential. During the detection phase, each tested document is assigned to several clusters and its contents are then matched to each cluster's respective graph in an attempt to determine the confidentiality of the document. Extensive experiments have shown that the model is superior to other methods in detecting leakage attempts, where the confidential information is rephrased or is different from the original examples provided in the learning set. (C) 2013 Elsevier Inc. All rights reserved.

CoBAn: A context based model for data leakage prevention

Journal

INFORMATION SCIENCES

Publisher

ELSEVIER SCIENCE INC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

CoBAn: A context based model for data leakage prevention

Journal

INFORMATION SCIENCES

Publisher

ELSEVIER SCIENCE INC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper