4.2 Article

SPAM DETECTION USING DATA COMPRESSION AND SIGNATURES

Journal

CYBERNETICS AND SYSTEMS
Volume 44, Issue 6-7, Pages 533-549

Publisher

TAYLOR & FRANCIS INC
DOI: 10.1080/01969722.2013.805110

Keywords

Bayesian filter; data compression; e-mail; S-tree; signatures; spam

Funding

  1. Grant Agency of the Czech Republic [P202/11/P142]
  2. European Regional Development Fund in the IT4Innovations Centre of Excellence project [CZ.1.05/1.1.00/02.0070]
  3. Bio-Inspired Methods: Research, development and knowledge transfer project [CZ.1.07/2.3.00/20.0073]
  4. Operational Programme Education for Competitiveness
  5. ESF
  6. state budget of the Czech Republic

Ask authors/readers for more resources

In this article, we introduce a novel method for spam detection based on a combination of Bayesian filtering, signature trees, and data compression-based similarity. Bayesian filtering is one of the most popular and most efficient algorithms for dealing with spam detection. The problem with Bayesian filtering is that it is unable to classify any e-mail without doubt and sometimes spam e-mails are classified as regular e-mails. This novel method sorts out this problem by using signature trees and data compression-based similarity. The main result of this article is an up to 99% improvement in spam detection precision using this novel method.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.2
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available