4.7 Review

A review of machine learning approaches to Spam filtering

期刊

EXPERT SYSTEMS WITH APPLICATIONS
卷 36, 期 7, 页码 10206-10222

出版社

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.eswa.2009.02.037

关键词

Spam filtering; Online learning; Bag-of-words (BoW); Naive Bayes; Image spam

资金

  1. Bolsa Pesquisa program [20060519110414a]
  2. FAPEMIG
  3. CNPq

向作者/读者索取更多资源

In this paper, we present a comprehensive review of recent developments in the application of machine learning algorithms to Spam filtering, focusing on both textual- and image-based approaches. Instead of considering Spam filtering as a standard classification problem, we highlight the importance of considering specific characteristics of the problem, especially concept drift, in designing new filters. Two particularly important aspects not widely recognized in the literature are discussed: the difficulties in updating a classifier based on the bag-of-words representation and a major difference between two early naive Bayes models. Overall, we conclude that while important advancements have been made in the last years, several aspects remain to be explored, especially under more realistic evaluation settings. (C) 2009 Elsevier Ltd. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据