4.7 Article

Optimising anti-spam filters with evolutionary algorithms

期刊

EXPERT SYSTEMS WITH APPLICATIONS
卷 40, 期 10, 页码 4010-4021

出版社

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.eswa.2013.01.008

关键词

Anti-spam filters; Multiobjective optimisation; Evolutionary computation; Genetic algorithms

向作者/读者索取更多资源

This work is devoted to the problem of optimising scores for anti-spam filters, which is essential for the accuracy of any filter based anti-spam system, and is also one of the biggest challenges in this research area. In particular, this optimisation problem is considered from two different points of view: single and multiobjective problem formulations. Some of existing approaches within both formulations are surveyed, and their advantages and disadvantages are discussed. Two most popular evolutionary multiobjective algorithms and one single objective algorithm are adapted to optimisation of the anti-spam filters' scores and compared on publicly available datasets widely used for benchmarking purposes. This comparison is discussed, and the recommendations for the developers and users of optimising anti-spam filters are provided. (C) 2013 Elsevier Ltd. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Computer Science, Information Systems

Improvements for research data repositories: The case of text spam

Ismael Vazquez, Maria Novo-Loures, Reyes Pavon, Rosalia Laza, Jose Ramon Mendez, David Ruano-Ordas

Summary: Current research requires adequate description of algorithms and results, as well as reproducibility and comparison with other approximations. Public data sets are crucial for experimental protocols, but have limitations. Enhancements are needed for data repositories, such as customized data sets, comparison of techniques with different pre-processing methods, availability of software applications for reproducing pre-processing steps, and protection of licensing issues and user rights.

JOURNAL OF INFORMATION SCIENCE (2023)

Article Computer Science, Hardware & Architecture

Queue-priority optimized algorithm: a novel task scheduling for runtime systems of application integration platforms

Daniela L. Freire, Rafael Z. Frantz, Fabricia Roos-Frantz, Vitor Basto-Fernandes

Summary: Enterprises are increasingly integrating applications and services in business processes due to the advancement of cloud and mobile applications. Integration tools need to adapt to handle high volumes of data and utilize cloud computational resources efficiently to avoid increasing operational costs. The proposed Queue-priority algorithm with Particle Swarm Optimization shows promise in improving task scheduling for integration processes with high volumes of data.

JOURNAL OF SUPERCOMPUTING (2022)

Article Computer Science, Hardware & Architecture

Task scheduling characterisation in enterprise application integration

Daniela L. Freire, Rafael Z. Frantz, Fabricia Roos-Frantz, Vitor Basto-Fernandes

Summary: Cloud computing enables enterprises to integrate applications and computational resources, with integration platforms reducing maintenance costs. Improvements in task scheduling are needed due to high-performance demands. This study explores the specifics and vulnerabilities of integration task scheduling, proposing a ranking method based on conceptual models to optimize performance and minimize costs for companies.

JOURNAL OF SUPERCOMPUTING (2022)

Article Computer Science, Hardware & Architecture

Selecting services in the cloud: a decision support methodology focused on infrastructure-as-a-service context

Cassio L. M. Belusso, Sandro Sawicki, Vitor Basto-Fernandes, Rafael Z. Frantz, Fabricia Roos-Frantz

Summary: Growing demand for reduced local hardware infrastructure is driving the adoption of Cloud Computing. In this study, the researchers propose D-AHP, a decision-making methodology based on Pareto Dominance and Analytic Hierarchy Process (AHP), to assist users in selecting virtual machine instances. The study finds that considering the datacenter location as a criterion for instance selection can lead to different results on which instance is more suitable for the user.

JOURNAL OF SUPERCOMPUTING (2022)

Article Computer Science, Information Systems

Enhancing representation in the context of multiple-channel spam filtering

Maria Novo-Loures, David Ruano-Ordas, Reyes Pavon, Rosalia Laza, Silvana Gomez-Meire, Jose R. Mendez

Summary: This study examines the use of various features to complement synset-based and bag-of-words representations of texts for spam filtering using classical ML approaches. The evaluation of features across different channels and classifiers demonstrates the effectiveness of detecting non-textual entities and using language probability information for classification improvement. Additionally, features influenced by specific behaviors of Internet service users are found to be not useful for spam filtering.

INFORMATION PROCESSING & MANAGEMENT (2022)

Article Computer Science, Interdisciplinary Applications

Integration process simulator: A tool for performance evaluation of task scheduling of integration processes

Daniela L. Freire, Rafael Z. Frantz, Fabricia Roos-Frantz, Vitor Basto-Fernandes

Summary: The article proposes a simulation tool for task scheduling in Enterprise Application Integration, evaluating the performance of three different heuristics through statistical tests.

JOURNAL OF SIMULATION (2022)

Article Chemistry, Multidisciplinary

Using Live Spam Beater (LiSB) Framework for Spam Filtering during SMTP Transactions

Silvana Gomez-Meire, Cesar Gabriel Marquez, Eliana Patricia Aray-Cappello, Jose R. Mendez

Summary: This study introduces the Live Spam Beater (LiSB) framework for executing email filtering techniques during SMTP transactions, aiming to improve the effectiveness and efficiency of existing proactive filtering mechanisms. By implementing proactive filtering schemes, senders can be notified of spam emails through SMTP response codes during the transaction process. The framework, written in Python, acts as an MTA server and reverse proxy for SMTP, allowing easy incorporation of new proactive filtering techniques through plugins.

APPLIED SCIENCES-BASEL (2022)

Article Computer Science, Information Systems

Automatic knowledge exchange between ontologies and semantic graphs

Gabriela del Rocio Roldan-Molina, Iryna Yevseyeva, Silvana Gomez-Meire, Vitor Basto-Fernandes, Jose Ramon Mendez

Summary: This article presents an innovative approach for knowledge management using OWL format to enhance the quality of automatically created ontologies and assist domain experts in improving the target ontology. The method is validated using Babelnet semantic graph and a manually generated ontology, and its effectiveness is demonstrated by comparing the improved target ontology with the original version.

JOURNAL OF INFORMATION SCIENCE (2022)

Article Computer Science, Information Systems

Integrate d cyb ersecurity methodology and supporting tools for healthcare operational information systems

Bruno Coutinho, Joao Ferreira, Iryna Yevseyeva, Vitor Basto-Fernandes

Summary: The recent increase in cybersecurity threats and cyberattacks has impacted organizations at different levels, including critical business processes and the overall survival of the organizations. This study focuses on the cybersecurity of the healthcare sector, which is considered critical national infrastructure in many countries. An integrated cybersecurity methodology and supporting tools called Cyber.SCuris are proposed to complement existing cyber threat intelligence and incident response standards such as NIST SP800.

COMPUTERS & SECURITY (2023)

Article Computer Science, Artificial Intelligence

Multi-objective evolutionary optimization for dimensionality reduction of texts represented by synsets

Inaki Velez de Mendizabal, Vitor Basto-Fernandes, Enaitz Ezpeleta, Jose R. Mendez, Silvana Gomez-Meire, Urko Zurutuza

Summary: By using synset-based grouping methods, both lossless and low-loss reduction strategies can be implemented to address linguistic phenomena and improve the efficiency of spam filtering classifiers.

PEERJ COMPUTER SCIENCE (2023)

Article Forestry

Alnus Airborne Pollen Trends during the Last 26 Years for Improving Machine Learning-Based Forecasting Methods

Maria Novo-Loures, Maria Fernandez-Gonzalez, Reyes Pavon, Kenia C. Sanchez Espinosa, Rosalia Laza, Guillermo Guada, Jose R. Mendez, Florentino Fdez-Riverola, Francisco Javier Rodriguez-Rajo

Summary: Black alder, a widely distributed tree species in Europe, especially along watercourses in urban areas, has a high allergenic potential. This study proposes the use of a Machine Learning model to accurately predict high-risk periods for allergies among sensitive people, considering the increased allergenicity caused by global temperature rise induced by climate change. The Random Forest model was found to be the best performing model in detecting medium and high-risk days.

FORESTS (2023)

Article Computer Science, Artificial Intelligence

Improving Pipelining Tools for Pre-processing Data

Maria Novo-Loures, Yeray Lage, Reyes Pavon, Rosalia Laza, David Ruano-Ordas, Jose Ramon Mendez

Summary: Data mining has become a powerful tool for exploring unseen connections between variables and facts in different domains. However, current data analysis frameworks, specifically those using pipelining schemes, lack early error detection techniques and developer support mechanisms. In this study, a new pipelining framework, BDP4J, is introduced with improved features to address these limitations.

INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE (2022)

Review Computer Science, Artificial Intelligence

A comprehensive review of slope stability analysis based on artificial intelligence methods

Wei Gao, Shuangshuang Ge

Summary: This study provides a comprehensive review of slope stability research based on artificial intelligence methods, focusing on slope stability computation and evaluation. The review covers studies using quasi-physical intelligence methods, simulated evolutionary methods, swarm intelligence methods, hybrid intelligence methods, artificial neural network methods, vector machine methods, and other intelligence methods. The merits, demerits, and state-of-the-art research advancement of these studies are analyzed, and possible research directions for slope stability investigation based on artificial intelligence methods are suggested.

EXPERT SYSTEMS WITH APPLICATIONS (2024)

Article Computer Science, Artificial Intelligence

Machine learning approaches for lateral strength estimation in squat shear walls: A comparative study and practical implications

Khuong Le Nguyen, Hoa Thi Trinh, Saeed Banihashemi, Thong M. Pham

Summary: This study investigated the influence of input parameters on the shear strength of RC squat walls and found that ensemble learning models, particularly XGBoost, can effectively predict the shear strength. The axial load had a greater influence than reinforcement ratio, and longitudinal reinforcement had a more significant impact compared to horizontal and vertical reinforcement. The performance of XGBoost model outperforms traditional design models and reducing input features still yields reliable predictions.

EXPERT SYSTEMS WITH APPLICATIONS (2024)

Article Computer Science, Artificial Intelligence

DHESN: A deep hierarchical echo state network approach for algal bloom prediction

Bo Hu, Huiyan Zhang, Xiaoyi Wang, Li Wang, Jiping Xu, Qian Sun, Zhiyao Zhao, Lei Zhang

Summary: A deep hierarchical echo state network (DHESN) is proposed to address the limitations of shallow coupled structures. By using transfer entropy, candidate variables with strong causal relationships are selected and a hierarchical reservoir structure is established to improve prediction accuracy. Simulation results demonstrate that DHESN performs well in predicting algal bloom.

EXPERT SYSTEMS WITH APPLICATIONS (2024)

Article Computer Science, Artificial Intelligence

Learning high-dependence Bayesian network classifier with robust topology

Limin Wang, Lingling Li, Qilong Li, Kuo Li

Summary: This paper discusses the urgency of learning complex multivariate probability distributions due to the increase in data variability and quantity. It introduces a highly scalable classifier called TAN, which utilizes maximum weighted spanning tree (MWST) for graphical modeling. The paper theoretically proves the feasibility of extending one-dependence MWST to model high-dependence relationships and proposes a heuristic search strategy to improve the fitness of the extended topology to data. Experimental results demonstrate that this algorithm achieves a good bias-variance tradeoff and competitive classification performance compared to other high-dependence or ensemble learning algorithms.

EXPERT SYSTEMS WITH APPLICATIONS (2024)

Article Computer Science, Artificial Intelligence

Make a song curative: A spatio-temporal therapeutic music transfer model for anxiety reduction

Zhejing Hu, Gong Chen, Yan Liu, Xiao Ma, Nianhong Guan, Xiaoying Wang

Summary: Anxiety is a prevalent issue and music therapy has been found effective in reducing anxiety. To meet the diverse needs of individuals, a novel model called the spatio-temporal therapeutic music transfer model (StTMTM) is proposed.

EXPERT SYSTEMS WITH APPLICATIONS (2024)

Article Computer Science, Artificial Intelligence

A modified reverse-based analysis logic mining model with Weighted Random 2 Satisfiability logic in Discrete Hopfield Neural Network and multi-objective training of Modified Niched Genetic Algorithm

Nur Ezlin Zamri, Mohd. Asyraf Mansor, Mohd Shareduwan Mohd Kasihmuddin, Siti Syatirah Sidik, Alyaa Alway, Nurul Atiqah Romli, Yueling Guo, Siti Zulaikha Mohd Jamaludin

Summary: In this study, a hybrid logic mining model was proposed by combining the logic mining approach with the Modified Niche Genetic Algorithm. This model improves the generalizability and storage capacity of the retrieved induced logic. Various modifications were made to address other issues. Experimental results demonstrate that the proposed model outperforms baseline methods in terms of accuracy, precision, specificity, and correlation coefficient.

EXPERT SYSTEMS WITH APPLICATIONS (2024)

Article Computer Science, Artificial Intelligence

On taking advantage of opportunistic meta-knowledge to reduce configuration spaces for automated machine learning

David Jacob Kedziora, Tien-Dung Nguyen, Katarzyna Musial, Bogdan Gabrys

Summary: The paper addresses the problem of efficiently optimizing machine learning solutions by reducing the configuration space of ML pipelines and leveraging historical performance. The experiments conducted show that opportunistic/systematic meta-knowledge can improve ML outcomes, and configuration-space culling is optimal when balanced. The utility and impact of meta-knowledge depend on various factors and are crucial for generating informative meta-knowledge bases.

EXPERT SYSTEMS WITH APPLICATIONS (2024)

Article Computer Science, Artificial Intelligence

Optimal location for an EVPL and capacitors in grid for voltage profile and power loss: FHO-SNN approach

G. Sophia Jasmine, Rajasekaran Stanislaus, N. Manoj Kumar, Thangamuthu Logeswaran

Summary: In the context of a rapidly expanding electric vehicle market, this research investigates the ideal locations for EV charging stations and capacitors in power grids to enhance voltage stability and reduce power losses. A hybrid approach combining the Fire Hawk Optimizer and Spiking Neural Network is proposed, which shows promising results in improving system performance. The optimization approach has the potential to enhance the stability and efficiency of electric grids.

EXPERT SYSTEMS WITH APPLICATIONS (2024)

Article Computer Science, Artificial Intelligence

NLP-based approach for automated safety requirements information retrieval from project documents

Zhijiang Wu, Guofeng Ma

Summary: This study proposes a natural language processing-based framework for requirement retrieval and document association, which can help to mine and retrieve documents related to project managers' requirements. The framework analyzes the ontology relevance and emotional preference of requirements. The results show that the framework performs well in terms of iterations and threshold, and there is a significant matching between the retrieved documents and the requirements, which has significant managerial implications for construction safety management.

EXPERT SYSTEMS WITH APPLICATIONS (2024)

Article Computer Science, Artificial Intelligence

Dog nose-print recognition based on the shape and spatial features of scales

Yung-Kuan Chan, Chuen-Horng Lin, Yuan-Rong Ben, Ching-Lin Wang, Shu-Chun Yang, Meng-Hsiun Tsai, Shyr-Shen Yu

Summary: This study proposes a novel method for dog identification using nose-print recognition, which can be applied to controlling stray dogs, locating lost pets, and pet insurance verification. The method achieves high recognition accuracy through two-stage segmentation and feature extraction using a genetic algorithm.

EXPERT SYSTEMS WITH APPLICATIONS (2024)

Article Computer Science, Artificial Intelligence

Fostering supply chain resilience for omni-channel retailers: A two-phase approach for supplier selection and demand allocation under disruption risks

Shaohua Song, Elena Tappia, Guang Song, Xianliang Shi, T. C. E. Cheng

Summary: This study aims to optimize supplier selection and demand allocation decisions for omni-channel retailers in order to achieve supply chain resilience. It proposes a two-phase approach that takes into account various factors such as supplier evaluation and demand allocation.

EXPERT SYSTEMS WITH APPLICATIONS (2024)

Article Computer Science, Artificial Intelligence

Accelerating Benders decomposition approach for shared parking spaces allocation considering parking unpunctuality and no-shows

Jinyan Hu, Yanping Jiang

Summary: This paper examines the allocation problem of shared parking spaces considering parking unpunctuality and no-shows. It proposes an effective approach using sample average approximation (SAA) combined with an accelerating Benders decomposition (ABD) algorithm to solve the problem. The numerical experiments demonstrate the significance of supply-demand balance for the operation and user satisfaction of the shared parking system.

EXPERT SYSTEMS WITH APPLICATIONS (2024)

Review Computer Science, Artificial Intelligence

Financial fraud detection using graph neural networks: A systematic review

Soroor Motie, Bijan Raahemi

Summary: Financial fraud is a persistent problem in the finance industry, but Graph Neural Networks (GNNs) have emerged as a powerful tool for detecting fraudulent activities. This systematic review provides a comprehensive overview of the current state-of-the-art technologies in using GNNs for financial fraud detection, identifies gaps and limitations in existing research, and suggests potential directions for future research.

EXPERT SYSTEMS WITH APPLICATIONS (2024)

Review Computer Science, Artificial Intelligence

Occluded person re-identification with deep learning: A survey and perspectives

Enhao Ning, Changshuo Wang, Huang Zhang, Xin Ning, Prayag Tiwari

Summary: This review provides a detailed overview of occluded person re-identification methods and conducts a systematic analysis and comparison of existing deep learning-based approaches. It offers important theoretical and practical references for future research in the field.

EXPERT SYSTEMS WITH APPLICATIONS (2024)

Article Computer Science, Artificial Intelligence

A hierarchical attention detector for bearing surface defect detection

Jiajun Ma, Songyu Hu, Jianzhong Fu, Gui Chen

Summary: The article presents a novel visual hierarchical attention detector for multi-scale defect location and classification, utilizing texture, semantic, and instance features of defects through a hierarchical attention mechanism, achieving multi-scale defect detection in bearing images with complex backgrounds.

EXPERT SYSTEMS WITH APPLICATIONS (2024)