4.7 Article

A feature selection method via analysis of relevance, redundancy, and interaction

期刊

EXPERT SYSTEMS WITH APPLICATIONS
卷 183, 期 -, 页码 -

出版社

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.eswa.2021.115365

关键词

Feature selection; Feature redundancy; Interaction information; Classification; Feature relevance

资金

  1. National Social Science Foundation of China [17CTQ045]
  2. Soft Science Research Project of Guangdong Province [2019A101002108]
  3. Science and Technology Program of Guangzhou [202002030227]
  4. Special Innovation Project of Guangdong Education Department [2018KQNCX072]
  5. Key Field Project for Universities of Guangdong Province [2019KZDZX1016]

向作者/读者索取更多资源

The study introduces a novel feature selection algorithm that selects relevant and interactive features using a maximum criterion, leading to improved classification accuracy. Experimental results show that the algorithm efficiently selects features and enhances classifiers to achieve better or comparable classification accuracy compared to ten representative competing feature selection algorithms.
Feature selection aims at selecting important features that can enhance learning performance in data mining, pattern recognition, and machine learning. Filter feature selection methods offer computational efficiency and feature evaluation criteria, while feature interaction information, which may greatly help increase classification accuracy, is often ignored. In this work, we instead propose a novel feature selection algorithm that uses the maximum of the maximum criterion to select highly relevant features and their maximally interactive features. Extensive experiments are performed to evaluate the performance of the proposed method with regard to the number of selected features and classification accuracy on thirty UCI datasets. The results demonstrate that the proposed algorithm not only efficiently selects the relevant features and the interactive features, but also enables classifiers to achieve classification accuracy that is better than, or comparably well to, ten representative competing feature selection algorithms.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Computer Science, Information Systems

Efficient feature selection based on correlation measure between continuous and discrete features

Sheng-yi Jiang, Lian-xi Wang

INFORMATION PROCESSING LETTERS (2016)

Article Computer Science, Artificial Intelligence

A clustering-based feature selection via feature separability

Shengyi Jiang, Lianxi Wang

JOURNAL OF INTELLIGENT & FUZZY SYSTEMS (2016)

Article Computer Science, Information Systems

Multi-label emotion classification based on adversarial multi-task learning

Nankai Lin, Sihui Fu, Xiaotian Lin, Lianxi Wang

Summary: This paper focuses on the task of multi-label emotion classification and proposes a novel multi-task multi-label emotion classification method to exploit the correlation among different emotions and overcome the limited public datasets for low-resource languages. The proposed approach achieves better performance than state-of-the-art baselines and constructs Indonesian and English datasets for the task.

INFORMATION PROCESSING & MANAGEMENT (2022)

Article Computer Science, Hardware & Architecture

A Chinese Grammatical Error Correction Model Based On Grammatical Generalization And Parameter Sharing

Nankai Lin, Xiaotian Lin, Yingwen Fu, Shengyi Jiang, Lianxi Wang

Summary: Chinese grammatical error correction (CGEC) is a key challenge in Chinese natural language processing. This paper proposes a competitive CGEC model that reduces the number of model parameters while achieving comparable performance to baseline models. The proposed model is evaluated using English datasets to assess its generalization and scalability, providing a new research direction for CGEC.

COMPUTER JOURNAL (2023)

Article Computer Science, Artificial Intelligence

Feature selection considering interaction, redundancy and complementarity for outlier detection in categorical data

Lianxi Wang, Yubing Ke

Summary: This paper proposes a feature selection method for outlier detection in categorical data, taking into account the feature relevance, interaction, redundancy, and complementarity. Experimental results demonstrate that the proposed method outperforms five other state-of-the-art feature selection methods on 14 real-world datasets.

KNOWLEDGE-BASED SYSTEMS (2023)

Proceedings Paper Computer Science, Artificial Intelligence

Multilingual China-Related News Identification Framework Based on Multiple Strategies

Lianxi Wang, Xiaotian Lin, Nankai Lin

Summary: This article proposes a framework for identifying China-related news in multilingual news, using weakly supervised learning and multi-task learning to improve accuracy. The framework utilizes XLM as a language model to identify China-related news in multiple languages, and employs pseudo-label technology to generate corpus and train with high confidence texts.

CHINESE LEXICAL SEMANTICS, CLSW 2021, PT I (2022)

Proceedings Paper Computer Science, Artificial Intelligence

An Interdisciplinary Literature Classifier Based on Multi-task Multi-label Learning

Lianxi Wang, Zhuolin Chen, Nankai Lin, Xixuan Huang

Summary: Interdisciplinary integration is a key driver of scientific innovation, and this paper proposes a classification framework for interdisciplinary literature based on multi-task learning using BERT, which improves the classification effect for minority categories and achieves a Macro-F1 value of 74.84%.

2021 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP) (2021)

Proceedings Paper Computer Science, Artificial Intelligence

HinPLMs: Pre-trained Language Models for Hindi

Xixuan Huang, Nankai Lin, Kexin Li, Lianxi Wang, Suifu Gan

Summary: The use of pre-trained models (PTMs) has been shown to significantly improve the performance of natural language processing tasks for languages with rich resources, while reducing the amount of labeled sample data needed for supervised learning. This study focuses on constructing a Hindi pre-training corpus in Devanagari and Romanized scripts, training Hindi pre-trained models, and evaluating their performance on various NLP tasks. Results indicate that the proposed Hindi pre-trained models outperform existing models, with the Devanagari script model excelling in certain tasks and the Romanized script model performing better in others.

2021 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP) (2021)

Proceedings Paper Computer Science, Information Systems

Research on Pseudo-label Technology for Multi-label News Classification

Lianxi Wang, Xiaotian Lin, Nankai Lin

Summary: This paper addresses the issue of multi-label news classification by constructing a corpus in Indonesian language and proposing a new framework based on pseudo-label technology. The framework utilizes the BERT model and cosine similarity algorithm for label matching to solve problems related to data imbalance and label quantity differences affecting model training.

DOCUMENT ANALYSIS AND RECOGNITION - ICDAR 2021, PT II (2021)

Review Computer Science, Artificial Intelligence

A comprehensive review of slope stability analysis based on artificial intelligence methods

Wei Gao, Shuangshuang Ge

Summary: This study provides a comprehensive review of slope stability research based on artificial intelligence methods, focusing on slope stability computation and evaluation. The review covers studies using quasi-physical intelligence methods, simulated evolutionary methods, swarm intelligence methods, hybrid intelligence methods, artificial neural network methods, vector machine methods, and other intelligence methods. The merits, demerits, and state-of-the-art research advancement of these studies are analyzed, and possible research directions for slope stability investigation based on artificial intelligence methods are suggested.

EXPERT SYSTEMS WITH APPLICATIONS (2024)

Article Computer Science, Artificial Intelligence

Machine learning approaches for lateral strength estimation in squat shear walls: A comparative study and practical implications

Khuong Le Nguyen, Hoa Thi Trinh, Saeed Banihashemi, Thong M. Pham

Summary: This study investigated the influence of input parameters on the shear strength of RC squat walls and found that ensemble learning models, particularly XGBoost, can effectively predict the shear strength. The axial load had a greater influence than reinforcement ratio, and longitudinal reinforcement had a more significant impact compared to horizontal and vertical reinforcement. The performance of XGBoost model outperforms traditional design models and reducing input features still yields reliable predictions.

EXPERT SYSTEMS WITH APPLICATIONS (2024)

Article Computer Science, Artificial Intelligence

DHESN: A deep hierarchical echo state network approach for algal bloom prediction

Bo Hu, Huiyan Zhang, Xiaoyi Wang, Li Wang, Jiping Xu, Qian Sun, Zhiyao Zhao, Lei Zhang

Summary: A deep hierarchical echo state network (DHESN) is proposed to address the limitations of shallow coupled structures. By using transfer entropy, candidate variables with strong causal relationships are selected and a hierarchical reservoir structure is established to improve prediction accuracy. Simulation results demonstrate that DHESN performs well in predicting algal bloom.

EXPERT SYSTEMS WITH APPLICATIONS (2024)

Article Computer Science, Artificial Intelligence

Learning high-dependence Bayesian network classifier with robust topology

Limin Wang, Lingling Li, Qilong Li, Kuo Li

Summary: This paper discusses the urgency of learning complex multivariate probability distributions due to the increase in data variability and quantity. It introduces a highly scalable classifier called TAN, which utilizes maximum weighted spanning tree (MWST) for graphical modeling. The paper theoretically proves the feasibility of extending one-dependence MWST to model high-dependence relationships and proposes a heuristic search strategy to improve the fitness of the extended topology to data. Experimental results demonstrate that this algorithm achieves a good bias-variance tradeoff and competitive classification performance compared to other high-dependence or ensemble learning algorithms.

EXPERT SYSTEMS WITH APPLICATIONS (2024)

Article Computer Science, Artificial Intelligence

Make a song curative: A spatio-temporal therapeutic music transfer model for anxiety reduction

Zhejing Hu, Gong Chen, Yan Liu, Xiao Ma, Nianhong Guan, Xiaoying Wang

Summary: Anxiety is a prevalent issue and music therapy has been found effective in reducing anxiety. To meet the diverse needs of individuals, a novel model called the spatio-temporal therapeutic music transfer model (StTMTM) is proposed.

EXPERT SYSTEMS WITH APPLICATIONS (2024)

Article Computer Science, Artificial Intelligence

A modified reverse-based analysis logic mining model with Weighted Random 2 Satisfiability logic in Discrete Hopfield Neural Network and multi-objective training of Modified Niched Genetic Algorithm

Nur Ezlin Zamri, Mohd. Asyraf Mansor, Mohd Shareduwan Mohd Kasihmuddin, Siti Syatirah Sidik, Alyaa Alway, Nurul Atiqah Romli, Yueling Guo, Siti Zulaikha Mohd Jamaludin

Summary: In this study, a hybrid logic mining model was proposed by combining the logic mining approach with the Modified Niche Genetic Algorithm. This model improves the generalizability and storage capacity of the retrieved induced logic. Various modifications were made to address other issues. Experimental results demonstrate that the proposed model outperforms baseline methods in terms of accuracy, precision, specificity, and correlation coefficient.

EXPERT SYSTEMS WITH APPLICATIONS (2024)

Article Computer Science, Artificial Intelligence

On taking advantage of opportunistic meta-knowledge to reduce configuration spaces for automated machine learning

David Jacob Kedziora, Tien-Dung Nguyen, Katarzyna Musial, Bogdan Gabrys

Summary: The paper addresses the problem of efficiently optimizing machine learning solutions by reducing the configuration space of ML pipelines and leveraging historical performance. The experiments conducted show that opportunistic/systematic meta-knowledge can improve ML outcomes, and configuration-space culling is optimal when balanced. The utility and impact of meta-knowledge depend on various factors and are crucial for generating informative meta-knowledge bases.

EXPERT SYSTEMS WITH APPLICATIONS (2024)

Article Computer Science, Artificial Intelligence

Optimal location for an EVPL and capacitors in grid for voltage profile and power loss: FHO-SNN approach

G. Sophia Jasmine, Rajasekaran Stanislaus, N. Manoj Kumar, Thangamuthu Logeswaran

Summary: In the context of a rapidly expanding electric vehicle market, this research investigates the ideal locations for EV charging stations and capacitors in power grids to enhance voltage stability and reduce power losses. A hybrid approach combining the Fire Hawk Optimizer and Spiking Neural Network is proposed, which shows promising results in improving system performance. The optimization approach has the potential to enhance the stability and efficiency of electric grids.

EXPERT SYSTEMS WITH APPLICATIONS (2024)

Article Computer Science, Artificial Intelligence

NLP-based approach for automated safety requirements information retrieval from project documents

Zhijiang Wu, Guofeng Ma

Summary: This study proposes a natural language processing-based framework for requirement retrieval and document association, which can help to mine and retrieve documents related to project managers' requirements. The framework analyzes the ontology relevance and emotional preference of requirements. The results show that the framework performs well in terms of iterations and threshold, and there is a significant matching between the retrieved documents and the requirements, which has significant managerial implications for construction safety management.

EXPERT SYSTEMS WITH APPLICATIONS (2024)

Article Computer Science, Artificial Intelligence

Dog nose-print recognition based on the shape and spatial features of scales

Yung-Kuan Chan, Chuen-Horng Lin, Yuan-Rong Ben, Ching-Lin Wang, Shu-Chun Yang, Meng-Hsiun Tsai, Shyr-Shen Yu

Summary: This study proposes a novel method for dog identification using nose-print recognition, which can be applied to controlling stray dogs, locating lost pets, and pet insurance verification. The method achieves high recognition accuracy through two-stage segmentation and feature extraction using a genetic algorithm.

EXPERT SYSTEMS WITH APPLICATIONS (2024)

Article Computer Science, Artificial Intelligence

Fostering supply chain resilience for omni-channel retailers: A two-phase approach for supplier selection and demand allocation under disruption risks

Shaohua Song, Elena Tappia, Guang Song, Xianliang Shi, T. C. E. Cheng

Summary: This study aims to optimize supplier selection and demand allocation decisions for omni-channel retailers in order to achieve supply chain resilience. It proposes a two-phase approach that takes into account various factors such as supplier evaluation and demand allocation.

EXPERT SYSTEMS WITH APPLICATIONS (2024)

Article Computer Science, Artificial Intelligence

Accelerating Benders decomposition approach for shared parking spaces allocation considering parking unpunctuality and no-shows

Jinyan Hu, Yanping Jiang

Summary: This paper examines the allocation problem of shared parking spaces considering parking unpunctuality and no-shows. It proposes an effective approach using sample average approximation (SAA) combined with an accelerating Benders decomposition (ABD) algorithm to solve the problem. The numerical experiments demonstrate the significance of supply-demand balance for the operation and user satisfaction of the shared parking system.

EXPERT SYSTEMS WITH APPLICATIONS (2024)

Review Computer Science, Artificial Intelligence

Financial fraud detection using graph neural networks: A systematic review

Soroor Motie, Bijan Raahemi

Summary: Financial fraud is a persistent problem in the finance industry, but Graph Neural Networks (GNNs) have emerged as a powerful tool for detecting fraudulent activities. This systematic review provides a comprehensive overview of the current state-of-the-art technologies in using GNNs for financial fraud detection, identifies gaps and limitations in existing research, and suggests potential directions for future research.

EXPERT SYSTEMS WITH APPLICATIONS (2024)

Review Computer Science, Artificial Intelligence

Occluded person re-identification with deep learning: A survey and perspectives

Enhao Ning, Changshuo Wang, Huang Zhang, Xin Ning, Prayag Tiwari

Summary: This review provides a detailed overview of occluded person re-identification methods and conducts a systematic analysis and comparison of existing deep learning-based approaches. It offers important theoretical and practical references for future research in the field.

EXPERT SYSTEMS WITH APPLICATIONS (2024)

Article Computer Science, Artificial Intelligence

A hierarchical attention detector for bearing surface defect detection

Jiajun Ma, Songyu Hu, Jianzhong Fu, Gui Chen

Summary: The article presents a novel visual hierarchical attention detector for multi-scale defect location and classification, utilizing texture, semantic, and instance features of defects through a hierarchical attention mechanism, achieving multi-scale defect detection in bearing images with complex backgrounds.

EXPERT SYSTEMS WITH APPLICATIONS (2024)