Article

Feature-specific mutual information variation for multi-label feature selection

Journal

INFORMATION SCIENCES
Volume 593, Pages 449-471

Publisher

ELSEVIER SCIENCE INC
DOI: 10.1016/j.ins.2022.02.024

Keywords

Multi-label feature selection; Information theory; Feature relevance; Changed ratio; Relevance based weight

Funding

  1. Project of Jilin Province Development and Reform Commission [2019FGWTZC001]
  2. Fundamental Research Funds for the Central Universities [93K172020K36]
  3. Science Foundation of Jilin Province of China [2020122209JC]

This paper proposes a novel feature selection method based on relevance and weight. By considering two types of changed ratios, the proposed method effectively evaluates the relevance of features. Experimental results demonstrate its superior performance on multi-label datasets.
Recent years have witnessed an urgent need to address the curse of dimensionality in multi-label data, which has drawn wide attention to feature selection. In previous information-theoretic multi-label feature selection approaches, feature relevance terms are often constructed from the amount of information that selected or candidate features contribute to the label set. Although the amount of information is important, these approaches ignore both the changed ratio of the undetermined amount of information and the changed ratio of the established amount of information; the influence of these two types of changed ratios on feature relevance evaluation should not be underestimated. To this end, we devise a new feature relevance term, Relevance based on Weight (RW), which is based on the two types of changed ratios. Both types of changed ratios can have a positive or negative impact on feature relevance evaluation. Based on RW, a novel multi-label feature selection approach, Relevance based on Weight Feature Selection (RWFS), is proposed. To verify its effectiveness, the proposed approach is compared with eight state-of-the-art multi-label approaches on thirteen real-world data sets. The experimental results show that RWFS outperforms the eight compared approaches. © 2022 Elsevier Inc. All rights reserved.
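
For readers unfamiliar with the setting, the kind of "amount of information" relevance term that the abstract attributes to earlier approaches can be sketched as follows. This is a minimal illustration, not the RW term or the RWFS algorithm, whose formulas are not given on this page. It assumes discrete features and labels, and the function names (mutual_information, relevance, rank_features) are hypothetical.

```python
from collections import Counter
from math import log2


def mutual_information(xs, ys):
    """Empirical mutual information I(X;Y) in bits for two discrete sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))  # counts of each (x, y) pair
    px = Counter(xs)              # marginal counts of x
    py = Counter(ys)              # marginal counts of y
    # p(x,y) * log2( p(x,y) / (p(x) p(y)) ), written in terms of raw counts
    return sum(
        (c / n) * log2(c * n / (px[x] * py[y]))
        for (x, y), c in joint.items()
    )


def relevance(feature, labels):
    """Baseline relevance (an assumption, not RW): sum of I(feature; label) over all labels."""
    return sum(mutual_information(feature, label) for label in labels)


def rank_features(features, labels):
    """Return feature indices sorted by decreasing baseline relevance."""
    scores = [relevance(f, labels) for f in features]
    return sorted(range(len(features)), key=lambda i: scores[i], reverse=True)
```

With features [[0, 1, 0, 1], [0, 0, 1, 1]] and a single label column [0, 0, 1, 1], rank_features places the second feature first, because it determines the label while the first is independent of it. A term of this kind scores only how much information a feature carries; according to the abstract, RW additionally weights relevance by two changed ratios.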

Recommended

Article Physics, Multidisciplinary

Multi-Label Feature Selection Based on High-Order Label Correlation Assumption

Ping Zhang, Wanfu Gao, Juncheng Hu, Yonghao Li

ENTROPY (2020)

Article Computer Science, Artificial Intelligence

Multi-label feature selection with shared common mode

Liang Hu, Yonghao Li, Wanfu Gao, Ping Zhang, Juncheng Hu

PATTERN RECOGNITION (2020)

Article Computer Science, Artificial Intelligence

Robust multi-label feature selection with dual-graph regularization

Juncheng Hu, Yonghao Li, Wanfu Gao, Ping Zhang

KNOWLEDGE-BASED SYSTEMS (2020)

Article Computer Science, Information Systems

Multi-label feature selection based on the division of label topics

Ping Zhang, Wanfu Gao, Juncheng Hu, Yonghao Li

Summary: A multi-label feature selection method based on label spectral clustering was proposed, which identifies important features and constructs feature subsets by clustering labels, and experimental results showed its superiority over seven existing multi-label feature selection methods.

INFORMATION SCIENCES (2021)

Article Automation & Control Systems

A conditional-weight joint relevance metric for feature relevancy term

Ping Zhang, Wanfu Gao, Juncheng Hu, Yonghao Li

Summary: Feature selection is crucial in machine learning and data mining, with traditional methods being improved upon by the novel CWJR-FS method, which utilizes conditional-weight joint relevance to design a new feature relevancy term, outperforming other methods in experiments.

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE (2021)

Article Physics, Multidisciplinary

Multi-Label Feature Selection Combining Three Types of Conditional Relevance

Lingbo Gao, Yiqiang Wang, Yonghao Li, Ping Zhang, Liang Hu

Summary: A novel feature selection method, combining three aspects of candidate features, selected features, and label correlations, is proposed in this paper to evaluate feature relevance. By reducing unnecessary redundancy, the method is able to better capture the optimal features and outperform other state-of-the-art multi-label approaches in experiments.

ENTROPY (2021)

Article Computer Science, Artificial Intelligence

Dynamic subspace dual-graph regularized multi-label feature selection

Juncheng Hu, Yonghao Li, Gaochao Xu, Wanfu Gao

Summary: This paper proposes a new multi-label feature selection method DSMFS, which achieves high-quality label subspace through dynamic subspace and dual-graph regularization, with experimental results demonstrating its superiority.

NEUROCOMPUTING (2022)

Article Computer Science, Artificial Intelligence

Multi-label feature selection method based on dynamic weight

Ping Zhang, Jiyao Sheng, Wanfu Gao, Juncheng Hu, Yonghao Li

Summary: This study proposes a multi-label feature selection method based on information theory, categorizing labels into two groups based on remaining uncertainty and utilizing relevancy ratio and weighted feature relevancy to evaluate candidate features. Experiment results show the effectiveness of the proposed method on real-world data sets.

SOFT COMPUTING (2022)

Article Computer Science, Information Systems

Label correlations variation for robust multi-label feature selection

Yonghao Li, Liang Hu, Wanfu Gao

Summary: This paper proposes a robust multi-label feature selection method that considers both types of label correlations. It eliminates redundant and noisy information through a self-expression model and a regularizer, and designs an optimization scheme to handle the objective function.

INFORMATION SCIENCES (2022)

Article Computer Science, Artificial Intelligence

Robust multi-label feature selection with shared label enhancement

Yonghao Li, Juncheng Hu, Wanfu Gao

Summary: This paper proposes a multi-label feature selection method named RLEFS, which improves the classification performance of models by considering the relationship between feature sets and label sets, as well as the importance of labels.

KNOWLEDGE AND INFORMATION SYSTEMS (2022)

Article Computer Science, Artificial Intelligence

Robust sparse and low-redundancy multi-label feature selection with dynamic local and global structure preservation

Yonghao Li, Liang Hu, Wanfu Gao

Summary: In recent years, joint feature selection and multi-label learning have been widely studied. However, existing multi-label feature selection methods face three challenges: neglecting feature redundancy, using low-quality graphs to capture local label correlations, and considering only either local or global label correlations. To address these challenges, we propose a method that preserves global and dynamic local label correlations by preserving the graph structure. We also introduce regularization terms to select low redundant features. Experimental results demonstrate the superiority of our method in classification tasks.

PATTERN RECOGNITION (2023)

Article Computer Science, Artificial Intelligence

Multi-label feature selection via robust flexible sparse regularization

Yonghao Li, Liang Hu, Wanfu Gao

Summary: Multi-label feature selection is an efficient technique for dealing with high-dimensional multi-label data, but existing methods suffer from low feature discrimination and redundancy. This paper proposes a new regularization norm and optimization framework to address these issues, and empirical studies demonstrate the effectiveness and efficiency of the proposed method.

PATTERN RECOGNITION (2023)

Article Computer Science, Artificial Intelligence

Multilabel Feature Selection With Constrained Latent Structure Shared Term

Wanfu Gao, Yonghao Li, Liang Hu

Summary: When dealing with high-dimensional multilabel data, we propose a feature selection method that shares latent feature and label structure. By designing an LSS term to share and preserve the latent structure, and employing graph regularization technique to ensure consistency, we achieve better results on multiple evaluation criteria according to experiments.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

Article Computer Science, Information Systems

Feature Redundancy Based on Interaction Information for Multi-Label Feature Selection

Wanfu Gao, Juncheng Hu, Yonghao Li, Ping Zhang

IEEE ACCESS (2020)

Article Computer Science, Information Systems

A consensus model considers managing manipulative and overconfident behaviours in large-scale group decision-making

Xia Liang, Jie Guo, Peide Liu

Summary: This paper investigates a novel consensus model based on social networks to manage manipulative and overconfident behaviors in large-scale group decision-making. By proposing a novel clustering model and improved methods, the consensus reaching is effectively facilitated. The feedback mechanism and management approach are employed to handle decision makers' behaviors. Simulation experiments and comparative analysis demonstrate the effectiveness of the model.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

CGN: Class gradient network for the construction of adversarial samples

Xiang Li, Haiwang Guo, Xinyang Deng, Wen Jiang

Summary: This paper proposes a method based on class gradient networks for generating high-quality adversarial samples. By introducing a high-level class gradient matrix and combining classification loss and perturbation loss, the method demonstrates superiority in the transferability of adversarial samples on targeted attacks.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

Distinguishing latent interaction types from implicit feedbacks for recommendation

Lingyun Lu, Bang Wang, Zizhuo Zhang, Shenghao Liu

Summary: Many recommendation algorithms only rely on implicit feedbacks due to privacy concerns. However, the encoding of interaction types is often ignored. This paper proposes a relation-aware neural model that classifies implicit feedbacks by encoding edges, thereby enhancing recommendation performance.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

Proximity-based density description with regularized reconstruction algorithm for anomaly detection

Jaehong Yu, Hyungrok Do

Summary: This study discusses unsupervised anomaly detection using one-class classification, which determines whether a new instance belongs to the target class by constructing a decision boundary. The proposed method uses a proximity-based density description and a regularized reconstruction algorithm to overcome the limitations of existing one-class classification methods. Experimental results demonstrate the superior performance of the proposed algorithm.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

Non-iterative border-peeling clustering algorithm based on swap strategy

Hui Tu, Shifei Ding, Xiao Xu, Haiwei Hou, Chao Li, Ling Ding

Summary: Border-Peeling algorithm is a density-based clustering algorithm, but its complexity and issues on unbalanced datasets restrict its application. This paper proposes a non-iterative border-peeling clustering algorithm, which improves the clustering performance by distinguishing and associating core points and border points.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

A two-stage denoising framework for zero-shot learning with noisy labels

Long Tang, Pan Zhao, Zhigeng Pan, Xingxing Duan, Panos M. Pardalos

Summary: In this work, a two-stage denoising framework (TSDF) is proposed for zero-shot learning (ZSL) to address the issue of noisy labels. The framework includes a tailored loss function to remove suspected noisy-label instances and a ramp-style loss function to reduce the negative impact of remaining noisy labels. In addition, a dynamic screening strategy (DSS) is developed to efficiently handle the nonconvexity of the ramp-style loss.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

Selection of a viable blockchain service provider for data management within the internet of medical things: An MCDM approach to Indian healthcare

Raghunathan Krishankumar, Sundararajan Dhruva, Kattur S. Ravichandran, Samarjit Kar

Summary: Health 4.0 is gaining global attention for better healthcare through digital technologies. This study proposes a new decision-making framework for selecting viable blockchain service providers in the Internet of Medical Things (IoMT). The framework addresses the limitations in previous studies and demonstrates its applicability in the Indian healthcare sector. The results show the top ranking BSPs, the importance of various criteria, and the effectiveness of the developed model.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

Q-learning with heterogeneous update strategy

Tao Tan, Hong Xie, Liang Feng

Summary: This paper proposes a heterogeneous update idea and designs HetUp Q-learning algorithm to enlarge the normalized gap by overestimating the Q-value corresponding to the optimal action and underestimating the Q-value corresponding to the other actions. To address the limitation, a softmax strategy is applied to estimate the optimal action, resulting in HetUpSoft Q-learning and HetUpSoft DQN. Extensive experimental results show significant improvements over SOTA baselines.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

Dyformer: A dynamic transformer-based architecture for multivariate time series classification

Chao Yang, Xianzhi Wang, Lina Yao, Guodong Long, Guandong Xu

Summary: This paper proposes a dynamic transformer-based architecture called Dyformer for multivariate time series classification. Dyformer captures multi-scale features through hierarchical pooling and adaptive learning strategies, and improves model performance by introducing feature-map-wise attention mechanisms and a joint loss function.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

ESSENT: an arithmetic optimization algorithm with enhanced scatter search strategy for automated test case generation

Xiguang Li, Baolu Feng, Yunhe Sun, Ammar Hawbani, Saeed Hammod Alsamhi, Liang Zhao

Summary: This paper proposes an enhanced scatter search strategy, using opposition-based learning, to solve the problem of automated test case generation based on path coverage (ATCG-PC). The proposed ESSENT algorithm selects the path with the lowest path entropy among the uncovered paths as the target path and generates new test cases to cover the target path by modifying the dimensions of existing test cases. Experimental results show that the ESSENT algorithm outperforms other state-of-the-art algorithms, achieving maximum path coverage with fewer test cases.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

An attention based approach for automated account linkage in federated identity management

Shirin Dabbaghi Varnosfaderani, Piotr Kasprzak, Aytaj Badirova, Ralph Krimmel, Christof Pohl, Ramin Yahyapour

Summary: Linking digital accounts belonging to the same user is crucial for security, user satisfaction, and next-generation service development. However, research on account linkage is mainly focused on social networks, and there is a lack of studies in other domains. To address this, we propose SmartSSO, a framework that automates the account linkage process by analyzing user routines and behavior during login processes. Our experiments on a large dataset show that SmartSSO achieves over 98% accuracy in hit-precision.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

A memetic algorithm with fuzzy-based population control for the joint order batching and picker routing problem

Renchao Wu, Jianjun He, Xin Li, Zuguo Chen

Summary: This paper proposes a memetic algorithm with fuzzy-based population control (MA-FPC) to solve the joint order batching and picker routing problem (JOBPRP). The algorithm incorporates batch exchange crossover and a two-level local improvement procedure. Experimental results show that MA-FPC outperforms existing algorithms in terms of solution quality.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

Refining one-class representation: A unified transformer for unsupervised time-series anomaly detection

Guoxiang Zhong, Fagui Liu, Jun Jiang, Bin Wang, C. L. Philip Chen

Summary: In this study, we propose the AMFormer framework to address the problem of mixed normal and anomaly samples in deep unsupervised time-series anomaly detection. By refining the one-class representation and introducing the masked operation mechanism and cost sensitive learning theory, our approach significantly improves anomaly detection performance.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

A data-driven optimisation method for a class of problems with redundant variables and indefinite objective functions

Jin Zhou, Kang Zhou, Gexiang Zhang, Ferrante Neri, Wangyang Shen, Weiping Jin

Summary: In this paper, the authors focus on the issue of multi-objective optimisation problems with redundant variables and indefinite objective functions (MOPRVIF) in practical problem-solving. They propose a dual data-driven method for solving this problem, which consists of eliminating redundant variables, constructing objective functions, selecting evolution operators, and using a multi-objective evolutionary algorithm. The experiments conducted on two different problem domains demonstrate the effectiveness, practicality, and scalability of the proposed method.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

A Monte Carlo fuzzy logistic regression framework against imbalance and separation

Georgios Charizanos, Haydar Demirhan, Duygu Icen

Summary: This article proposes a new fuzzy logistic regression framework that addresses the problems of separation and imbalance while maintaining the interpretability of classical logistic regression. By fuzzifying binary variables and classifying subjects based on a fuzzy threshold, the framework demonstrates superior performance on imbalanced datasets.

INFORMATION SCIENCES (2024)