☆ 4.6 Article

Classifier ensemble selection based on affinity propagation clustering

JOURNAL OF BIOMEDICAL INFORMATICS (2016)

Journal

JOURNAL OF BIOMEDICAL INFORMATICS

Volume 60, Issue -, Pages 234-242

Publisher

ACADEMIC PRESS INC ELSEVIER SCIENCE

DOI: 10.1016/j.jbi.2016.02.010

Keywords

Classification; Ranking aggregation; Affinity propagation clustering; Kappa correlation; Ensemble feature selection

Categories

Computer Science, Interdisciplinary Applications Medical Informatics

Funding

National Natural Science Foundation of China [61472061, 31471880, 31272167]

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

A small number of features are significantly correlated with classification in high-dimensional data. An ensemble feature selection method based on cluster grouping is proposed in this paper. Classification related features are chosen using a ranking aggregation technique. These features are divided into unrelated groups by an affinity propagation clustering algorithm with a bicor correlation coefficient. Some diversity and distinguishing feature subsets are constructed by randomly selecting a feature from each group and are used to train base classifiers. Finally, some base classifiers that have better classification performance are selected using a kappa coefficient and integrated using a majority voting strategy. The experimental results based on five gene expression datasets show that the proposed method has low classification error rates, stable classification performance and strong scalability in terms of sensitivity, specificity, accuracy and G-Mean criteria. (C) 2016 Elsevier Inc. All rights reserved.

Authors

I am an author on this paper

Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6

Not enough ratings

Secondary Ratings

Novelty

-

Significance

-

Scientific rigor

-

Rate this paper

Recommended

Article Environmental Sciences

Correlation-Guided Ensemble Clustering for Hyperspectral Band Selection

Wenguang Wang, Wenhong Wang, Hongfu Liu

Summary: In this paper, a correlation-guided ensemble clustering approach is proposed for hyperspectral band selection. By utilizing ensemble clustering and a consensus function, this approach can effectively select informative and representative bands.

REMOTE SENSING (2022)

Add to Collection

Article Computer Science, Artificial Intelligence

Bi-level ensemble method for unsupervised feature selection

Peng Zhou, Xia Wang, Liang Du

Summary: Unsupervised feature selection is an important task in machine learning but suffers from stability and robustness issues due to the absence of labels. This paper proposes a novel bi-level feature selection ensemble method that not only ensembles at the feature level but also learns a consensus clustering result to guide the feature selection, outperforming other state-of-the-art methods.

INFORMATION FUSION (2023)

Add to Collection

Article Computer Science, Artificial Intelligence

Re-ranking and TOPSIS-based ensemble feature selection with multi-stage aggregation for text categorization

Guanghua Fu, Bencheng Li, Yongsheng Yang, Chaofeng Li

Summary: This paper proposes a four-stage ensemble feature selection method called RTEFS, which can reduce data dimensionality and improve the accuracy and computational cost of machine learning models. Experimental results show that RTEFS outperforms the base counterparts in terms of accuracy and F-measure scores.

PATTERN RECOGNITION LETTERS (2023)

Add to Collection

Review Physics, Multidisciplinary

Application of Biological Domain Knowledge Based Feature Selection on Gene Expression Data

Malik Yousef, Abhishek Kumar, Burcu Bakir-Gungor

Summary: In the past two decades, advancements in high throughput technologies have led to exponential growth of gene expression datasets. Integrative approaches combining statistical metrics and biological knowledge are necessary for improving biomarker identification and potential treatment targets. These approaches are expected to enhance disease prediction, diagnosis, treatment, and understanding of disease dynamics.

ENTROPY (2021)

Add to Collection

Article Computer Science, Information Systems

Unsupervised spectral feature selection algorithms for high dimensional data

Mingzhao Wang, Henry Han, Zhao Huang, Juanying Xie

Summary: It is proposed in this paper to detect the informative features for high dimensional data with a small number of samples through two unsupervised spectral feature selection algorithms. These algorithms group features using an advanced Self-Tuning spectral clustering algorithm and detect the global optimal feature clusters through feature ranking techniques. Extensive experiments demonstrate the effectiveness of the proposed algorithms, especially the one based on cosine similarity feature ranking technique. The detected features have strong discriminative capabilities, making them suitable for building reliable and explainable AI systems, particularly in medical diagnostic systems.

FRONTIERS OF COMPUTER SCIENCE (2023)

Add to Collection

Article Computer Science, Theory & Methods

The stability of different aggregation techniques in ensemble feature selection

Reem Salman, Ayman Alzaatreh, Hana Sulieman

Summary: This study explores the impact of different aggregation strategies on the stability and accuracy of ensemble feature selection, finding significant differences in the performance of ensembles under different aggregation methods, especially between score-based and rank-based aggregation strategies. Simple score-based strategies, such as Arithmetic Mean or L2-norm aggregation, appear to be efficient and compelling in most cases.

JOURNAL OF BIG DATA (2022)

Add to Collection

Article Computer Science, Artificial Intelligence

Ensemble of feature selection algorithms: a multi-criteria decision-making approach

Amin Hashemi, Mohammad Bagher Dowlatshahi, Hossein Nezamabadi-pour

Summary: In this paper, ensemble feature selection is modeled as a Multi-Criteria Decision-Making (MCDM) process, and a novel method called EFS-MCDM is proposed to rank and score features. Experimental results demonstrate that the proposed method outperforms other similar methods in terms of accuracy and efficiency.

INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS (2022)

Add to Collection

Review Automation & Control Systems

From clustering to clustering ensemble selection: A review

Keyvan Golalipour, Ebrahim Akbari, Seyed Saeed Hamidi, Malrey Lee, Rasul Enayatifar

Summary: Clustering aims to discover natural groupings of patterns, points, or objects without a deterministic approach to decide the best method for a given set of input data. Clustering ensemble combines computed solutions of base clustering algorithms to achieve stability and robustness, while clustering ensemble selection chooses a subset of base clustering based on quality and diversity for better performance. This survey covers the historical development of data clustering, basic clustering techniques, clustering ensemble algorithms, and clustering ensemble selection techniques for improved quality and diversity.

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE (2021)

Add to Collection

Article Physics, Multidisciplinary

A Bootstrap Framework for Aggregating within and between Feature Selection Methods

Reem Salman, Ayman Alzaatreh, Hana Sulieman, Shaimaa Faisal

Summary: This study implemented a general framework for the ensemble of multiple feature selection methods, which aggregates importance scores generated by different selection methods to resolve inconsistency issues and control the diversity of selected feature subsets. Experimental results showed that the Within Aggregation Method (WAM) is more stable in identifying important features compared to the Between Aggregation Method (BAM), providing an effective tool for determining the best feature selection method for a given dataset. By applying both WAM and BAM, practitioners can gain a deeper understanding of the feature selection process.

ENTROPY (2021)

Add to Collection

Article Energy & Fuels

Power system coherency assessment by the affinity propagation algorithm and distance correlation

Jose Ortiz-Bejar, Alejandro Zamora-Mendez, Lucas Lugnani, Eric Tellez, Mario R. Arrieta Paternina

Summary: This paper assesses the coherency in power systems using the affinity propagation (AP) algorithm with different distance metrics and quality measurements. The AP method is adopted to identify and distinguish coherent patterns in a power system, and three different distance metrics are evaluated to determine their impact on the clustering quality. The experimental results demonstrate the effectiveness of the proposed strategy in identifying coherent patterns in large-scale power systems.

SUSTAINABLE ENERGY GRIDS & NETWORKS (2022)

Add to Collection

Article Computer Science, Artificial Intelligence

On k-means iterations and Gaussian clusters

Renato Cordeiro de Amorim, Vladimir Makarenkov

Summary: This article explores the relationship between the convergence iteration number (τ) of the k-means algorithm and the structure and clustering quality of the data set. It demonstrates that τ can be used to identify irrelevant features, improve feature selection algorithms, and determine the true number of clusters in a data set.

NEUROCOMPUTING (2023)

Add to Collection

Article Computer Science, Artificial Intelligence

Detecting Meaningful Clusters From High-Dimensional Data: A Strongly Consistent Sparse Center-Based Clustering Approach

Saptarshi Chakraborty, Swagatam Das

Summary: In this paper, a simple and efficient sparse clustering algorithm called LW-k-means is proposed for high-dimensional data. The algorithm incorporates feature weighting to enable feature selection and has a time complexity similar to traditional algorithms. The strong consistency of the LW-k-means procedure is also established. Experimental results on synthetic and real-life datasets demonstrate that LW-k-means performs competitively in terms of clustering accuracy and computational time compared to existing methods for center-based high-dimensional clustering.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2022)

Add to Collection

Article Computer Science, Artificial Intelligence

Particle ranking: An Efficient Method for Multi-Objective Particle Swarm Optimization Feature Selection

Abdolreza Rashno, Milad Shafipour, Sadegh Fadaei

Summary: This paper introduces a novel multi-objective particle swarm optimization feature selection method. It decodes feature vectors as particles and ranks them in a two-dimensional optimization space. The proposed method incorporates feature ranks to update particle velocity and position during the optimization process. Experimental results demonstrate the effectiveness of the method in finding Pareto Fronts of the best particles in multi-objective optimization space.

KNOWLEDGE-BASED SYSTEMS (2022)

Add to Collection

Article Computer Science, Artificial Intelligence

Diversity improvement in homogeneous ensemble feature selection: a case study of its impact on classification performance

Vahid Nosrati, Mohsen Rahmani

Summary: In this paper, the authors enhance the diversity paradigm in ensemble feature selection models by applying recursive balanced partitioning (RBP) approach. They propose a new diversity measurement and aggregation criterion. Experimental results demonstrate that the proposed RBP method outperforms the traditional random partitioning in terms of diversity achievement. Furthermore, the study shows a positive relationship between diversity and classification accuracy.

NEURAL COMPUTING & APPLICATIONS (2023)

Add to Collection

Article Biochemical Research Methods

Ensemble classification based feature selection: a case of identification on plant pentatricopeptide repeat proteins

Xudong Zhao, Jingwen Zhai, Tong Liu, Guohua Wang

Summary: This paper proposes an improved variable selection framework for identifying plant PPR proteins. The improvements include the use of a hybrid ensemble classifier and alternating feature selection strategy, and it is found that different base classifiers play an important role as the feature dimension increases. Experimental results demonstrate the effectiveness of the improvements.

BRIEFINGS IN BIOINFORMATICS (2022)

Add to Collection

No Data Available

No Data Available

© Peeref 2019-2024. All rights reserved.