☆ 4.7 Article

Graph Regularized Feature Selection with Data Reconstruction

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING (2016)

Journal

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING

Volume 28, Issue 3, Pages 689-700

Publisher

IEEE COMPUTER SOC

DOI: 10.1109/TKDE.2015.2493537

Keywords

Feature selection; similarity preserving; data reconstruction

Categories

Computer Science, Artificial Intelligence Computer Science, Information Systems Engineering, Electrical & Electronic

Funding

National Basic Research Program of China (973 Program) [2013CB336500]
National Natural Science Foundation of China [61233011, 61125203]
China Knowledge Centre for Engineering Sciences and Technology (CKCEST)
[HKUST FSGRF13EG22]
[HKUST FSGRF14EG31]

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

Feature selection is a challenging problem for high dimensional data processing, which arises in many real applications such as data mining, information retrieval, and pattern recognition. In this paper, we study the problem of unsupervised feature selection. The problem is challenging due to the lack of label information to guide feature selection. We formulate the problem of unsupervised feature selection from the viewpoint of graph regularized data reconstruction. The underlying idea is that the selected features not only preserve the local structure of the original data space via graph regularization, but also approximately reconstruct each data point via linear combination. Therefore, the graph regularized data reconstruction error becomes a natural criterion for measuring the quality of the selected features. By minimizing the reconstruction error, we are able to select the features that best preserve both the similarity and discriminant information in the original data. We then develop an efficient gradient algorithm to solve the corresponding optimization problem. We evaluate the performance of our proposed algorithm on text clustering. The extensive experiments demonstrate the effectiveness of our proposed approach.

Authors

I am an author on this paper

Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7

Not enough ratings

Secondary Ratings

Novelty

-

Significance

-

Scientific rigor

-

Rate this paper

Recommended

Article Computer Science, Artificial Intelligence

Classifier selection using geometry preserving feature

Binbin Pan, Wen-Sheng Chen, Liping Deng, Chen Xu, Xiaobo Zhou

Summary: The selection of proper classifiers for a given data set is challenging, and the critical problem is how to extract features. This paper proposes a new method that preserves the geometrical structure and characterizes the decision boundary of a data set. The extracted features can recover the same Euclidean geometrical structure as the original data set. An efficient algorithm is presented to compute the similarity between data set features, and the impact of feature similarity on the performance of the support vector machine is theoretically analyzed. Empirical results demonstrate the effectiveness of the proposed method in finding suitable classifiers.

NEURAL COMPUTING & APPLICATIONS (2023)

Add to Collection

Article Computer Science, Artificial Intelligence

sCOs: Semi-Supervised Co-Selection by a Similarity Preserving Approach

Khalid Benabdeslem, Dou El Kefel Mansouri, Raywat Makkhongkaew

Summary: This paper focuses on the co-selection of instances and features in the semi-supervised learning scenario. It proposes a unified framework, called sCOs, that integrates labeled and unlabeled parts into the co-selection process. Two efficient algorithms are proposed and experimental results validate the effectiveness of the method.

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING (2022)

Add to Collection

Article Computer Science, Information Systems

Unsupervised spectral feature selection algorithms for high dimensional data

Mingzhao Wang, Henry Han, Zhao Huang, Juanying Xie

Summary: It is proposed in this paper to detect the informative features for high dimensional data with a small number of samples through two unsupervised spectral feature selection algorithms. These algorithms group features using an advanced Self-Tuning spectral clustering algorithm and detect the global optimal feature clusters through feature ranking techniques. Extensive experiments demonstrate the effectiveness of the proposed algorithms, especially the one based on cosine similarity feature ranking technique. The detected features have strong discriminative capabilities, making them suitable for building reliable and explainable AI systems, particularly in medical diagnostic systems.

FRONTIERS OF COMPUTER SCIENCE (2023)

Add to Collection

Article Computer Science, Artificial Intelligence

Unsupervised feature selection with robust data reconstruction (UFS-RDR) and outlier detection

Abdul Wahid, Dost Muhammad Khan, Ijaz Hussain, Sajjad Ahmad Khan, Zardad Khan

Summary: A novel robust unsupervised feature selection method, UFS-RDR, is proposed to improve feature selection performance by minimizing the graph regularized weighted data reconstruction error function, using Mahalanobis distance to detect outliers and determine Huber-type weight function. The experimental results show that UFS-RDR outperforms non-robust methods in the presence of contamination in unlabeled data.

EXPERT SYSTEMS WITH APPLICATIONS (2022)

Add to Collection

Article Biochemical Research Methods

Accelerating Big Data Analysis through LASSO-Random Forest Algorithm in QSAR Studies

Fahimeh Motamedi, Horacio Perez-Sanchez, Alireza Mehridehnavi, Afshin Fassihi, Fahimeh Ghasemi

Summary: This article discusses two approaches for quantitative structure-activity prediction studies, focusing on identifying appropriate molecular descriptors and predicting the biological activities of designed compounds. The use of LASSO-random forest algorithm is shown to significantly improve output correlation, reduce implementation time and model complexity, while maintaining prediction accuracy.

BIOINFORMATICS (2022)

Add to Collection

Article Genetics & Heredity

The Unsupervised Feature Selection Algorithms Based on Standard Deviation and Cosine Similarity for Genomic Data Analysis

Juanying Xie, Mingzhao Wang, Shengquan Xu, Zhao Huang, Philip W. Grant

Summary: In this paper, an unsupervised feature selection technique called SCFS is proposed to address challenges in genomic data analysis caused by high dimensionality and imbalanced class distribution. By defining discernibility and independence of features, an optimal feature subset with high classification capability is identified for KNN and SVM classifiers, leading to improved results in genomic datasets analysis.

FRONTIERS IN GENETICS (2021)

Add to Collection

Article Automation & Control Systems

Adaptive Graph Embedded Preserving Projection Learning for Feature Extraction and Selection

Shuping Zhao, Jigang Wu, Bob Zhang, Lunke Fei, Shuyi Li, Pengyang Zhao

Summary: This article proposes a novel adaptive graph embedded preserving projection learning method. By combining sparse graph learning and projection learning, it achieves feature extraction and selection, and has been proven effective and competitive through experiments.

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS (2023)

Add to Collection

Article Computer Science, Information Systems

Feature selection for label distribution learning via feature similarity and label correlation

Wenbin Qian, Yinsong Xiong, Jun Yang, Wenhao Shu

Summary: Feature selection is crucial in machine learning and data mining, and traditional methods may not be suitable for label distribution learning. This paper proposes a novel feature selection algorithm for label distribution learning, which utilizes neighborhood granularity, correlation coefficient, and sparse learning to improve effectiveness. Experimental results show that the proposed method outperforms five state-of-art algorithms on twelve datasets.

INFORMATION SCIENCES (2022)

Add to Collection

Article Chemistry, Multidisciplinary

Unsupervised and Supervised Feature Selection for Incomplete Data via L2,1-Norm and Reconstruction Error Minimization

Jun Cai, Linge Fan, Xin Xu, Xinrong Wu

Summary: This paper proposes unsupervised and supervised feature selection methods for incomplete data using L-2, L-1 norm and reconstruction error minimization methods. Experimental studies demonstrate the effectiveness of the proposed methods.

APPLIED SCIENCES-BASEL (2022)

Add to Collection

Article Multidisciplinary Sciences

Human disease prediction from microbiome data by multiple feature fusion and deep learning

Xingjian Chen, Zifan Zhu, Weitong Zhang, Yuchen Wang, Fuzhou Wang, Jianyi Yang, Ka-Chun Wong

Summary: Predicting human diseases from microbiome data is important in medical applications. Existing methods often overlook the abundance profiles of known and unknown microbial organisms, as well as the taxonomic relationships among them, resulting in information loss. To address these issues, we developed a comprehensive machine learning framework called MetaDR that combines deep learning and various information sources to predict human diseases.

ISCIENCE (2022)

Add to Collection

Article Environmental Sciences

Sharp Feature-Preserving 3D Mesh Reconstruction from Point Clouds Based on Primitive Detection

Qi Liu, Shibiao Xu, Jun Xiao, Ying Wang

Summary: This paper introduces a novel sharp-feature-preserving reconstruction framework based on primitive detection, which accurately segments primitive patches, fits meshes in each patch, and splits overlapping meshes at the triangle level to ensure true sharpness and obtain lightweight mesh models. Experimental results show that our framework outperforms both the state-of-the-art learning-based primitive detection methods and traditional reconstruction methods. Moreover, our designed modules are plug-and-play, and can be combined with other point cloud processing tasks to achieve high-fidelity results.

REMOTE SENSING (2023)

Add to Collection

Article Automation & Control Systems

Improved multiclass support vector data description for planetary gearbox fault diagnosis

Hui Hou, Hongquan Ji

Summary: A novel feature selection strategy is proposed to improve the multiclass support vector data description (SVDD) algorithm for planetary gearbox fault diagnosis. By selecting features sensitive to faults and developing an improved multiclass SVDD algorithm, the fault diagnosis task is effectively completed.

CONTROL ENGINEERING PRACTICE (2021)

Add to Collection

Article Engineering, Electrical & Electronic

Latent energy preserving embedding for unsupervised feature selection

Zihao Song, Peng Song

Summary: Feature selection is a fundamental and challenging topic in machine learning and pattern recognition, and unsupervised feature selection methods have received extensive attention. In this article, a novel latent energy preserving embedding method is proposed for unsupervised feature selection, which utilizes self-representation learning strategy and graph Laplacian for mining manifold information and selects features using l(2,1)-norm. Extensive experiments on real-world datasets validate the effectiveness of the proposed method.

DIGITAL SIGNAL PROCESSING (2022)

Add to Collection

Article Computer Science, Artificial Intelligence

Heterogeneous domain adaptation by Features Normalization and Data Topology Preserving

Mohammad Amin Pirbonyeh, Mohammad Amin Shayegan, Gholamreza Sotudeh, Shahab Shamshirband

Summary: Transfer Learning (TL) algorithms are effective methods for improving classifier learning by utilizing source domain knowledge in the target domain. Reducing the difference in feature space and distribution between domains is crucial for enhancing TL algorithms. Existing methods often employ complex computational structures but overlook the preservation of data topology. This paper proposes a unified framework called FN-DTP, which addresses heterogeneous domain adaptation problems by combining feature normalization, distribution reduction, and topology preservation, resulting in improved TL algorithm performance.

KNOWLEDGE-BASED SYSTEMS (2022)

Add to Collection

Article Computer Science, Artificial Intelligence

ASMFS: Adaptive-similarity-based multi-modality feature selection for classification of Alzheimer's disease

Yuang Shi, Chen Zu, Mei Hong, Luping Zhou, Lei Wang, Xi Wu, Jiliu Zhou, Daoqiang Zhang, Yan Wang

Summary: Multimodal classification methods using different modalities have advantages over traditional single-modality-based ones for the diagnosis of Alzheimer's disease and mild cognitive impairment. This paper proposes a novel multimodal feature selection method called ASMFS, which performs adaptive similarity learning and feature selection simultaneously, and demonstrates its effectiveness and superiority over other state-of-the-art approaches for multi-modality classification of AD/MCI.

PATTERN RECOGNITION (2022)

Add to Collection

No Data Available

No Data Available

© Peeref 2019-2024. All rights reserved.