☆ 4.2 Article

On the Stability of Feature Selection Methods in Software Quality Prediction: An Empirical Investigation

INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING (2015)

期刊

INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING

卷 25, 期 9-10, 页码 1467-1490

出版社

WORLD SCIENTIFIC PUBL CO PTE LTD

DOI: 10.1142/S0218194015400288

关键词

Software metrics; feature selection; fixed-overlap partitions; stability

类别

Computer Science, Artificial Intelligence Computer Science, Software Engineering Engineering, Electrical & Electronic

资金

Division Of Computer and Network Systems
Direct For Computer & Info Scie & Enginr [1427536] Funding Source: National Science Foundation

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Software quality modeling is the process of using software metrics from previous iterations of development to locate potentially faulty modules in current under-development code. This has become an important part of the software development process, allowing practitioners to focus development efforts where they are most needed. One difficulty encountered in software quality modeling is the problem of high dimensionality, where the number of available software metrics is too large for a classifier to work well. In this case, many of the metrics may be redundant or irrelevant to defect prediction results, thereby selecting a subset of software metrics that are the best predictors becomes important. This process is called feature (metric) selection. There are three major forms of feature selection: filter-based feature rankers, which uses statistical measures to assign a score to each feature and present the user with a ranked list; filter-based feature subset evaluation, which uses statistical measures on feature subsets to find the best feature subset; and wrapper-based subset selection, which builds classification models using different subsets to find the one which maximizes performance. Software practitioners are interested in which feature selection methods are best at providing the most stable feature subset in the face of changes to the data (here, the addition or removal of instances). In this study we select feature subsets using fifteen feature selection methods and then use our newly proposed Average Pairwise Tanimoto Index (APTI) to evaluate the stability of the feature selection methods. We evaluate the stability of feature selection methods on a pair of subsamples generated by our fixed-overlap partitions algorithm. Four different levels of overlap are considered in this study. 13 software metric datasets from two real-world software projects are used in this study. Results demonstrate that ReliefF (RF) is the most stable feature selection method and wrapper based feature subset selection shows least stability. In addition, as the overlap of partitions increased, the stability of the feature selection strategies increased.

作者

我是这篇论文的作者

点击您的名字以认领此论文并将其添加到您的个人资料中。

主要评分

4.2

评分不足

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

A Comprehensive Investigation of the Impact of Class Overlap on Software Defect Prediction

Lina Gong, Haoxiang Zhang, Jingxuan Zhang, Mingqiang Wei, Zhiqiu Huang

Summary: Software Defect Prediction (SDP) is an important operation to ensure software quality, but class overlap in SDP datasets hinders performance. In this empirical study, we propose an approach to identify overlapping instances and investigate the impact of class overlap on the performance and interpretation of seven SDP models. We find that 70.0% of SDP datasets have overlapping instances and different levels of class overlap affect SDP model performance and feature ranking. Handling class overlap can significantly improve SDP model performance on datasets with over 12.5% overlap ratios.

IEEE TRANSACTIONS ON SOFTWARE ENGINEERING (2023)