4.6 Article

A variable-level automated defect identification model based on machine learning

期刊

SOFT COMPUTING
卷 24, 期 2, 页码 1045-1061

出版社

SPRINGER
DOI: 10.1007/s00500-019-03942-3

关键词

Machine learning; Static analysis; Automated defect identification; Alarm classification; Model evaluation

资金

  1. National Natural Science Foundation of China [61702044, U1736110]
  2. National Key Research and Development Program of China [2016YFF0204002]

向作者/读者索取更多资源

Static analysis tools, automatically detecting potential source code defects at an early phase during the software development process, are diffusely applied in safety-critical software fields. However, alarms reported by the tools need to be inspected manually by developers, which is inevitable and costly, whereas a large proportion of them are found to be false positives. Aiming at automatically classifying the reported alarms into true defects and false positives, we propose a defect identification model based on machine learning. We design a set of novel features at variable level, called variable characteristics, for building the classification model, which is more fine-grained than the existing traditional features. We select 13 base classifiers and two ensemble learning methods for model building based on our proposed approach, and the reported alarms classified as unactionable (false positives) are pruned for the purpose of mitigating the effort of manual inspection. In this paper, we firstly evaluate the approach on four open-source C projects, and the classification results show that the proposed model achieves high performance and reliability in practice. Then, we conduct a baseline experiment to evaluate the effectiveness of our proposed model in contrast to traditional features, indicating that features at variable level improve the performance significantly in defect identification. Additionally, we use machine learning techniques to rank the variable characteristics in order to identify the contribution of each feature to our proposed model.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Computer Science, Software Engineering

Automated defect identification via path analysis-based features with transfer learning

Yuwei Zhang, Dahai Jin, Ying Xing, Yunzhan Gong

JOURNAL OF SYSTEMS AND SOFTWARE (2020)

Article Computer Science, Artificial Intelligence

Cross-project defect prediction based on G-LSTM model

Ying Xing, Xiaomeng Qian, Yu Guan, Bin Yang, Yuwei Zhang

Summary: Cross-project defect prediction is a popular research direction in software reliability, and traditional methods struggle in capturing the semantic and contextual information of programs. This paper applies technology from the NLP domain and proposes a deep learning model, Generative Adversarial Long-Short Term Memory Neural Networks (G-LSTM), to automatically learn the semantic and contextual features of programs. Experimental results show that the proposed method outperforms traditional and state-of-the-art methods in evaluation metrics.

PATTERN RECOGNITION LETTERS (2022)

Article Computer Science, Information Systems

ST-TLF: Cross-version defect prediction framework based transfer learning

Yanyang Zhao, Yawen Wang, Yuwei Zhang, Dalin Zhang, Yunzhan Gong, Dahai Jin

Summary: The ST-TLF framework proposed in this study can effectively perform cross-version defect prediction and improve the accuracy of CVDP. By selecting the best training set and eliminating concept drift, ST-TLF overcomes the limitations of previous research and shows significant improvements in various metrics.

INFORMATION AND SOFTWARE TECHNOLOGY (2022)

Proceedings Paper Computer Science, Software Engineering

The Impact of Feature Selection Techniques on Software Defect Identification Models

Huiquan Gong, Yuwei Zhang

Summary: Researchers have utilized artificial intelligence to improve defect identification in software quality, proposing a feature selection method based on majority voting which achieved optimal performance at a ratio of 20%. This finding can serve as a practical guideline for software defect identification.

PROCEEDINGS OF 2021 IEEE 12TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS) (2021)

Proceedings Paper Computer Science, Artificial Intelligence

Detecting Interprocedural Infeasible Paths via Symbolic Propagation and Dataflow Analysis

Huiquan Gong, Yuwei Zhang, Ying Xing, Wei Jia

PROCEEDINGS OF 2019 IEEE 10TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS 2019) (2019)

Proceedings Paper Computer Science, Software Engineering

Whole Program Paths Generation Method

Wei Jia, Yawen Wang, Yuwei Zhang, Yunzhan Gong

PROCEEDINGS OF 2018 IEEE 9TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS) (2018)

暂无数据