Article

A variable-level automated defect identification model based on machine learning

SOFT COMPUTING (2020)

Journal

SOFT COMPUTING

Volume 24, Issue 2, Pages 1045-1061

Publisher

SPRINGER

DOI: 10.1007/s00500-019-03942-3

Keywords

Machine learning; Static analysis; Automated defect identification; Alarm classification; Model evaluation

Categories

Computer Science, Artificial Intelligence; Computer Science, Interdisciplinary Applications

Funding

National Natural Science Foundation of China [61702044, U1736110]
National Key Research and Development Program of China [2016YFF0204002]

Abstract

Static analysis tools, which automatically detect potential source code defects early in the software development process, are widely applied in safety-critical software fields. However, the alarms these tools report must be inspected manually by developers, which is costly, and a large proportion of them turn out to be false positives. To automatically classify the reported alarms into true defects and false positives, we propose a defect identification model based on machine learning. We design a set of novel variable-level features, called variable characteristics, for building the classification model; these are more fine-grained than existing traditional features. We select 13 base classifiers and two ensemble learning methods for model building, and alarms classified as unactionable (false positives) are pruned to reduce the effort of manual inspection. We first evaluate the approach on four open-source C projects, and the classification results show that the proposed model achieves high performance and reliability in practice. We then conduct a baseline experiment comparing the proposed model against traditional features, which indicates that variable-level features significantly improve defect identification performance. Additionally, we use machine learning techniques to rank the variable characteristics in order to identify the contribution of each feature to the proposed model.
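The classify-then-prune idea in the abstract can be illustrated with a toy sketch. This is not the authors' model. The three features (`is_pointer`, `assignments_before_use`, `null_checks_before_use`), the training data, and all function names are hypothetical, and a majority vote over one-feature decision stumps stands in for the 13 base classifiers and two ensemble methods the paper mentions.

```python
from collections import Counter

# Hypothetical variable-level features per alarm (assumed, not from the paper):
# (is_pointer, assignments_before_use, null_checks_before_use)
# Label 1 = true defect, label 0 = false positive. The data is invented.
TRAIN = [
    ((1, 0, 0), 1),
    ((1, 1, 0), 1),
    ((1, 0, 1), 0),
    ((0, 2, 1), 0),
    ((1, 3, 2), 0),
    ((0, 0, 0), 1),
]

def train_stump(samples, feature):
    """Fit a one-feature threshold classifier (a decision stump)."""
    best = None
    for threshold in sorted({x[feature] for x, _ in samples}):
        for polarity in (0, 1):
            # polarity is the label predicted when the value is <= threshold
            correct = sum(
                (polarity if x[feature] <= threshold else 1 - polarity) == y
                for x, y in samples
            )
            if best is None or correct > best[0]:
                best = (correct, threshold, polarity)
    _, threshold, polarity = best
    return lambda x: polarity if x[feature] <= threshold else 1 - polarity

def train_ensemble(samples):
    """Train one stump per feature; together they form a voting ensemble."""
    n_features = len(samples[0][0])
    return [train_stump(samples, f) for f in range(n_features)]

def predict(ensemble, x):
    """Majority vote over the base classifiers (odd count, so no ties)."""
    votes = Counter(clf(x) for clf in ensemble)
    return votes.most_common(1)[0][0]

def prune(ensemble, alarms):
    """Keep only the alarms predicted to be true defects."""
    return [a for a in alarms if predict(ensemble, a) == 1]

if __name__ == "__main__":
    ensemble = train_ensemble(TRAIN)
    new_alarms = [(1, 0, 0), (1, 2, 1), (0, 1, 0)]
    print(prune(ensemble, new_alarms))
```

In this sketch, only the alarms that survive `prune` would be passed on for manual inspection, which is the effort reduction the abstract describes.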

Attention-Guided Multitask Learning for Surface Defect Identification

Vignesh Sampath, Inaki Maurtua, Juan Jose Aguilar Martin, Andoni Rivera, Jorge Molina, Aitor Gutierrez

Summary: This article presents a method for improving the generalization ability of surface defect identification tasks by exploiting auxiliary information beyond the primary labels. By jointly learning features of pixel-level segmentation masks, object-level bounding boxes, and global image-level classification labels, the proposed method significantly improves the performance of state-of-the-art models. Experimental results show an overall accuracy of 97.1%, a Dice score of 0.926, and a mean average precision of 0.762 on defect classification, segmentation, and detection tasks.

IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS (2023)