Article

Comparing the effectiveness of several modeling methods for fault prediction

Journal

EMPIRICAL SOFTWARE ENGINEERING
Volume 15, Issue 3, Pages 277-295

Publisher

SPRINGER
DOI: 10.1007/s10664-009-9111-2

Keywords

Empirical study; Fault prediction; Negative binomial; Recursive partitioning; Random forests; Bayesian trees; Fault-percentile-average


We compare the effectiveness of four modeling methods (negative binomial regression, recursive partitioning, random forests, and Bayesian additive regression trees) for predicting the files likely to contain the most faults, across 28 to 35 releases of three large industrial software systems. Predictor variables included lines of code, file age, faults in the previous release, changes in the previous two releases, and programming language. To compare the effectiveness of the different models, we use two metrics: the percentage of faults contained in the top 20% of files identified by the model, and a new, more general metric, the fault-percentile-average. The negative binomial regression and random forests models performed significantly better than recursive partitioning and Bayesian additive regression trees, as assessed by either metric. For each of the three systems, the 20% of files identified in each release by the negative binomial and random forests models contained, on average, 76% to 94% of the faults.
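
The two evaluation metrics can be illustrated with a minimal Python sketch. The abstract does not give a formula for the fault-percentile-average, so the version here is an assumption: it averages, over every cutoff m from 1 to K files, the share of actual faults found in the m files ranked highest by the model. The function names, the floor rounding of the 20% cutoff, and the sample data are all hypothetical and are not taken from the paper.

```python
from typing import Sequence


def _ranked_indices(predicted: Sequence[float]) -> list[int]:
    """Indices of files ordered from highest to lowest predicted fault count."""
    return sorted(range(len(predicted)), key=lambda i: predicted[i], reverse=True)


def top_fraction_fault_share(predicted: Sequence[float],
                             actual: Sequence[int],
                             fraction: float = 0.2) -> float:
    """Share of actual faults found in the top `fraction` of files by prediction.

    The cutoff is rounded down to a whole number of files (an assumption).
    """
    total = sum(actual)
    if total == 0:
        return 0.0
    cutoff = int(len(predicted) * fraction)
    top = _ranked_indices(predicted)[:cutoff]
    return sum(actual[i] for i in top) / total


def fault_percentile_average(predicted: Sequence[float],
                             actual: Sequence[int]) -> float:
    """Assumed definition: mean over all cutoffs m = 1..K of the share of
    actual faults contained in the m highest-ranked files."""
    total = sum(actual)
    if total == 0 or len(predicted) == 0:
        return 0.0
    running = 0   # faults captured by the files seen so far
    acc = 0.0     # sum of captured shares over all cutoffs
    for i in _ranked_indices(predicted):
        running += actual[i]
        acc += running / total
    return acc / len(predicted)


# Hypothetical release with ten files: model scores and observed fault counts.
scores = [9, 8, 7, 6, 5, 4, 3, 2, 1, 0]
faults = [6, 2, 0, 1, 0, 0, 1, 0, 0, 0]
print(top_fraction_fault_share(scores, faults))
print(fault_percentile_average(scores, faults))
```

Under this reading, the top-20% figure is a single point on the curve that the fault-percentile-average summarizes, which is one way to see why the abstract calls the latter more general.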
