4.2 Article

Cross-Project Defect Prediction via Semi-Supervised Discriminative Feature Learning

Journal

IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS
Volume E103D, Issue 10, Pages 2237-2240

Publisher

IEICE-INST ELECTRONICS INFORMATION COMMUNICATION ENGINEERS
DOI: 10.1587/transinf.2020EDL8044

Keywords

cross-project defection prediction; semi-supervised learning; discriminative feature learning

Funding

  1. NSFC-Key Project of General Technology Fundamental Research United Fund [U1736211]
  2. Natural Science Foundation Key Project for Innovation Group of Hubei Province [2018CFA024]
  3. National Natural Science Foundation of China [61702280]
  4. National Postdoctoral Program for Innovative Talents [BX20180146]
  5. Natural Science Foundation of Jiangsu Province [BK20170900]
  6. China Postdoctoral Science Foundation [2019M661901]
  7. Jiangsu Planned Projects for Postdoctoral Research Funds [2019K024]
  8. CCF-Tencent Open Fund WeBank Special Funding [CCF-WebankRAGR20190104]

Ask authors/readers for more resources

Cross-project defect prediction (CPDP) is a feasible solution to build an accurate prediction model without enough historical data. Although existing methods for CPDP that use only labeled data to build the prediction model achieve great results, there are much room left to further improve on prediction performance. In this paper we propose a Semi-Supervised Discriminative Feature Learning (SSDFL) approach for CPDP. SSDFL first transfers knowledge of source and target data into the common space by using a fully-connected neural network to mine potential similarities of source and target data. Next, we reduce the differences of both marginal distributions and conditional distributions between mapped source and target data. We also introduce the discriminative feature learning to make full use of label information, which is that the instances from the same class are close to each other and the instances from different classes are distant from each other. Extensive experiments are conducted on 10 projects from AEEEM and NASA datasets, and the experimental results indicate that our approach obtains better prediction performance than baselines.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.2
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available