Article

DBMUTE: density-based majority under-sampling technique

Journal

KNOWLEDGE AND INFORMATION SYSTEMS
Volume 50, Issue 3, Pages 827-850

Publisher

SPRINGER LONDON LTD
DOI: 10.1007/s10115-016-0957-5

Keywords

Pattern recognition; Class imbalance; Under-sampling; Density-based

Funding

  1. Thailand Research Fund [TRG5680082]

Class imbalance is a challenging problem in which classifiers perform unsatisfactorily on the minority class. A trivial classifier is biased against minority instances because of their tiny fraction. In this paper, our density function is defined as the distance along the shortest path between each majority instance and a minority-cluster pseudo-centroid in an underlying cluster graph. A short path implies highly overlapping, dense minority instances. In contrast, a long path indicates a sparsity of instances. A new under-sampling algorithm is proposed to eliminate majority instances with low distances, because these instances are insignificant and obscure the classification boundary in the overlapping region. The results show predictive improvements on the minority class for various classifiers on different UCI datasets.
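
The idea in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the abstract does not specify how the cluster graph is built, how the pseudo-centroid is chosen, or how the distance cutoff is set. The k-nearest-neighbour graph, the single minority cluster, the choice of pseudo-centroid (the minority instance closest to the minority mean), and the names `knn_graph`, `shortest_paths`, `undersample_majority`, `k`, and `threshold` are all assumptions made for illustration.

```python
import heapq
import math


def knn_graph(points, k):
    """Undirected k-nearest-neighbour graph with Euclidean edge weights.

    Assumption: stands in for the paper's "underlying cluster graph".
    """
    n = len(points)
    graph = {i: {} for i in range(n)}
    for i in range(n):
        nearest = sorted(
            (math.dist(points[i], points[j]), j) for j in range(n) if j != i
        )[:k]
        for d, j in nearest:
            graph[i][j] = d
            graph[j][i] = d
    return graph


def shortest_paths(graph, source):
    """Dijkstra's algorithm; unreachable nodes keep distance infinity."""
    dist = {node: math.inf for node in graph}
    dist[source] = 0.0
    heap = [(0.0, source)]
    while heap:
        d, u = heapq.heappop(heap)
        if d > dist[u]:
            continue  # stale heap entry
        for v, w in graph[u].items():
            nd = d + w
            if nd < dist[v]:
                dist[v] = nd
                heapq.heappush(heap, (nd, v))
    return dist


def undersample_majority(points, labels, minority=1, k=3, threshold=1.5):
    """Return indices to keep: every minority instance, plus majority
    instances whose graph distance to the pseudo-centroid exceeds
    `threshold` (a hypothetical cutoff, not taken from the paper)."""
    minority_idx = [i for i, y in enumerate(labels) if y == minority]
    dim = len(points[0])
    mean = [
        sum(points[i][d] for i in minority_idx) / len(minority_idx)
        for d in range(dim)
    ]
    # Assumption: the pseudo-centroid is the minority instance nearest the mean.
    centroid = min(minority_idx, key=lambda i: math.dist(points[i], mean))
    dist = shortest_paths(knn_graph(points, k), centroid)
    return [
        i for i, y in enumerate(labels)
        if y == minority or dist[i] > threshold
    ]


# Toy data: three minority points, one majority point inside the minority
# region, and four majority points far away.
points = [(0, 0), (0, 1), (1, 0), (0.5, 0.5), (5, 5), (5, 6), (6, 5), (6, 6)]
labels = [1, 1, 1, 0, 0, 0, 0, 0]
keep = undersample_majority(points, labels)
print(keep)
```

In this toy example the majority point at (0.5, 0.5) lies a short path from the pseudo-centroid and is dropped, while the distant majority points are unreachable in the graph (distance infinity) and are kept. Treating unreachable instances as "far" is a design choice of this sketch.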
