4.6 Article

Kernel Probabilistic K-Means Clustering

期刊

SENSORS
卷 21, 期 5, 页码 -

出版社

MDPI
DOI: 10.3390/s21051892

关键词

fuzzy c-means; kernel probabilistic k-means; nonlinear programming; fast active gradient projection

资金

  1. National Natural Science Foundation of China [61876010, 61806013, 61906005]

向作者/读者索取更多资源

The proposed Kernel Probabilistic K-means (KPKM) is an improved version of Kernel Fuzzy C-means (KFCM) that can handle linearly inseparable datasets, and it is solved using the active gradient projection (AGP) method. KPKM shows effectiveness in finding nonlinearly separable structures in both synthetic and real datasets, with better clustering performance on at least six real datasets. The proposed Fast AGP (FAGP) algorithm reduces running time by 76-95% on real datasets compared to the original AGP.
Kernel fuzzy c-means (KFCM) is a significantly improved version of fuzzy c-means (FCM) for processing linearly inseparable datasets. However, for fuzzification parameter m=1, the problem of KFCM (kernel fuzzy c-means) cannot be solved by Lagrangian optimization. To solve this problem, an equivalent model, called kernel probabilistic k-means (KPKM), is proposed here. The novel model relates KFCM to kernel k-means (KKM) in a unified mathematic framework. Moreover, the proposed KPKM can be addressed by the active gradient projection (AGP) method, which is a nonlinear programming technique with constraints of linear equalities and linear inequalities. To accelerate the AGP method, a fast AGP (FAGP) algorithm was designed. The proposed FAGP uses a maximum-step strategy to estimate the step length, and uses an iterative method to update the projection matrix. Experiments demonstrated the effectiveness of the proposed method through a performance comparison of KPKM with KFCM, KKM, FCM and k-means. Experiments showed that the proposed KPKM is able to find nonlinearly separable structures in synthetic datasets. Ten real UCI datasets were used in this study, and KPKM had better clustering performance on at least six datsets. The proposed fast AGP requires less running time than the original AGP, and it reduced running time by 76-95% on real datasets.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据