4.0 Article

HOW THE INITIALIZATION AFFECTS THE STABILITY OF THE k-MEANS ALGORITHM

期刊

ESAIM-PROBABILITY AND STATISTICS
卷 16, 期 -, 页码 436-452

出版社

EDP SCIENCES S A
DOI: 10.1051/ps/2012013

关键词

Clustering; k-means; stability; model selection

资金

  1. NSF [IIS-031339]

向作者/读者索取更多资源

We investigate the role of the initialization for the stability of the k-means clustering algorithm. As opposed to other papers, we consider the actual k-means algorithm (also known as Lloyd algorithm). In particular we leverage on the property that this algorithm can get stuck in local optima of the k-means objective function. We are interested in the actual clustering, not only in the costs of the solution. We analyze when different initializations lead to the same local optimum, and when they lead to different local optima. This enables us to prove that it is reasonable to select the number of clusters based on stability scores.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.0
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据