☆ 4.7 Article

An experimental study of constrained clustering effectiveness in presence of erroneous constraints

INFORMATION PROCESSING & MANAGEMENT (2012)

期刊

INFORMATION PROCESSING & MANAGEMENT

卷 48, 期 3, 页码 537-551

出版社

ELSEVIER SCI LTD

DOI: 10.1016/j.ipm.2011.08.006

关键词

Algorithms; Clustering; Constrained clustering; Erroneous constraints; Experimentation

类别

Computer Science, Information Systems Information Science & Library Science

资金

Ministerio de Ciencia e Innovacion from the Spanish Government [TIN2008-06566-C04-04]
Ministerio de Educacion from the Spanish Government [AP2007-02476]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Recently a new fashion of semi-supervised clustering algorithms, coined as constrained clustering, has emerged. These new algorithms can incorporate some a priori domain knowledge to the clustering process, allowing the user to guide the method. The vast majority of studies about the effectiveness of these approaches have been performed using information, in the form of constraints, which was totally accurate. This would be the ideal case, but such a situation will be impossible in most realistic settings, due to errors in the constraint creation process, misjudgements of the user, inconsistent information, etc. Hence, the robustness of the constrained clustering algorithms when dealing with erroneous constraints is bound to play an important role in their final effectiveness. In this paper we study the behaviour of four constrained clustering algorithms (Constrained k-Means, Soft Constrained k-Means, Constrained Normalised Cut and Normalised Cut with Imposed Constraints) when not all the information supplied to them is accurate. The experimentation over text and numeric datasets using two different noise models, one of them an original approach based on similarities, highlighted the strengths and weaknesses of each method when working with positive and negative constraints, indicating the scenarios in which each algorithm is more appropriate. (C) 2011 Elsevier Ltd. All rights reserved.

An experimental study of constrained clustering effectiveness in presence of erroneous constraints

期刊

INFORMATION PROCESSING & MANAGEMENT

出版社

ELSEVIER SCI LTD

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

An experimental study of constrained clustering effectiveness in presence of erroneous constraints

期刊

INFORMATION PROCESSING & MANAGEMENT

出版社

ELSEVIER SCI LTD

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文