4.2 Article

A comparison of extrinsic clustering evaluation metrics based on formal constraints

期刊

INFORMATION RETRIEVAL
卷 12, 期 4, 页码 461-486

出版社

SPRINGER
DOI: 10.1007/s10791-008-9066-8

关键词

Clustering; Evaluation metrics; Formal constraints

资金

  1. QEAVIS [TIN2007-67581-C02-01]
  2. INES/Text-Mess [TIN2006-15265-C06-02]

向作者/读者索取更多资源

There is a wide set of evaluation metrics available to compare the quality of text clustering algorithms. In this article, we define a few intuitive formal constraints on such metrics which shed light on which aspects of the quality of a clustering are captured by different metric families. These formal constraints are validated in an experiment involving human assessments, and compared with other constraints proposed in the literature. Our analysis of a wide range of metrics shows that only BCubed satisfies all formal constraints. We also extend the analysis to the problem of overlapping clustering, where items can simultaneously belong to more than one cluster. As Bcubed cannot be directly applied to this task, we propose a modified version of Bcubed that avoids the problems found with other metrics.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.2
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据