4.7 Article

A model for clustering data from heterogeneous dissimilarities

Journal

EUROPEAN JOURNAL OF OPERATIONAL RESEARCH
Volume 253, Issue 3, Pages 659-672

Publisher

ELSEVIER
DOI: 10.1016/j.ejor.2016.03.033

Keywords

Data mining; Clustering; Heterogeneity; Optimization; Heuristics

Funding

  1. National Council for Scientific and Technological Development - CNPq/Brazil [471143/2012-0]
  2. CAPES/Brazil

Clustering algorithms partition a set of n objects into p groups (called clusters), such that objects assigned to the same group are homogeneous according to some criterion. To derive these clusters, the required input is often a single n x n dissimilarity matrix. Yet in many applications more than one dissimilarity matrix is available, and to conform to model requirements it is common practice to aggregate the matrices (e.g., by summing or averaging them). This aggregation results in clustering solutions that mask the true nature of the original data. In this paper we introduce a clustering model that handles this heterogeneity by using all available dissimilarity matrices and identifying groups of individuals who cluster the objects in a similar way. The model is a nonconvex problem that is difficult to solve exactly, so we introduce a Variable Neighborhood Search heuristic to provide solutions efficiently. Computational experiments and an empirical application to the perception of chocolate candy show that the heuristic algorithm is efficient and that the proposed model is suited to recovering heterogeneous data. Implications for clustering researchers are discussed. (C) 2016 Elsevier B.V. All rights reserved.
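
The masking effect of aggregation described in the abstract can be illustrated with a toy example. The sketch below is not the paper's model or its Variable Neighborhood Search heuristic; it assumes two hypothetical individuals who group four objects differently, uses a within-cluster sum of dissimilarities as the clustering criterion (an assumption chosen for simplicity), and enumerates all partitions by brute force. All function and variable names are illustrative.

```python
from itertools import product

import numpy as np


def block_dissimilarity(labels):
    """Dissimilarity matrix that is 0 within a group and 1 between groups."""
    labels = np.asarray(labels)
    return (labels[:, None] != labels[None, :]).astype(float)


def within_cost(d, labels):
    """Sum of dissimilarities over unordered pairs sharing a cluster."""
    labels = np.asarray(labels)
    same = labels[:, None] == labels[None, :]
    return float(np.triu(d * same, k=1).sum())


def best_partitions(d, p=2):
    """Enumerate every partition into exactly p clusters (toy sizes only).

    Returns the minimum within-cluster cost and the set of partitions
    attaining it; each partition is a frozenset of frozensets.
    """
    n = d.shape[0]
    best, winners = None, set()
    for labels in product(range(p), repeat=n):
        if len(set(labels)) != p:
            continue  # skip labellings that leave a cluster empty
        # Canonical form, so relabelled copies of a partition count once.
        groups = frozenset(
            frozenset(i for i in range(n) if labels[i] == c) for c in range(p)
        )
        cost = within_cost(d, labels)
        if best is None or cost < best - 1e-12:
            best, winners = cost, {groups}
        elif abs(cost - best) <= 1e-12:
            winners.add(groups)
    return best, winners


# Hypothetical individual A sees {0, 1} and {2, 3} as the natural groups;
# hypothetical individual B sees {0, 2} and {1, 3}.
d_a = block_dissimilarity([0, 0, 1, 1])
d_b = block_dissimilarity([0, 1, 0, 1])
d_avg = (d_a + d_b) / 2.0  # the common aggregation practice

cost_a, parts_a = best_partitions(d_a)
cost_b, parts_b = best_partitions(d_b)
cost_avg, parts_avg = best_partitions(d_avg)

print("A:", cost_a, parts_a)
print("B:", cost_b, parts_b)
print("averaged:", cost_avg, parts_avg)
```

In this toy case each individual's matrix has a single zero-cost partition, whereas the averaged matrix yields a tie between the two individuals' partitions at a strictly positive cost, so the aggregate no longer indicates that two distinct views of the data exist.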
