4.3 Article

A cluster boundary detection algorithm based on shadowed set

Journal

INTELLIGENT DATA ANALYSIS
Volume 20, Issue 1, Pages 29-45

Publisher

IOS PRESS
DOI: 10.3233/IDA-150792

Keywords

Mixed attributes; high-dimensional data; cluster boundary; shadowed set

Funding

  1. Science and Technology Research Key Project of Henan Province of China [9412013Y1486]

Ask authors/readers for more resources

To meet the need of extracting cluster boundary from mixed attribute data in the field of data analysis, we propose a cluster boundary detection algorithm for mixed attribute data sets in this research, named CHASM(Cluster Boundary Detection Algorithm based on Shadowed Set). Based on the structure of clusters, the CHASM defines a new objective function according to the data set which is categorized into three collections, i. e. core, exclusion and shadow. Then CHASM updates the centroid information of clusters based on the variance of contribution degree among these collections to the clusters centroids. Finally, in an iterative optimization process, the CHASM can extract its shadow sets from each cluster to form the boundary of clusters. The experimental results, on both the synthetic data and real data with mixed attributes, numerical attributes and categorical attributes, show that CHASM can effectively detect cluster boundary with higher or similar accuracy to its rival methods. Furthermore, the CHASM can eliminate noise effectively.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.3
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available