Journal
INTELLIGENT DATA ANALYSIS
Volume 22, Issue 1, Pages 77-101Publisher
IOS PRESS
DOI: 10.3233/IDA-163222
Keywords
Data clustering; co-evolutionary algorithm; genetic algorithm; multidimensional clustering; feature selection
Categories
Funding
- Research Center of the Female Scientific and Medical Colleges, Deanship of Scientific Research, King Saud University
Ask authors/readers for more resources
Data clustering refers to constructing groups of objects that are highly correlated, based on some similarity measure. It is a very popular technique for intelligent knowledge discovery. A challenge that arises in automatic data clustering, though, is the high dimensionality of data, since each object can be described by several relevant features. Thus, we often need to assign a relative weight for each feature to indicate its importance during the clustering process. With the absence of domain knowledge about the nature of data, assigning such weights becomes a challenging task. Dynamic adjustment of feature weights in an unsupervised manner is an attractive solution for such problem. In this paper, we propose a co-evolutionary algorithm for the dynamic adjustment of feature weights during data clustering. Two populations are simultaneously evolved for the optimization of both the clusters and their associated feature weights. In addition, the number of clusters are also learnt and optimized in the evolutionary process. Extensive experimental results on several datasets from UCI machine learning repository indicate the efficacy of the proposed approach. The algorithm outperforms both a non-adaptive version, where feature weights are not considered, as well as K-means clustering for a fixed number of clusters.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available