Journal
JOURNAL OF MULTIVARIATE ANALYSIS
Volume 196, Issue -, Pages -Publisher
ELSEVIER INC
DOI: 10.1016/j.jmva.2023.105181
Keywords
Cluster validation; Mixture models; Model-based clustering; Resampling methods
Categories
Ask authors/readers for more resources
Cluster analysis requires fixing the number of clusters and many hyper-parameters. Multiple partitions are produced, and a final one is chosen based on validation or selection criteria. This paper focuses on groups that can be well separated by quadratic or linear boundaries. The proposed methodology achieves a better overall performance compared to established alternatives from the literature.
Cluster analysis requires fixing the number of clusters and often many hyper-parameters. In practice, one produces several partitions, and a final one is chosen based on validation or selection criteria. There exist an abundance of validation methods that, implicitly or explicitly, assume a certain clustering notion. In this paper, we focus on groups that can be well separated by quadratic or linear boundaries. The reference cluster concept is defined through the quadratic discriminant function and parameters describing clusters' size, center and scatter. We develop two cluster-quality criteria that are consistent with groups generated from a class of elliptic-symmetric distributions. Using the bootstrap resampling of the proposed criteria, we propose a selection rule that allows choosing among many clustering solutions, eventually obtained from different methods. Extensive experimental analysis shows that the proposed methodology achieves a better overall performance compared to established alternatives from the literature.(c) 2023 The Author(s). Published by Elsevier Inc. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available