☆ 4.7 Article

ISBFK-means: A new clustering algorithm based on influence space

EXPERT SYSTEMS WITH APPLICATIONS (2022)

Journal

EXPERT SYSTEMS WITH APPLICATIONS

Volume 201, Issue -, Pages -

Publisher

PERGAMON-ELSEVIER SCIENCE LTD

DOI: 10.1016/j.eswa.2022.117018

Keywords

Clustering; Influence space; Region partition; Representative data objects

Funding

National Natural Science Foundation of China [U1931209]
Key Research and Development Projects of Shanxi Province, China [201903D121116]
central government guides local science and technology development funds, China [20201070]
Fundamental Research Program of Shanxi Province, China [20210302123223, 202103021224275]
National Development and Reform Commission, China

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

In this paper, a new clustering algorithm named ISBFK-means based on the influence space is proposed to address the issues of huge time overhead and unstable clustering quality when running the K-means algorithm on massive raw data. The approach effectively reduces data volume in the clustering process and improves the stability of clustering quality. Experimental results demonstrate the algorithm's high performance in processing celestial spectral data.

The time overhead is huge and the clustering quality is unstable when running the K-means algorithm on massive raw data. To solve these problems, the concept of the influence space is introduced, and on this basis, a new clustering algorithm named ISBFK-means based on the influence space is proposed in this paper. First, the influence space divides the given data set into multiple small regions. Then, the representative data objects in each region are obtained to form a new data set, in which the class labels of representative data objects are those of all the data objects in the correlation influence space. Next, the K-means clustering is performed on the new data set, thereby obtaining the final clustering result. Theoretical analysis and experimental results show that this approach effectively reduces the amount of data in the clustering process and improves the stability of clustering quality. As a major feature of this work, the celestial spectral data observed by the LAMOST survey are especially employed to verify the algorithm ISBFK-means. The experimental results indicate that this algorithm has higher performance than other similar algorithms on the correctness, efficiency and sensitivity to the quality of spectral data.

ISBFK-means: A new clustering algorithm based on influence space

Journal

EXPERT SYSTEMS WITH APPLICATIONS

Publisher

PERGAMON-ELSEVIER SCIENCE LTD

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

ISBFK-means: A new clustering algorithm based on influence space

Journal

EXPERT SYSTEMS WITH APPLICATIONS

Publisher

PERGAMON-ELSEVIER SCIENCE LTD

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper