☆ 4.7 Article

Clustering documents with labeled and unlabeled documents using fuzzy semi-Kmeans

FUZZY SETS AND SYSTEMS (2013)

期刊

FUZZY SETS AND SYSTEMS

卷 221, 期 -, 页码 48-64

出版社

ELSEVIER

DOI: 10.1016/j.fss.2013.01.004

关键词

Fuzzy clustering; Semi-supervised learning; Text mining; Fuzzy semi-Kmeans

类别

Computer Science, Theory & Methods Mathematics, Applied Statistics & Probability

资金

National Science Council [NSC-101-2221-E-009-163]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

While focusing on document clustering, this work presents a fuzzy semi-supervised clustering algorithm called fuzzy semi-Kmeans. The fuzzy semi-Kmeans is an extension of K-means clustering model, and it is inspired by an EM algorithm and a Gaussian mixture model. Additionally, the fuzzy semi-Kmeans provides the flexibility to employ different fuzzy membership functions to measure the distance between data. This work employs Gaussian weighting function to conduct experiments, but cosine similarity function can be used as well. This work conducts experiments on three data sets and compares fuzzy semi-Kmeans with several methods. The experimental results indicate that fuzzy semi-Kmeans can generally outperform the other methods. (C) 2013 Elsevier B.V. All rights reserved.

Clustering documents with labeled and unlabeled documents using fuzzy semi-Kmeans

期刊

FUZZY SETS AND SYSTEMS

出版社

ELSEVIER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Clustering documents with labeled and unlabeled documents using fuzzy semi-Kmeans

期刊

FUZZY SETS AND SYSTEMS

出版社

ELSEVIER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文