4.5 Article

Selecting publication keywords for domain analysis in bibliometrics: A comparison of three methods

Journal

JOURNAL OF INFORMETRICS
Volume 10, Issue 1, Pages 212-223

Publisher

ELSEVIER
DOI: 10.1016/j.joi.2016.01.006

Keywords

Domain analysis; Keyword analysis; Keyword Activity Index; Digital Library in China

Funding

  1. National Social Science Foundation of China [12ZD221]

Ask authors/readers for more resources

Publication keywords have been widely utilized to reveal the knowledge structure of research domains. An important but under-addressed problem is the decision of which keywords should be retained as analysis objects after a great number of keywords are gathered from domain publications. In this paper, we discuss the problems with the traditional term frequency (TF) method and introduce two alternative methods: TF-inverse document frequency (TF-IDF) and TF-Keyword Activity Index (TF-KAI). These two methods take into account keyword discrimination by considering their frequency both in and out of the domain. To test their performance, the keywords they select in China's Digital Library domain are evaluated both qualitatively and quantitatively. The evaluation results show that the TF-KAI method performs the best: it can retain keywords that match expert selection much better and reveal the research specialization of the domain with more details. (C) 2016 Elsevier Ltd. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available