4.7 Article

Understanding the topic evolution of scientific literatures like an evolving city: Using Google Word2Vec model and spatial autocorrelation analysis

Journal

INFORMATION PROCESSING & MANAGEMENT
Volume 56, Issue 4, Pages 1185-1203

Publisher

ELSEVIER SCI LTD
DOI: 10.1016/j.ipm.2019.02.014

Keywords

Semantic relatedness; Topic evolution; Spatial clustering; Spatial autocorrelation; Word2Vec

Funding

  1. Open Research Fund of State Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University [18104]
  2. National Natural Science Foundation of China [41371372, 31771680, 21706096]
  3. Natural Science Foundation of Jiangsu Province [BK20160162]
  4. Fundamental Research Funds for the Central Universities of China [JUSRP51730A]
  5. Modern Agriculture Funds of Jiangsu Province [BE2015310]
  6. New Agricultural Engineering of Jiangsu Province [SXGC [2016]106]
  7. 111 Project [B12018]
  8. Research Funds for New Faculty of Jiangnan University

Ask authors/readers for more resources

Topic evolution has been described by many approaches from a macro level to a detail level, by extracting topic dynamics from text in literature and other media types. However, why the evolution happens is less studied. In this paper, we focus on whether and how the keyword semantics can invoke or affect the topic evolution. We assume that the semantic relatedness among the keywords can affect topic popularity during literature surveying and citing process, thus invoking evolution. However, the assumption is needed to be confirmed in an approach that fully considers the semantic interactions among topics. Traditional topic evolution analyses in scientometric domains cannot provide such support because of using limited semantic meanings. To address this problem, we apply the Google Word2Vec, a deep learning language model, to enhance the keywords with more complete semantic information. We further develop the semantic space as an urban geographic space. We analyze the topic evolution geographically using the measures of spatial autocorrelation, as if keywords are the changing lands in an evolving city. The keyword citations (keyword citation counts one when the paper containing this keyword obtains a citation) are used as an indicator of keyword popularity. Using the bibliographical datasets of the geographical natural hazard field, experimental results demonstrate that in some local areas, the popularity of keywords is affecting that of the surrounding keywords. However, there are no significant impacts on the evolution of all keywords. The spatial autocorrelation analysis identifies the interaction patterns (including High-High leading, High-Low suppressing) among the keywords in local areas. This approach can be regarded as an analyzing framework borrowed from geospatial modeling. Moreover, the prediction results in local areas are demonstrated to be more accurate if considering the spatial autocorrelations.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available