Article

Graph cells: Top-k structural-textual aggregated query over information networks

Journal

INFORMATION SCIENCES
Volume 547, Pages 354-366

Publisher

ELSEVIER SCIENCE INC
DOI: 10.1016/j.ins.2020.08.057

Keywords

Graph cell; Aggregated query; Information network; Top-k

Funding

  1. CSC scholarship
  2. NSFC [61732003, 61729201]
  3. Fundamental Research Funds for the Central Universities [N181605012]


The paper proposes a novel OLAP query for analyzing information network data, aiming to find the top-k structural-textual aggregated graph cells in text-rich multidimensional information networks.
Graph OLAP aggregated analysis in information networks has been extensively studied. However, previous works have neglected to integrate structural information into this kind of query and have ignored the influence of rich textual information on graph aggregation operations. In this paper, we propose a novel OLAP query, called the top-k structural-textual aggregated graph cell query, for analyzing information network data. Given a set of keywords, the query finds the top-k structural-textual aggregated graph cells in text-rich multidimensional information networks. A graph cell is defined as the subgraph of the network whose vertices match given attribute values in a subset of the dimensions; it contains only the documents of the vertices included in that subgraph. To distinguish the importance of different graph cells, we first design a dominating number-based threshold test and a flexible ranking function that integrates the text similarity to the query with the structural size, in order to obtain the k most relevant graph cells. Then, we propose a new hybrid index structure and a filtering-and-verification framework, which includes an efficient search algorithm and several pruning and bounding techniques. Finally, we verify the effectiveness and efficiency of the proposed methods through extensive experiments. © 2020 Elsevier Inc. All rights reserved.
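
As a rough illustration of the idea described in the abstract, the following Python sketch builds graph cells from attribute conditions and ranks them by a score that mixes text similarity with structural size. The abstract does not give the actual ranking function, so everything concrete here is an assumption made for illustration: the toy data (`VERTICES`, `EDGES`), the Jaccard similarity, the normalized vertex count used as "structural size", the linear weight `alpha`, and all function names. The dominating number-based threshold test, the hybrid index, and the pruning and bounding techniques are not modeled; this is a brute-force sketch, not the paper's algorithm.

```python
import heapq

# Hypothetical toy network: each vertex has dimension attributes and a document.
VERTICES = {
    "v1": {"attrs": {"venue": "VLDB", "year": 2019}, "doc": "graph olap aggregation"},
    "v2": {"attrs": {"venue": "VLDB", "year": 2020}, "doc": "graph query top-k"},
    "v3": {"attrs": {"venue": "ICDE", "year": 2020}, "doc": "image retrieval"},
    "v4": {"attrs": {"venue": "ICDE", "year": 2020}, "doc": "text index"},
}
EDGES = [("v1", "v2"), ("v2", "v3"), ("v3", "v4")]


def graph_cell(vertices, edges, conditions):
    """Return (members, edges) of the subgraph induced by the vertices whose
    attributes match every (dimension, value) pair in `conditions`."""
    members = {
        v for v, data in vertices.items()
        if all(data["attrs"].get(dim) == val for dim, val in conditions.items())
    }
    cell_edges = [(a, b) for a, b in edges if a in members and b in members]
    return members, cell_edges


def text_similarity(keywords, members, vertices):
    """Jaccard similarity between the query keywords and the terms of all
    documents in the cell (an assumed stand-in for the paper's measure)."""
    terms = set()
    for v in members:
        terms.update(vertices[v]["doc"].split())
    query = set(keywords)
    if not query or not terms:
        return 0.0
    return len(query & terms) / len(query | terms)


def score(keywords, members, vertices, alpha=0.5):
    """Assumed ranking: weighted sum of text similarity and the fraction of
    the network's vertices that fall in the cell."""
    size = len(members) / len(vertices)
    return alpha * text_similarity(keywords, members, vertices) + (1 - alpha) * size


def top_k_cells(vertices, edges, candidate_conditions, keywords, k, alpha=0.5):
    """Brute-force top-k: score every non-empty candidate cell, keep the k best."""
    scored = []
    for conditions in candidate_conditions:
        members, _ = graph_cell(vertices, edges, conditions)
        if members:
            scored.append((score(keywords, members, vertices, alpha), conditions, members))
    return heapq.nlargest(k, scored, key=lambda item: item[0])


CANDIDATES = [{"venue": "VLDB"}, {"venue": "ICDE"}, {"year": 2020}]
RESULT = top_k_cells(VERTICES, EDGES, CANDIDATES, ["graph", "olap"], k=2)
```

Each entry of `RESULT` is a `(score, conditions, members)` tuple, ordered from best to worst. Raising `alpha` favors cells whose documents match the keywords; lowering it favors larger cells.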
