4.8 Article

Network histograms and universality of blockmodel approximation

出版社

NATL ACAD SCIENCES
DOI: 10.1073/pnas.1400374111

关键词

community detection; graphons; nonparametric statistics; graph limits; sparse networks

资金

  1. US Army Research Office under Presidential Early Career Award for Scientists and Engineers [W911NF-09-1-0555]
  2. Multidisciplinary University Research Initiative Award [W911NF-11-1-0036]
  3. US Office of Naval Research [N00014-14-1-0819]
  4. UK Engineering and Physical Sciences Research Council under Mathematical Sciences Leadership Fellowship [EP/I005250/1]
  5. Established Career Fellowship [EP/K005413/1]
  6. Developing Leaders Award [EP/L001519/1]
  7. UK Royal Society under a Wolfson Research Merit Award
  8. Marie Curie FP7 within 7th European Union Framework Program [PCIG12-GA-2012-334622]
  9. EPSRC [EP/K005413/1, EP/L001519/1, EP/I005250/1] Funding Source: UKRI
  10. Engineering and Physical Sciences Research Council [EP/I005250/1, EP/L001519/1, EP/K005413/1] Funding Source: researchfish

向作者/读者索取更多资源

In this paper we introduce the network histogram, a statistical summary of network interactions to be used as a tool for exploratory data analysis. A network histogram is obtained by fitting a stochastic blockmodel to a single observation of a network dataset. Blocks of edges play the role of histogram bins and community sizes that of histogram bandwidths or bin sizes. Just as standard histograms allow for varying bandwidths, different blockmodel estimates can all be considered valid representations of an underlying probability model, subject to bandwidth constraints. Here we provide methods for automatic bandwidth selection, by which the network histogram approximates the generating mechanism that gives rise to exchangeable random graphs. This makes the blockmodel a universal network representation for unlabeled graphs. With this insight, we discuss the interpretation of network communities in light of the fact that many different community assignments can all give an equally valid representation of such a network. To demonstrate the fidelity-versus-interpretability tradeoff inherent in considering different numbers and sizes of communities, we analyze two publicly available networks-political weblogs and student friendships-and discuss how to interpret the network histogram when additional information related to node and edge labeling is present.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据