4.7 Article Data Paper

SciSciNet: A large-scale open data lake for the science of science research

Journal

SCIENTIFIC DATA
Volume 10, Issue 1, Pages -

Publisher

NATURE PORTFOLIO
DOI: 10.1038/s41597-023-02198-9

Keywords

-

Ask authors/readers for more resources

The science of science has gained increasing research interest due to the availability of large-scale datasets. SciSciNet is introduced as a large-scale open data lake for science of science research, covering millions of scientific publications and linkages to funding and public uses. Detailed documentation is provided for data processing steps and analytical choices, along with commonly used measures computed in the literature. This data lake serves as a valuable resource by reducing duplication of efforts, improving replicability of empirical claims, and broadening diversity and representation of ideas in the field.
The science of science has attracted growing research interests, partly due to the increasing availability of large-scale datasets capturing the innerworkings of science. These datasets, and the numerous linkages among them, enable researchers to ask a range of fascinating questions about how science works and where innovation occurs. Yet as datasets grow, it becomes increasingly difficult to track available sources and linkages across datasets. Here we present SciSciNet, a large-scale open data lake for the science of science research, covering over 134M scientific publications and millions of external linkages to funding and public uses. We offer detailed documentation of pre-processing steps and analytical choices in constructing the data lake. We further supplement the data lake by computing frequently used measures in the literature, illustrating how researchers may contribute collectively to enriching the data lake. Overall, this data lake serves as an initial but useful resource for the field, by lowering the barrier to entry, reducing duplication of efforts in data processing and measurements, improving the robustness and replicability of empirical claims, and broadening the diversity and representation of ideas in the field.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Review Computer Science, Cybernetics

Structural Hole Theory in Social Network Analysis: A Review

Zihang Lin, Yuwei Zhang, Qingyuan Gong, Yang Chen, Atte Oksanen, Aaron Yi Ding

Summary: Social networks connect billions of users with different social roles and behavior characteristics. The structural hole spanners have control over information diffusion paths between communities, playing a crucial role in integrating social network structures and user behaviors. Research reviews detection algorithms for structural hole spanners and their practical applications in various scenarios.

IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS (2022)

No Data Available