☆ 4.4 Article

Scalable reduction of large datasets to interesting subsets

JOURNAL OF WEB SEMANTICS (2010)

Journal

JOURNAL OF WEB SEMANTICS

Volume 8, Issue 4, Pages 365-373

Publisher

ELSEVIER SCIENCE BV

DOI: 10.1016/j.websem.2010.08.002

Keywords

Billion Triples Challenge; Scalability; Parallel; Inferencing; Query; Triplestore

Funding

DARPA's Transformational Convergence Technology Office
Lockheed Martin Advanced Technology Laboratories
Fujitsu Laboratories of America

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

With a huge amount of RDF data available on the web, the ability to find and access relevant information is crucial. Traditional approaches to storing, querying, and reasoning fall short when faced with web-scale data. We present a system that combines the computational power of large clusters for enabling large-scale reasoning and data access with an efficient data structure for storing and querying the accessed data on a traditional personal computer or other resource-constrained device. We present results of using this system to load the 2009 Billion Triples Challenge dataset, materialize RDFS inferences, extract an interesting subset of the data using a large cluster, and further analyze the extracted data using a personal computer, all in the order of tens of minutes. (C) 2010 Elsevier B. V. All rights reserved.

Scalable reduction of large datasets to interesting subsets

Journal

JOURNAL OF WEB SEMANTICS

Publisher

ELSEVIER SCIENCE BV

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Scalable reduction of large datasets to interesting subsets

Journal

JOURNAL OF WEB SEMANTICS

Publisher

ELSEVIER SCIENCE BV

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper