4.2 Article Proceedings Paper

MaReIA: a cloud MapReduce based high performance whole slide image analysis framework

期刊

DISTRIBUTED AND PARALLEL DATABASES
卷 37, 期 2, 页码 251-272

出版社

SPRINGER
DOI: 10.1007/s10619-018-7237-1

关键词

Whole slide images; Spatial application; Pathology image analysis; MapReduce; Cloud computing

资金

  1. National Science Foundation [ACI 1443054, IIS 1350885]
  2. National Institute of Health [K25CA181503]
  3. CNPq

向作者/读者索取更多资源

Recent advancements in systematic analysis of high resolution whole slide images have increase efficiency of diagnosis, prognosis and prediction of cancer and important diseases. Due to the enormous sizes and dimensions of whole slide images, the analysis requires extensive computing resources which are not commonly available. Images have to be tiled for processing due to computer memory limitations, which lead to inaccurate results due to the ignorance of boundary crossing objects. Thus, we propose a generic and highly scalable cloud-based image analysis framework for whole slide images. The framework enables parallelized integration of image analysis steps, such as segmentation and aggregation of micro-structures in a single pipeline, and generation of final objects manageable by databases. The core concept relies on the abstraction of objects in whole slide images as different classes of spatial geometries, which in turn can be handled as text based records in MapReduce. The framework applies an overlapping partitioning scheme on images, and provides parallelization of tiling and image segmentation based on MapReduce architecture. It further provides robust object normalization, graceful handling of boundary objects with an efficient spatial indexing based matching method to generate accurate results. Our experiments on Amazon EMR show that MaReIA is highly scalable, generic and extremely cost effective by benchmark tests.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.2
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据