☆ 4.3 Article

U-Net versus Pix2Pix: a comparative study on degraded document image binarization

JOURNAL OF ELECTRONIC IMAGING (2020)

期刊

JOURNAL OF ELECTRONIC IMAGING

卷 29, 期 6, 页码 -

出版社

SPIE-SOC PHOTO-OPTICAL INSTRUMENTATION ENGINEERS

DOI: 10.1117/1.JEI.29.6.063019

关键词

binarization; document image; U-Net; Pix2Pix; Dice loss; document image binarization competition

类别

Engineering, Electrical & Electronic Optics Imaging Science & Photographic Technology

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Document image binarization is the process in which pixels in a document image are classified into two groups-foreground and background. This process becomes challenging when it deals with various degradation and noise present in the images. In the recent past, it has been observed that researchers are relying on deep learning-based approaches to solve the problem of document image binarization. Of these, a group of methods considers the segmentation as a pixel-level classification problem, whereas another group considers it as an image-to-image translation problem. We have explored two popular deep learning-based architectures, one from each group, namely, U-Net and Pix2Pix, and presented a comparative assessment of their performances when applied for degraded document image binarization. In this study, no preprocessing or postprocessing methods are applied, which helps us to realize the actual strength of these architectures for the said purpose. For the performance evaluation and comparative assessment, six publicly available standard datasets, namely, document image binarization competition 2013 (DIBCO 2013), H-DIBCO 2014, H-DIBCO 2016, DIBCO 2017, H-DIBCO 2018, and DIBCO 2019, are considered. The performances of these architectures are compared with the best performing methods of the respective binarization competitions, some state-of-the-art nondeep learning-based methods, and some recently published deep learning-based methods separately. The obtained results confirm that in most of the cases U-Net outperforms the Pix2Pix model. (C) 2020 SPIE and IS&T

U-Net versus Pix2Pix: a comparative study on degraded document image binarization

期刊

JOURNAL OF ELECTRONIC IMAGING

出版社

SPIE-SOC PHOTO-OPTICAL INSTRUMENTATION ENGINEERS

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

U-Net versus Pix2Pix: a comparative study on degraded document image binarization

期刊

JOURNAL OF ELECTRONIC IMAGING

出版社

SPIE-SOC PHOTO-OPTICAL INSTRUMENTATION ENGINEERS

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文