☆ 4.7 Article

Text-line extraction from handwritten document images using GAN

EXPERT SYSTEMS WITH APPLICATIONS (2020)

期刊

EXPERT SYSTEMS WITH APPLICATIONS

卷 140, 期 -, 页码 -

出版社

PERGAMON-ELSEVIER SCIENCE LTD

DOI: 10.1016/j.eswa.2019.112916

关键词

GAN; Deep Learning; Text-line extraction; Handwritten documents; HIT-MW dataset; ICDAR dataset

类别

Computer Science, Artificial Intelligence Engineering, Electrical & Electronic Operations Research & Management Science

资金

CMATER research laboratory of the Computer Science and Engineering Department, Jadavpur University, India
DST [EMR/2016/007213]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Text-line extraction (TLE) from unconstrained handwritten document images is still considered an open research problem. Literature survey reveals that use of various rule-based methods is commonplace in this regard. But these methods mostly fail when the document images have touching and/or multi skewed text lines or overlapping words/characters and non-uniform inter-line space. To encounter this problem, in this paper, we have used a deep learning-based method. In doing so, we have, for the first time in the literature, applied Generative Adversarial Networks (GANs) where we have considered TLE as image-to-image translation task. We have used U-Net architecture for the Generator, and Patch GAN architecture for the discriminator with different combinations of loss functions namely GAN loss, L1 loss and L2 loss. Evaluation is done on two datasets: handwritten Chinese text dataset HIT-MW and ICDAR 2013 Handwritten Segmentation Contest dataset. After exhaustive experimentations, it has been observed, that U-Net architecture with combination of the said three losses not only produces impressive results but also outperforms some state-of-the-art methods. (C) 2019 Elsevier Ltd. All rights reserved.

Text-line extraction from handwritten document images using GAN

期刊

EXPERT SYSTEMS WITH APPLICATIONS

出版社

PERGAMON-ELSEVIER SCIENCE LTD

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Text-line extraction from handwritten document images using GAN

期刊

EXPERT SYSTEMS WITH APPLICATIONS

出版社

PERGAMON-ELSEVIER SCIENCE LTD

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文