Journal
MULTIMEDIA TOOLS AND APPLICATIONS
Volume 81, Issue 1, Pages 975-1000
Publisher
SPRINGER
DOI: 10.1007/s11042-021-11425-7
Keywords
Handwriting recognition; Character spotting; IAM dataset; YOLOv3
Handwriting recognition has been a challenging task, usually requiring large datasets and complex lexicon-based approaches. This study proposes a lexicon-free handwriting recognition technique that is trained on only 1200 word images, achieving successful recognition of handwritten English text without dependency on writers' styles.
Handwriting is used to share information among people. To access this information for further analysis, the page needs to be optically scanned and converted to a machine-readable form. Due to unconstrained writing styles along with connected and overlapping characters, handwriting recognition remains a challenging task. Most methods in the literature use lexicon-based approaches and train their models on large datasets of nearly 50K word samples to achieve good results, which leads to high computational requirements. Moreover, while these models use around 50K words in their dictionary when recognizing handwritten English text, the number of words in an actual English dictionary is much higher. To this end, we propose a lexicon-free technique for recognizing handwritten English text, based on a YOLOv3 object recognition model, that performs sequential character detection and identification with a low number of training samples (only 1200 word images). The model works well without any dependency on the writer's style. Tested on the IAM dataset, it achieves a 29.21% Word Error Rate and a 9.53% Character Error Rate without a predefined vocabulary, which is on par with state-of-the-art lexicon-based word recognition models.
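To make the reported metrics concrete, the following is a minimal sketch of how character detections might be assembled into a word and how Word Error Rate and Character Error Rate are commonly computed from edit distance. It is not the authors' implementation: the detection tuple format `(x_center, label, confidence)`, the `min_conf` threshold, and the function names are all assumptions made for illustration, and the abstract does not state how the paper orders detections or normalizes its error rates.

```python
from typing import List, Sequence, Tuple

# Hypothetical detection format (an assumption, not from the paper):
# (x_center of the box, predicted character, detector confidence)
Detection = Tuple[float, str, float]


def detections_to_word(detections: List[Detection], min_conf: float = 0.5) -> str:
    """Drop low-confidence boxes, then read characters left to right."""
    kept = [d for d in detections if d[2] >= min_conf]
    return "".join(label for _, label, _ in sorted(kept, key=lambda d: d[0]))


def edit_distance(ref: Sequence, hyp: Sequence) -> int:
    """Levenshtein distance between two sequences (strings or word lists)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def error_rate(refs: Sequence[Sequence], hyps: Sequence[Sequence]) -> float:
    """Total edits divided by total reference length.

    Pass strings for a character-level rate (CER) and lists of words
    for a word-level rate (WER).
    """
    total_edits = sum(edit_distance(r, h) for r, h in zip(refs, hyps))
    total_len = sum(len(r) for r in refs)
    return total_edits / total_len


if __name__ == "__main__":
    boxes = [(30.0, "t", 0.90), (10.0, "c", 0.80), (20.0, "u", 0.95), (25.0, "x", 0.20)]
    predicted = detections_to_word(boxes)
    print(predicted)
    print("CER:", error_rate(["cat"], [predicted]))
    print("WER:", error_rate([["the", "cat", "sat"]], [["the", predicted, "sat"]]))
```

Because the word is built directly from the detected characters, no vocabulary lookup is involved, which is what makes an approach of this kind lexicon-free.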