4.8 Article

Handwritten Chinese/Japanese Text Recognition Using Semi-Markov Conditional Random Fields

出版社

IEEE COMPUTER SOC
DOI: 10.1109/TPAMI.2013.49

关键词

Character string recognition; semi-Markov conditional random field; lattice pruning; beam search

资金

  1. National Natural Science Foundation of China [61273269, 60933010, 61232013, 61170182]

向作者/读者索取更多资源

This paper proposes a method for handwritten Chinese/Japanese text (character string) recognition based on semi-Markov conditional random fields (semi-CRFs). The high-order semi-CRF model is defined on a lattice containing all possible segmentation-recognition hypotheses of a string to elegantly fuse the scores of candidate character recognition and the compatibilities of geometric and linguistic contexts by representing them in the feature functions. Based on given models of character recognition and compatibilities, the fusion parameters are optimized by minimizing the negative log-likelihood loss with a margin term on a training string sample set. A forward-backward lattice pruning algorithm is proposed to reduce the computation in training when trigram language models are used, and beam search techniques are investigated to accelerate the decoding speed. We evaluate the performance of the proposed method on unconstrained online handwritten text lines of three databases. On the test sets of databases CASIA-OLHWDB (Chinese) and TUAT Kondate (Japanese), the character level correct rates are 95.20 and 95.44 percent, and the accurate rates are 94.54 and 94.55 percent, respectively. On the test set (online handwritten texts) of ICDAR 2011 Chinese handwriting recognition competition, the proposed method outperforms the best system in competition.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Computer Science, Artificial Intelligence

An online overlaid handwritten Japanese text recognition system for small tablet

Jianjuan Liang, Cuong Tuan Nguyen, Bilan Zhu, Masaki Nakagawa

PATTERN ANALYSIS AND APPLICATIONS (2019)

Article Computer Science, Artificial Intelligence

Robust and real-time stroke order evaluation using incremental stroke context for learners to write Kanji characters correctly

Cuong Tuan Nguyen, Hung Tuan Nguyen, Kazuhiro Mita, Masaki Nakagawa

PATTERN RECOGNITION LETTERS (2019)

Article Computer Science, Artificial Intelligence

A unified method for augmented incremental recognition of online handwritten Japanese and English text

Cuong Tuan Nguyen, Bipin Indurkhya, Masaki Nakagawa

INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION (2020)

Article Computer Science, Artificial Intelligence

CNN based spatial classification features for clustering offline handwritten mathematical expressions

Cuong Tuan Nguyen, Vu Tran Minh Khuong, Hung Tuan Nguyen, Masaki Nakagawa

PATTERN RECOGNITION LETTERS (2020)

Article Computer Science, Artificial Intelligence

Nom document digitalization by deep convolution neural networks

Kha Cong Nguyen, Cuong Tuan Nguyen, Masaki Nakagawa

PATTERN RECOGNITION LETTERS (2020)

Article Computer Science, Artificial Intelligence

An attention-based row-column encoder-decoder model for text recognition in Japanese historical documents

Nam Tuan Ly, Cuong Tuan Nguyen, Masaki Nakagawa

PATTERN RECOGNITION LETTERS (2020)

Proceedings Paper Computer Science, Artificial Intelligence

Handwriting Recognition and Automatic Scoring for Descriptive Answers in Japanese Language Tests

Hung Tuan Nguyen, Cuong Tuan Nguyen, Haruki Oka, Tsunenori Ishioka, Masaki Nakagawa

Summary: This research paper presents an experiment on automatically scoring handwritten descriptive answers, achieving high accuracy using deep neural networks and a pre-trained automatic scoring system. The results demonstrate the potential for further research on end-to-end automatic scoring of descriptive answers.

FRONTIERS IN HANDWRITING RECOGNITION, ICFHR 2022 (2022)

Proceedings Paper Computer Science, Information Systems

GSSF: A Generative Sequence Similarity Function Based on a Seq2Seq Model for Clustering Online Handwritten Mathematical Answers

Huy Quang Ung, Cuong Tuan Nguyen, Hung Tuan Nguyen, Masaki Nakagawa

Summary: This study proposes a method for computer-assisted marking by clustering online handwritten mathematical expressions to improve the efficiency and reliability of marking. Experimental results show that the method performs well in terms of accuracy and marking cost.

DOCUMENT ANALYSIS AND RECOGNITION - ICDAR 2021, PT II (2021)

Proceedings Paper Computer Science, Information Systems

2D Self-attention Convolutional Recurrent Network for Offline Handwritten Text Recognition

Nam Tuan Ly, Hung Tuan Nguyen, Masaki Nakagawa

Summary: The paper proposes a model called 2D-SACRN for offline handwritten text recognition, which utilizes self-attention mechanism and recurrent encoder to achieve the recognition process from feature sequence to label probability sequence to final label sequence, and the experimental results show similar or even better accuracy compared to state-of-the-art models on all datasets.

DOCUMENT ANALYSIS AND RECOGNITION - ICDAR 2021, PT I (2021)

Proceedings Paper Computer Science, Information Systems

A-VLAD: An End-to-End Attention-Based Neural Network for Writer Identification in Historical Documents

Trung Tan Ngo, Hung Tuan Nguyen, Masaki Nakagawa

Summary: This paper introduces an end-to-end attention-based neural network for identifying writers in historical documents. The model outperforms state-of-the-art results, showing promise in handling various sizes of historical document fragments for writer identification and image retrieval.

DOCUMENT ANALYSIS AND RECOGNITION - ICDAR 2021, PT II (2021)

Proceedings Paper Computer Science, Information Systems

Global Context for Improving Recognition of Online Handwritten Mathematical Expressions

Cuong Tuan Nguyen, Thanh-Nghia Truong, Hung Tuan Nguyen, Masaki Nakagawa

Summary: This paper introduces a temporal classification method for online handwritten mathematical expressions, trained by multiple paths of symbol and spatial relations derived from Symbol Relation Tree, benefiting from a deep bidirectional LSTM network for learning temporal classification. The method constructs a symbol-level parse tree with Context-Free Grammar to recognize online HME effectively.

DOCUMENT ANALYSIS AND RECOGNITION - ICDAR 2021, PT II (2021)

Proceedings Paper Computer Science, Artificial Intelligence

A Transformer-Based Math Language Model for Handwritten Math Expression Recognition

Huy Quang Ung, Cuong Tuan Nguyen, Hung Tuan Nguyen, Thanh-Nghia Truong, Masaki Nakagawa

Summary: The paper introduces a Transformer-based Math Language Model to address ambiguities in handwritten mathematical expressions recognition. By training the model and incorporating it into the recognition system, significant improvements in expression rates have been achieved.

DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021, PT II (2021)

Proceedings Paper Computer Science, Artificial Intelligence

Recurrent Neural Network Transducer for Japanese and Chinese Offline Handwritten Text Recognition

Trung Tan Ngo, Hung Tuan Nguyen, Nam Tuan Ly, Masaki Nakagawa

Summary: This paper proposes an RNN-Transducer model for recognizing Japanese and Chinese offline handwritten text line images, which combines visual and linguistic features and achieves state-of-the-art performance on two datasets through experiments.

DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021, PT II (2021)

Proceedings Paper Computer Science, Artificial Intelligence

Temporal Classification Constraint for Improving Handwritten Mathematical Expression Recognition

Cuong Tuan Nguyen, Hung Tuan Nguyen, Kei Morizumi, Masaki Nakagawa

Summary: A temporal classification constraint was presented as an auxiliary learning method to improve recognition of Handwritten Mathematical Expression (HME), utilizing connectionist temporal classification (CTC) to learn temporal alignment of input feature sequence and corresponding symbol label sequence. Training CTC alignment through a combination of CTC loss and encoder-decoder loss was shown to enhance feature learning in the encoder of the encoder-decoder model, demonstrating effectiveness in symbol classification and expression recognition on the CROHME datasets.

DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021, PT II (2021)

Proceedings Paper Computer Science, Artificial Intelligence

Relation-Based Representation for Handwritten Mathematical Expression Recognition

Thanh-Nghia Truong, Huy Quang Ung, Hung Tuan Nguyen, Cuong Tuan Nguyen, Masaki Nakagawa

Summary: This paper proposes a relation-based sequence representation for offline handwritten mathematical expressions (HMEs) recognition, which outperforms the traditional LaTeX-based representation system. Experimental results show that the HME recognition system using the proposed representation achieves significantly higher recognition rates on the CROHME dataset.

DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021 WORKSHOPS, PT I (2021)

暂无数据