4.6 Article

R-YOLO: A Real-Time Text Detector for Natural Scenes with Arbitrary Rotation

期刊

SENSORS
卷 21, 期 3, 页码 -

出版社

MDPI
DOI: 10.3390/s21030888

关键词

scene text detection; arbitrarily-oriented text; rotation anchor; convolutional neural network; YOLOv4

资金

  1. National Natural Science Foundation of China [41671452, 41701532]
  2. China Postdoctoral Science Foundation-funded project [2017M612510]

向作者/读者索取更多资源

The paper introduces a novel method R-YOLO for detecting arbitrarily-oriented texts in natural scenes, utilizing rotated anchor boxes and multi-scale features to improve detection efficiency and accuracy. The proposed method outperforms state-of-the-art methods in terms of performance on benchmark datasets.
Accurate and efficient text detection in natural scenes is a fundamental yet challenging task in computer vision, especially when dealing with arbitrarily-oriented texts. Most contemporary text detection methods are designed to identify horizontal or approximately horizontal text, which cannot satisfy practical detection requirements for various real-world images such as image streams or videos. To address this lacuna, we propose a novel method called Rotational You Only Look Once (R-YOLO), a robust real-time convolutional neural network (CNN) model to detect arbitrarily-oriented texts in natural image scenes. First, a rotated anchor box with angle information is used as the text bounding box over various orientations. Second, features of various scales are extracted from the input image to determine the probability, confidence, and inclined bounding boxes of the text. Finally, Rotational Distance Intersection over Union Non-Maximum Suppression is used to eliminate redundancy and acquire detection results with the highest accuracy. Experiments on benchmark comparison are conducted upon four popular datasets, i.e., ICDAR2015, ICDAR2013, MSRA-TD500, and ICDAR2017-MLT. The results indicate that the proposed R-YOLO method significantly outperforms state-of-the-art methods in terms of detection efficiency while maintaining high accuracy; for example, the proposed R-YOLO method achieves an F-measure of 82.3% at 62.5 fps with 720 p resolution on the ICDAR2015 dataset.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据