Key points are not available for this paper at this time.
The anchor mechanism of Faster R-CNN and SSD framework is considered not effective enough to scene text detection, which can be attributed to its IoU based matching criterion between anchors and ground-truth boxes. In order to better enclose scene text instances of various shapes, it requires to design anchors of various scales, aspect ratios and even orientations manually, which makes anchor-based methods sophisticated and inefficient. In this paper, we propose a novel anchor-free region proposal network (AF-RPN) to replace the original anchor-based RPN in the Faster R-CNN framework to address the above problem. Compared with a vanilla RPN and FPN-RPN, AF-RPN can get rid of complicated anchor design and achieve higher recall rate on large-scale COCO-Text dataset. Owing to the high-quality text proposals, our Faster R-CNN based two-stage text detection approach achieves state-of-the-art results on ICDAR-2017 MLT, ICDAR-2015 and ICDAR-2013 text detection benchmarks when using single-scale and single-model (ResNet50) testing only.
Building similarity graph...
Analyzing shared references across papers
Loading...
Zhuoyao Zhong
Microsoft Research Asia (China)
Lei Sun
Peking University
Qiang Huo
University of Science and Technology of China
South China University of Technology
Microsoft Research Asia (China)
Building similarity graph...
Analyzing shared references across papers
Loading...
Zhong et al. (Tue,) studied this question.
synapsesocial.com/papers/6a22190b9f07bfb2f8e21cc1 — DOI: https://doi.org/10.48550/arxiv.1804.09003