TKDN: Scene text detection via keypoints detection

Yuanshun Cui; Jie Li; Hu Han; Shiguang Shan; Xilin Chen

Conference Proceedings

TKDN: Scene text detection via keypoints detection

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2019) 11365 LNCS 231-246

DOI: 10.1007/978-3-030-20873-8_15

0Citations

11Readers

Get full text

Abstract

In the past few years, great efforts have been devoted to scene text detection. Nevertheless, efficient text detection in the wild remains a challenging problem. Methods for general object detection usually have limitations in handling the arbitrary orientations and large aspect ratios of scene text. In this paper, we present a novel scene text detection method which treats text detection as a text keypoint detection task performed in a coarse-to-fine scheme (text keypoint detection network, TKDN). Specifically, in TKDN we first generate the coarse text instance regions using feature pyramid network (FPN) as well as region proposal network (RPN) and ResNet50. Within the coarse text regions, we then perform text keypoint detection, bounding box classification and regression, and text region segmentation in a multi-task way. In the inference stage, an effective post-processing algorithm is designed to combine the outputs from three branches and obtain the final text keypoint detection results. The proposed TKDN approach outperforms the state-of-the-art approach and achieves an F-measure of 82.0% on the public-domain ICDAR2015 database.

Cite

CITATION STYLE

APA

Cui, Y., Li, J., Han, H., Shan, S., & Chen, X. (2019). TKDN: Scene text detection via keypoints detection. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11365 LNCS, pp. 231–246). Springer Verlag. https://doi.org/10.1007/978-3-030-20873-8_15

TKDN: Scene text detection via keypoints detection

Abstract

Cite

Register to see more suggestions