Large scale scene text verification with guided attention

Dafang He; Yeqing Li; Alexander Gorban; Derrall Heath; Julian Ibarz; Qian Yu; Daniel Kifer; C. Lee Giles

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2019) 11365 LNCS 260-275

DOI: 10.1007/978-3-030-20873-8_17

Abstract

Many tasks involve determining whether a particular text string appears in an image. In this work, we propose a new framework that learns this task end to end. The framework takes an image and a text string as input and outputs the probability that the text string is present in the image. This is the first end-to-end framework that learns such relationships between text and images in the scene text area. The framework does not require explicit scene text detection or recognition, and thus no bounding box annotations are needed. It is also the first work in the scene text area that tackles such a weakly labeled problem. Based on this framework, we developed a model called Guided Attention. Our model achieves better results than several state-of-the-art solutions based on scene text reading on a challenging Street View Business Matching task. The task is to find the correct business names for storefront images, and the dataset we collected for it is substantially larger and more challenging than existing scene text datasets. This new real-world task provides a new perspective for studying scene text related problems.

Cite

He, D., Li, Y., Gorban, A., Heath, D., Ibarz, J., Yu, Q., … Giles, C. L. (2019). Large scale scene text verification with guided attention. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11365 LNCS, pp. 260–275). Springer Verlag. https://doi.org/10.1007/978-3-030-20873-8_17
