Abstract
Text embedded in images provides important semantic information about a scene and its content. Detecting text in an unconstrained environment is a challenging task because of the many fonts, sizes, backgrounds, and alignments of the characters. We present a novel attention model for detecting arbitrarily oriented and curved scene text. Inspired by the attention mechanisms in the human visual system, our model utilizes a spatial glimpse network to process the attended area and deploys a recurrent neural network that aggregates information over time to determine the attention movement. Combining this with an off-the-shelf region proposal method, the model achieves state-of-the-art performance on the widely used ICDAR2013 dataset and on the MSRA-TD500 dataset, which contains arbitrarily oriented text.
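The glimpse-plus-recurrence loop the abstract describes can be sketched as follows. This is a minimal illustration only, not the authors' implementation: the crop size, layer widths, tanh RNN cell, and the location head are all assumptions introduced here for clarity.

```python
import numpy as np

# Hypothetical sketch of the recurrent visual attention loop described in the
# abstract: crop a glimpse, encode it, update a recurrent state, and predict
# where to attend next. All sizes and the simple tanh cell are assumptions.

rng = np.random.default_rng(0)
GLIMPSE, HIDDEN = 8, 32

# Randomly initialized weights stand in for trained parameters.
W_enc = rng.standard_normal((GLIMPSE * GLIMPSE, HIDDEN)) * 0.1  # glimpse encoder
W_h = rng.standard_normal((HIDDEN, HIDDEN)) * 0.1               # recurrent weights
W_loc = rng.standard_normal((HIDDEN, 2)) * 0.1                  # next-location head

def take_glimpse(image, center, size=GLIMPSE):
    """Crop a size x size patch around `center` (row, col), clipped to the image."""
    r = int(np.clip(center[0], size // 2, image.shape[0] - size // 2))
    c = int(np.clip(center[1], size // 2, image.shape[1] - size // 2))
    return image[r - size // 2 : r + size // 2, c - size // 2 : c + size // 2]

def attention_step(image, loc, h):
    """One step: encode the attended patch, update the RNN state, emit a move."""
    g = take_glimpse(image, loc).reshape(-1)
    h = np.tanh(g @ W_enc + h @ W_h)   # aggregate information over time
    delta = h @ W_loc                  # offset for the next attention movement
    return delta, h

image = rng.random((64, 64))
loc, h = np.array([32.0, 32.0]), np.zeros(HIDDEN)
for _ in range(4):                     # a few attention movements
    delta, h = attention_step(image, loc, h)
    loc = loc + delta

print(h.shape, loc.shape)
```

In the paper, the predicted regions would then be combined with an off-the-shelf region proposal method; here the loop only illustrates how the glimpse encoding and recurrent state jointly drive the attention trajectory.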
Citation
Huang, W., He, D., Yang, X., Zhou, Z., Kifer, D., & Giles, C. L. (2016). Detecting arbitrary oriented text in the wild with a visual attention model. In MM 2016 - Proceedings of the 2016 ACM Multimedia Conference (pp. 551–555). Association for Computing Machinery, Inc. https://doi.org/10.1145/2964284.2967282