Deep features for text spotting

Abstract

The goal of this work is text spotting in natural images. This is divided into two sequential tasks: detecting word regions in the image, and recognizing the words within these regions. We make the following contributions: first, we develop a Convolutional Neural Network (CNN) classifier that can be used for both tasks. The CNN has a novel architecture that enables efficient feature sharing (by using a number of layers in common) for text detection, character case-sensitive and case-insensitive classification, and bigram classification. It exceeds state-of-the-art performance on all of these tasks. Second, we make a number of technical changes over traditional CNN architectures, including no downsampling for a per-pixel sliding window, and multi-mode learning with a mixture of linear models (maxout). Third, we propose a method for automated data mining of Flickr that generates word and character level annotations. Finally, these components are used together to form an end-to-end, state-of-the-art text spotting system. We evaluate the text-spotting system on two standard benchmarks, the ICDAR Robust Reading data set and the Street View Text data set, and demonstrate improvements over the state-of-the-art on multiple measures. © 2014 Springer International Publishing.
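To make the shared-feature idea concrete, below is a minimal PyTorch sketch of a convolutional trunk shared by four heads (text detection, case-insensitive and case-sensitive character classification, and bigram classification), with maxout units and no downsampling so the network can be applied densely as a per-pixel sliding window. This is not the authors' implementation; the layer widths, kernel sizes, and class counts (36/62 characters, 604 bigrams) are illustrative assumptions.

```python
import torch
import torch.nn as nn


class MaxoutConv2d(nn.Module):
    """Convolution followed by maxout: max over `pieces` parallel filter banks."""
    def __init__(self, in_ch, out_ch, kernel_size, pieces=2):
        super().__init__()
        self.out_ch, self.pieces = out_ch, pieces
        self.conv = nn.Conv2d(in_ch, out_ch * pieces, kernel_size)

    def forward(self, x):
        y = self.conv(x)                                   # (N, out_ch*pieces, H, W)
        n, _, h, w = y.shape
        y = y.view(n, self.out_ch, self.pieces, h, w)
        return y.max(dim=2).values                         # maxout over the pieces


class SharedTextCNN(nn.Module):
    """Shared convolutional trunk with task-specific heads (sketch only)."""
    def __init__(self, n_chars=36, n_chars_case=62, n_bigrams=604):
        super().__init__()
        # Shared layers: stride 1, no pooling, so the net can slide per pixel.
        self.trunk = nn.Sequential(
            MaxoutConv2d(1, 48, kernel_size=9),
            MaxoutConv2d(48, 64, kernel_size=9),
        )
        # One small head per task on top of the shared features.
        self.detect = nn.Conv2d(64, 2, kernel_size=8)              # text / background
        self.char_ci = nn.Conv2d(64, n_chars + 1, kernel_size=8)   # + no-character class
        self.char_cs = nn.Conv2d(64, n_chars_case + 1, kernel_size=8)
        self.bigram = nn.Conv2d(64, n_bigrams, kernel_size=8)

    def forward(self, x):
        f = self.trunk(x)
        return {
            "detection": self.detect(f),
            "char_insensitive": self.char_ci(f),
            "char_sensitive": self.char_cs(f),
            "bigram": self.bigram(f),
        }


# A 24x24 grayscale patch yields a 1x1 prediction per task; applying the same
# network to a larger image produces dense per-pixel score maps for detection.
out = SharedTextCNN()(torch.randn(1, 1, 24, 24))
print({k: tuple(v.shape) for k, v in out.items()})
```

Because the trunk is shared, the expensive convolutional features are computed once and reused by all four classifiers, which is the efficiency argument made in the abstract.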

Cite

APA

Jaderberg, M., Vedaldi, A., & Zisserman, A. (2014). Deep features for text spotting. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8692 LNCS, pp. 512–528). Springer Verlag. https://doi.org/10.1007/978-3-319-10593-2_34
