A modified isomap approach to manifold learning in word spotting

Sebastian Sudholt; Gernot A. Fink

Conference Proceedings, Open Access

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2015) 9358 529-539

DOI: 10.1007/978-3-319-24947-6_44

16 Citations

5 Readers

Abstract

Word spotting is an effective paradigm for indexing document images with minimal human effort. Here, the use of the Bag-of-Features principle has been shown to achieve competitive results on different benchmarks. Recently, a spatial pyramid approach was used as a word image representation to improve the retrieval results even further. Latent Semantic Analysis was applied to counter the high dimensionality of the spatial pyramids. However, this leads to increasingly worse results when reducing to lower dimensions. In this paper, we propose a new approach to reducing the dimensionality of word image descriptors which is based on a modified version of the Isomap Manifold Learning algorithm. This approach not only outperforms Latent Semantic Analysis but also reduces a word image descriptor to as little as 0.12% of its original size without losing retrieval precision. We evaluate our approach on two different datasets.
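For orientation, the following is a minimal sketch of the textbook Isomap algorithm that the proposed method builds on. It does not reproduce the modification described in the paper, which the abstract does not detail. The function name `isomap`, its parameters, and the synthetic input are assumptions made for illustration only; the input stands in for high-dimensional word image descriptors and is not data from the paper.

```python
import numpy as np


def isomap(X, n_neighbors=4, n_components=1):
    """Textbook Isomap (NOT the paper's modified variant)."""
    n = X.shape[0]
    # Pairwise Euclidean distances between all descriptors.
    diff = X[:, None, :] - X[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1))
    # Symmetric k-nearest-neighbor graph; missing edges are infinite.
    graph = np.full((n, n), np.inf)
    np.fill_diagonal(graph, 0.0)
    nearest = np.argsort(dist, axis=1)[:, 1:n_neighbors + 1]
    for i in range(n):
        graph[i, nearest[i]] = dist[i, nearest[i]]
        graph[nearest[i], i] = dist[i, nearest[i]]
    # Geodesic distances: all-pairs shortest paths (Floyd-Warshall).
    for k in range(n):
        graph = np.minimum(graph, graph[:, [k]] + graph[[k], :])
    if np.isinf(graph).any():
        raise ValueError("neighborhood graph is disconnected; increase n_neighbors")
    # Classical MDS on the geodesic distances.
    centering = np.eye(n) - np.full((n, n), 1.0 / n)
    gram = -0.5 * centering @ (graph ** 2) @ centering
    eigvals, eigvecs = np.linalg.eigh(gram)
    top = np.argsort(eigvals)[::-1][:n_components]
    return eigvecs[:, top] * np.sqrt(np.maximum(eigvals[top], 0.0))


# Synthetic stand-in for descriptors: a three-quarter circle (a 1-D manifold)
# zero-padded into 50 dimensions. This is illustrative data only.
t = np.linspace(0.0, 1.5 * np.pi, 60)
X = np.zeros((60, 50))
X[:, 0] = np.cos(t)
X[:, 1] = np.sin(t)

embedding = isomap(X, n_neighbors=4, n_components=1)
print(embedding.shape)
```

In this toy setting the 50-dimensional points are reduced to a single coordinate that follows the position along the arc, which is the kind of structure a linear projection cannot unroll.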

Citation (APA)

Sudholt, S., & Fink, G. A. (2015). A modified isomap approach to manifold learning in word spotting. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9358, pp. 529–539). Springer Verlag. https://doi.org/10.1007/978-3-319-24947-6_44
