Automatic extraction of text from multimedia contents is an important problem that needs to be solved in order to obtain more effective retrieval engines. Recently, Crandall, Antani and Kasturi have shown that a direct analysis of certain DCT coefficients can be used to locate potential regions of caption text in MPEG-1 videos. In this paper, we extend their proposal to wavelet-coded images, and show that localization of text superimposed in natural scenes can also be effectively and efficiently performed by a wavelet transformation of the image followed by an analysis of the distribution of second order statistics on high frequency wavelet bands. © Springer-Verlag 2004.
CITATION STYLE
Jiménez, J., & Martí, E. (2004). Localization of caption texts in natural scenes using a wavelet transformation. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 3287, 100–107. https://doi.org/10.1007/978-3-540-30463-0_12
Mendeley helps you to discover research relevant for your work.