An automatic video text detection, localization and extraction approach

Chengjun Zhu; Ouyang Yuanxin; Lei Gao; Zhenyong Chen; Xiong Zhang

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2009) 4879 LNCS 1-9

DOI: 10.1007/978-3-642-01350-8_1

Abstract

Text in video is a very compact and accurate clue for video indexing and summarization. This paper presents an algorithm that treats a word group as a special symbol in order to detect, localize, and extract video text automatically using a support vector machine (SVM). First, four Sobel operators are applied to obtain the edge map (EM) of the video frame, and the EM is segmented into blocks of size N×2N. Character features and character-group structure features are then extracted to construct a 19-dimensional feature vector, and a pre-trained SVM classifies each block as either text or non-text. Second, a dilatation-shrink process is employed to adjust the text position. Finally, text regions are enhanced using information from multiple frames. After binarization of the enhanced text region, the text region with a clean background is recognized by OCR software. Experimental results show that the proposed method can detect, localize, and extract video text with high accuracy. © Springer-Verlag Berlin Heidelberg 2009.
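
The first stage described in the abstract (four Sobel operators, then segmentation of the edge map into N×2N blocks) can be sketched as below. This is only an illustration under stated assumptions, not the authors' implementation: the abstract does not give the kernels, how the four responses are combined, or the block orientation, so the four kernel directions, the max-of-absolute-responses rule, the N-rows-by-2N-columns layout, and the function names `edge_map` and `split_blocks` are all hypothetical choices. The 19-dimensional features and the SVM classifier are not shown.

```python
# Assumed kernels: horizontal-gradient, vertical-gradient, and two diagonal
# Sobel-style masks. The abstract only says "four Sobel operators".
SOBEL_KERNELS = [
    [[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]],    # responds to vertical edges
    [[-1, -2, -1], [0, 0, 0], [1, 2, 1]],    # responds to horizontal edges
    [[0, 1, 2], [-1, 0, 1], [-2, -1, 0]],    # responds to one diagonal
    [[-2, -1, 0], [-1, 0, 1], [0, 1, 2]],    # responds to the other diagonal
]


def edge_map(gray):
    """Return an edge map with the same size as `gray` (a list of rows).

    Each interior pixel holds the largest absolute response over the four
    kernels (an assumed combination rule). Border pixels are left at 0.
    """
    h, w = len(gray), len(gray[0])
    em = [[0] * w for _ in range(h)]
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            best = 0
            for k in SOBEL_KERNELS:
                r = sum(k[j][i] * gray[y + j - 1][x + i - 1]
                        for j in range(3) for i in range(3))
                best = max(best, abs(r))
            em[y][x] = best
    return em


def split_blocks(em, n):
    """Split `em` into non-overlapping blocks of n rows by 2n columns.

    Returns a list of (top, left, block) tuples. Partial blocks at the
    right and bottom edges are dropped (an assumption for simplicity).
    """
    h, w = len(em), len(em[0])
    blocks = []
    for top in range(0, h - n + 1, n):
        for left in range(0, w - 2 * n + 1, 2 * n):
            block = [row[left:left + 2 * n] for row in em[top:top + n]]
            blocks.append((top, left, block))
    return blocks


# Tiny synthetic frame: dark left half, bright right half (a vertical edge).
frame = [[0] * 4 + [100] * 4 for _ in range(8)]
em = edge_map(frame)
blocks = split_blocks(em, 2)
```

In a pipeline like the one the abstract outlines, each block returned by `split_blocks` would be turned into a feature vector and passed to the classifier; the `(top, left)` offsets keep track of where a block classified as text sits in the frame.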

Cite

Zhu, C., Yuanxin, O., Gao, L., Chen, Z., & Zhang, X. (2009). An automatic video text detection, localization and extraction approach. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4879 LNCS, pp. 1–9). https://doi.org/10.1007/978-3-642-01350-8_1
