Text detection in images has been studied and improved for decades. There are many works that extend the existing methods for analyzing videos, however, few of them create or adapt approaches that consider inherent characteristics of videos, such as temporal information. This work proposes a very fast method for identifying video frames that contain text through a special data structure called visual rhythm. The method is robust to detect video captions with respect to font styles, color intensity, and text orientation. A data set was built in our experiments to compare and evaluate the effectiveness of the proposed method. © 2011 Springer-Verlag.
CITATION STYLE
Valio, F. B., Pedrini, H., & Leite, N. J. (2011). Fast rotation-invariant video caption detection based on visual rhythm. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7042 LNCS, pp. 157–164). https://doi.org/10.1007/978-3-642-25085-9_18
Mendeley helps you to discover research relevant for your work.