Content-based retrieval has been identified as one of the most challenging problems, requiring a multidisciplinary research among computer vision, information retrieval, artificial intelligence, database, and other fields. In this paper, we address the specific aspect of inferring semantics automatically from raw video data. In particular, we present the Cobra video database management system that supports the integrated use of different knowledge-based methods for mapping low-level features to high-level concepts. We focus on dynamic Bayesian networks and demonstrate how they can be effectively used for fusing the evidence obtained from different media information sources. The approach is validated in the particular domain of Formula 1 race videos. For that specific domain we introduce a robust audio-visual feature extraction scheme and a text recognition and detection method. Based on numerous experiments performed with DBNs, we give some recommendations with respect to the modeling of temporal dependences and different learning algorithms. Finally, we present the experimental results for the detection of excited speech and the extraction of highlights, as well as the advantageous query capabilities of our system. © Springer-Verlag Berlin Heidelberg 2002.
CITATION STYLE
Petković, M., Mihajlović, V., & Jonker, W. (2002). Extending a DBMS to support content-based video retrieval: A formula 1 case study. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 2490 LNCS, pp. 318–341). Springer Verlag. https://doi.org/10.1007/3-540-36128-6_18
Mendeley helps you to discover research relevant for your work.