'Video data mining' is a technique to discover useful patterns from videos. It plays an important role in efficient video management. Particularly, we concentrate on extracting useful editing patterns from movies. These editing patterns are useful for an amateur editor to produce a new, more attractive video. But, it is essential to extract editing patterns associated with their semantic contents, called 'semantic structures'. Otherwise the amateur editor can't determine how to use the extracted editing patterns during the process of editing a new video. In this paper, we propose two approaches to extract semantic structures from a movie, based on two different time series models of the movie. In one approach, the movie is represented as a multi-stream of metadata derived from visual and audio features in each shot. In another approach, the movie is represented as one-dimensional time series consisting of durations of target character's appearance and disappearance. To both time series models, we apply data mining techniques. As a result, we extract the semantic structures about shot transitions and about how the target character appears on the screen and disappears from the screen. © Springer-Verlag Berlin Heidelberg 2005.
CITATION STYLE
Shirahama, K., Matsuo, Y., & Uehara, K. (2005). Mining semantic structures in movies. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3392 LNAI, pp. 116–133). Springer Verlag. https://doi.org/10.1007/11415763_8
Mendeley helps you to discover research relevant for your work.