Movie Content Analysis, Indexing and Skimming Via Multimodal Information

Ying Li; Shrikanth Narayanan; C.-C. Jay Kuo

Book Chapter

Movie Content Analysis, Indexing and Skimming Via Multimodal Information

Li Y
Narayanan S
Kuo C

Springer US, (2003), 123-154

DOI: 10.1007/978-1-4757-6928-9_5

N/ACitations

10Readers

Get full text

Abstract

A content-based movie analysis, indexing and skimming system is developed in this research. Specifically, it includes the following three major modules: 1) an event detection module, where three types of movie events, namely, two-speaker dialogs, multiple-speaker dialogs, and hybrid events are extracted from the content. Multiple media cues such as audio, speech, visual and face information are integrated to achieve this goal; 2) a speaker identification module, where an adaptive speaker identification scheme is proposed to recognize target movie cast members for content indexing purposes. Both audio and visual sources are exploited in the identification process, where the audio source is analyzed to recognize speakers using a likelihood-based approach, and the visual source is examined to locate talking faces with face detection/recognition and mouth tracking techniques; 3) a movie skimming module, where an event-based skimming system is developed to abstract movie content in the form of a short video clip for content browsing purposes. Extensive experiments on integrating multiple media cues for movie content analysis, indexing and skimming have yielded encouraging results.

Cite

CITATION STYLE

APA

Li, Y., Narayanan, S., & Kuo, C.-C. J. (2003). Movie Content Analysis, Indexing and Skimming Via Multimodal Information. In Video Mining (pp. 123–154). Springer US. https://doi.org/10.1007/978-1-4757-6928-9_5

Movie Content Analysis, Indexing and Skimming Via Multimodal Information

Abstract

Cite

Register to see more suggestions