Pursuing atomic video words by information projection

Abstract

In this paper, we study mathematical models of atomic visual patterns in natural videos and establish a generative visual vocabulary for video representation. Empirically, we use small video patches (e.g., 15×15×5, called video "bricks") from natural videos as the basic units of analysis. The high-dimensional brick space contains a variety of brick subspaces (or atomic video words) of varying dimensions, whose structures are characterized by both appearance and motion dynamics. We categorize the words into two pure types: structural video words (SVWs) and textural video words (TVWs). A common generative model is introduced to represent these two types of video words in a unified form. The representational power of a word is measured by its information gain, based on which words are pursued one by one via a novel pursuit algorithm, and a holistic video vocabulary is finally built up. Experimental results show the potential of our framework for video representation. © 2011 Springer-Verlag Berlin Heidelberg.
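
The abstract describes a greedy, information-gain-driven pursuit over small spatio-temporal patches ("bricks"). The sketch below is a minimal, hypothetical illustration of that idea in Python/NumPy: it extracts 15×15×5 bricks from a grayscale video volume and greedily selects template-like "words" by an information-gain surrogate. The function names, the Gaussian gain measure, and the candidate-sampling step are assumptions made for illustration only; the paper's actual generative model, the SVW/TVW distinction, and the information-projection pursuit are not reproduced here.

```python
import numpy as np

def extract_bricks(video, size=(15, 15, 5), stride=(15, 15, 5)):
    """Cut a video volume (H x W x T) into spatio-temporal patches ("bricks")."""
    H, W, T = video.shape
    bh, bw, bt = size
    sh, sw, st = stride
    bricks = []
    for y in range(0, H - bh + 1, sh):
        for x in range(0, W - bw + 1, sw):
            for t in range(0, T - bt + 1, st):
                bricks.append(video[y:y+bh, x:x+bw, t:t+bt].ravel())
    return np.asarray(bricks)  # shape: (num_bricks, 15*15*5)

def information_gain(bricks, template, sigma=1.0):
    """Hypothetical gain surrogate: log-likelihood improvement of explaining
    bricks with a Gaussian centered on the template versus a zero-mean
    background model (not the paper's actual measure)."""
    d2_word = ((bricks - template) ** 2).sum(axis=1)
    d2_bg = (bricks ** 2).sum(axis=1)
    # Bricks better explained by the template contribute positive gain.
    return np.maximum(d2_bg - d2_word, 0.0).sum() / (2 * sigma ** 2)

def pursue_words(bricks, num_words=50, num_candidates=500, rng=None):
    """Greedy pursuit: repeatedly pick the candidate word with the largest gain
    on the not-yet-explained bricks, then remove the bricks it explains."""
    rng = np.random.default_rng(rng)
    remaining = bricks.copy()
    vocabulary = []
    for _ in range(num_words):
        if len(remaining) == 0:
            break
        # Candidate words: randomly sampled bricks acting as templates.
        idx = rng.choice(len(remaining),
                         size=min(num_candidates, len(remaining)),
                         replace=False)
        gains = [information_gain(remaining, remaining[i]) for i in idx]
        best = remaining[idx[int(np.argmax(gains))]]
        vocabulary.append(best)
        # Drop bricks that the chosen word explains better than the background.
        explained = ((remaining - best) ** 2).sum(axis=1) < (remaining ** 2).sum(axis=1)
        remaining = remaining[~explained]
    return np.asarray(vocabulary)
```

A typical call, under these assumptions, would be `vocab = pursue_words(extract_bricks(gray_video))`, yielding a vocabulary matrix with one pursued word per row.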

Cite

APA

Zhao, Y., Gong, H., & Jia, Y. (2011). Pursuing atomic video words by information projection. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6493 LNCS, pp. 254–267). https://doi.org/10.1007/978-3-642-19309-5_20
