Contribution of spatio-temporal intensity variation to bottom-up saliency

Eleonora Vig; Michael Dorr; Erhardt Barth

Conference Proceedings

Contribution of spatio-temporal intensity variation to bottom-up saliency

Lecture Notes of the Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering (2012) 87 LNICST 469-474

DOI: 10.1007/978-3-642-32615-8_44

1Citations

3Readers

Get full text

Abstract

We investigate the contribution of local spatio-temporal variation of image intensity to saliency. To measure different types of variation, we use the geometrical invariants of the structure tensor. With a video represented in spatial axes x and y and temporal axis t, the n-dimensional structure tensor can be evaluated for different combinations of axes (2D and 3D) and also for the (degenerate) case of only one axis. The resulting features are evaluated on several spatio-temporal scales in terms of how well they can predict eye movements on complex videos. We find that a 3D structure tensor is optimal: the most predictive regions of a movie are those where intensity changes along all spatial and temporal directions. Among two-dimensional variations, the axis pair yt, which is sensitive to horizontal translation, outperforms xy and xt by a large margin, and is even superior in prediction to two baseline models of bottom-up saliency. © 2012 ICST Institute for Computer Science, Social Informatics and Telecommunications Engineering.

Author supplied keywords

Cite

CITATION STYLE

APA

Vig, E., Dorr, M., & Barth, E. (2012). Contribution of spatio-temporal intensity variation to bottom-up saliency. In Lecture Notes of the Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering (Vol. 87 LNICST, pp. 469–474). https://doi.org/10.1007/978-3-642-32615-8_44

Contribution of spatio-temporal intensity variation to bottom-up saliency

Abstract

Author supplied keywords

Cite

Register to see more suggestions