Energy Minimization Methods in Computer Vision and Pattern Recognition

  • Ommer B
  • Buhmann J
  • Yuille A
  • et al.
ISSN: 0302-9743
N/ACitations
Citations of this article
3Readers
Mendeley users who have this article in their library.

Abstract

The complexity of visual representations is substantially limited by the compositional nature of our visual world which, therefore, renders learning structured object models feasible. During recognition, such structured models might however be disadvantageous, especially under the high computational demands of video. This contribution presents a compositional approach to video analysis that demonstrates the value of compositionality for both, learning of structured object models and recognition in near real-time. We unite category-level, multi-class object recognition, segmentation, and tracking in the same probabilistic graphical model. A model selection strategy is pursued to facilitate recognition and tracking of multiple objects that appear simultaneously in a video. Object models are learned from videos with heavy clutter and camera motion where only an overall category label for a training video is provided, but no hand-segmentation or localization of objects is required. For evaluation purposes a video categorization database is assembled and experiments convincingly demonstrate the suitability of the approach.

Author supplied keywords

Cite

CITATION STYLE

APA

Ommer, B., Buhmann, J., Yuille, A., Zhu, S.-C., Cremers, D., & Wang, Y. (2007). Energy Minimization Methods in Computer Vision and Pattern Recognition. (A. L. Yuille, S.-C. Zhu, D. Cremers, & Y. Wang, Eds.) (Vol. 4679, pp. 318–333). Berlin, Heidelberg: Springer Berlin Heidelberg. Retrieved from http://www.springerlink.com/content/75757v5k13314522/

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free