Enhanced sports image annotation and retrieval based upon semantic analysis of multimodal cues

Abstract

This paper presents a framework for semi-automatic annotation and semantic image retrieval, applied to the sports domain, based upon semantic analysis of both the text captions and the visual features of images. Unstructured text captions are analysed to extract concepts and restructure them into a semantic model. SVM classification of the multi-dominant colours and edge ratio information of the images is used to classify the sport genre. The novelty of the proposed semantic framework is that it can both find indirectly relevant concepts (concepts not directly referred to) in the visual information and represent the semantics of images at a higher level by combining image captions and visual feature information. In addition, integrating LSI (latent semantic indexing) into the semantic framework enables the proposed system to tolerate ontology imperfections. Experimental results show that the use of the semantic approach significantly enhances image retrieval. Semantic visual information classification and retrieval based upon multimodal cues. © 2009 Springer Berlin Heidelberg.
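The abstract names two visual features, multi-dominant colours and edge ratio, without saying how they are computed. The sketch below is one plausible reading, not the authors' method: dominant colours are taken as the most frequent coarsely quantised RGB values, and edge ratio as the fraction of pixels whose intensity difference to a neighbour exceeds a threshold. The function names, the `bins` and `threshold` parameters, and the quantisation scheme are all assumptions made for illustration. The resulting numbers could then form a feature vector for an SVM classifier.

```python
from collections import Counter


def dominant_colours(pixels, k=3, bins=4):
    """Return the k most frequent quantised RGB colours with their pixel share.

    Assumption: colours are quantised into `bins` levels per channel;
    the paper does not specify its own scheme.
    """
    step = 256 // bins
    counts = Counter((r // step, g // step, b // step) for r, g, b in pixels)
    total = len(pixels)
    return [(colour, n / total) for colour, n in counts.most_common(k)]


def edge_ratio(gray, threshold=50):
    """Return the fraction of pixels with a strong right or down intensity step.

    Assumption: a simple neighbour difference stands in for whatever
    edge detector the authors used.
    """
    rows, cols = len(gray), len(gray[0])
    edges = 0
    for y in range(rows - 1):
        for x in range(cols - 1):
            dx = abs(gray[y][x + 1] - gray[y][x])
            dy = abs(gray[y + 1][x] - gray[y][x])
            if max(dx, dy) > threshold:
                edges += 1
    return edges / ((rows - 1) * (cols - 1))


# Toy data: a mostly green image (e.g. a pitch) with one white pixel,
# and a grayscale grid with a single vertical edge.
pixels = [(0, 200, 0)] * 3 + [(255, 255, 255)]
gray = [[0, 0, 255], [0, 0, 255], [0, 0, 255]]

colours = dominant_colours(pixels, k=2)
ratio = edge_ratio(gray)
```

On this toy input the green bin accounts for three quarters of the pixels, and half of the compared positions sit on the edge.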

Cite

Kesorn, K., & Poslad, S. (2009). Enhanced sports image annotation and retrieval based upon semantic analysis of multimodal cues. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5414 LNCS, pp. 817–828). https://doi.org/10.1007/978-3-540-92957-4_71
