Enhanced sports image annotation and retrieval based upon semantic analysis of multimodal cues

Kraisak Kesorn; Stefan Poslad

Conference ProceedingsOPEN ACCESS

Enhanced sports image annotation and retrieval based upon semantic analysis of multimodal cues

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2009) 5414 LNCS 817-828

DOI: 10.1007/978-3-540-92957-4_71

3Citations

10Readers

Abstract

This paper presents a framework for semi-automatic annotation and semantic image retrieval, applied to the sports domain, based upon semantic analysis of both image text captions and visual features of the image. Unstructured text captions of images are analysed in order to extract the concepts and restructure them into a semantic model. SVM classification of the multi-dominant colours and edge ratio information of the images are used to classify the sport genre. The novelty of the proposed semantic framework is that it can find both the indirectly relevant concepts (concepts not directly referred to) in the visual information and can represent the semantic of images at a higher level by combining image captions and visual feature information. In addition, integrating LSI into the semantic framework enables the proposed system to tolerate ontology imperfections. Experimental results show that the use of the semantic approach significantly enhances image retrieval. Semantic visual information classification and retrieval based upon multimodal cues. © 2009 Springer Berlin Heidelberg.

Author supplied keywords

Cite

CITATION STYLE

APA

Kesorn, K., & Poslad, S. (2009). Enhanced sports image annotation and retrieval based upon semantic analysis of multimodal cues. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5414 LNCS, pp. 817–828). https://doi.org/10.1007/978-3-540-92957-4_71

Enhanced sports image annotation and retrieval based upon semantic analysis of multimodal cues

Abstract

Author supplied keywords

Cite

Register to see more suggestions