Content-Based Image Indexing by Data Clustering and Inverse Document Frequency

Rafał Grycuk; Marcin Gabryel; Marcin Korytkowski; Rafał Scherer

Conference Proceedings

Communications in Computer and Information Science (2014) 424 374-383

DOI: 10.1007/978-3-319-06932-6_36

26 Citations

3 Readers

Abstract

In this paper we present an algorithm for creating and searching large image databases. Effectively browsing and searching such collections of images by their content is one of the most important challenges in computer science. In the presented algorithm, inserting data into the database consists of several stages. First, interest points are extracted from the images by, e.g., the SIFT, SURF or PCA-SIFT algorithms. The resulting huge number of key points is then reduced by data clustering, in our case by a novel, parameterless version of the mean shift algorithm. The reduction is achieved by subsequently operating on the generated cluster centers. This algorithm has been adapted specifically for the presented method. Cluster centers are treated as terms and images as documents in the term frequency-inverse document frequency (TF-IDF) algorithm. The TF-IDF algorithm makes it possible to create an indexed image database and to retrieve desired images quickly. The proposed approach is validated by numerical experiments on images with different content. © Springer International Publishing Switzerland 2014.
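The indexing idea in the abstract, with cluster centers as terms and images as documents, can be illustrated with a minimal sketch. It assumes that key points have already been extracted and clustered, so each image is reduced to a list of cluster ids. It uses the textbook TF-IDF weighting with cosine similarity for ranking, which may differ from the weighting and matching used in the paper. All function names and the toy data are hypothetical.

```python
import math
from collections import Counter


def tfidf_vector(terms, idf):
    """Weight each term by its relative frequency times its IDF."""
    counts = Counter(terms)
    total = len(terms)
    # Terms that were not seen at indexing time have no IDF and are skipped.
    return {t: (c / total) * idf[t] for t, c in counts.items() if t in idf}


def build_index(images):
    """images: dict mapping image id -> list of cluster ids (visual terms)."""
    n_images = len(images)
    # Document frequency: number of images in which a term occurs at least once.
    df = Counter(t for terms in images.values() for t in set(terms))
    idf = {t: math.log(n_images / count) for t, count in df.items()}
    index = {image_id: tfidf_vector(terms, idf) for image_id, terms in images.items()}
    return index, idf


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def search(query_terms, index, idf):
    """Return image ids ordered from most to least similar to the query."""
    query = tfidf_vector(query_terms, idf)
    scores = [(cosine(query, vec), image_id) for image_id, vec in index.items()]
    return [image_id for _, image_id in sorted(scores, key=lambda s: (-s[0], s[1]))]


# Hypothetical toy data: each image is the list of cluster ids of its key points.
images = {
    "cat.jpg": [0, 0, 1, 2],
    "dog.jpg": [1, 2, 2, 3],
    "car.jpg": [4, 4, 5, 2],
}
index, idf = build_index(images)
ranking = search([0, 0, 1], index, idf)
```

In this toy collection, cluster 2 occurs in every image, so its IDF is zero and it contributes nothing to the ranking. That is the intended effect of the IDF factor: key-point clusters common to all images carry no discriminative weight.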

Cite

Grycuk, R., Gabryel, M., Korytkowski, M., & Scherer, R. (2014). Content-Based Image Indexing by Data Clustering and Inverse Document Frequency. In Communications in Computer and Information Science (Vol. 424, pp. 374–383). Springer Verlag. https://doi.org/10.1007/978-3-319-06932-6_36
