A grid-based scalable classifier for high dimensional datasets


Abstract

High dimensionality and large dataset size are two common characteristics of real-world datasets and databases, and they pose particular challenges for classification. Classification algorithms that perform well (in terms of scalability and efficiency) on small and medium datasets of moderate dimensionality fail to scale to large, high-dimensional datasets. In this paper, we therefore propose a scalable classifier for large, high-dimensional datasets. The proposed method derives its scalability from grid-based partitioning: it divides the data space into small partitions called cells and maps the data onto the partitioned data space. Instead of handling individual data points, the classifier works with these abstract entities, the cells, which decreases the classification runtime for large, high-dimensional datasets. The presented experimental results demonstrate the scalability and efficiency of our algorithm. © 2010 Springer-Verlag Berlin Heidelberg.
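
The abstract does not give the details of the algorithm, so the following is only a minimal sketch of the general idea of grid-based classification, not the authors' method. It assumes equal-width bins per dimension, a majority-vote label per occupied cell, and a fallback to the overall majority class for empty cells; the names `cell_of`, `fit_grid`, `predict`, and the `bins` parameter are hypothetical.

```python
from collections import Counter, defaultdict


def cell_of(point, lows, widths, bins):
    """Map a point to the index tuple of the grid cell that contains it."""
    idx = []
    for x, lo, w in zip(point, lows, widths):
        i = int((x - lo) / w) if w > 0 else 0
        idx.append(min(max(i, 0), bins - 1))  # clamp points on or outside the boundary
    return tuple(idx)


def fit_grid(points, labels, bins=4):
    """Partition the bounding box of the data into cells and label each occupied cell.

    Assumption: equal-width bins and majority vote per cell (illustrative choices).
    """
    dims = len(points[0])
    lows = [min(p[d] for p in points) for d in range(dims)]
    highs = [max(p[d] for p in points) for d in range(dims)]
    widths = [(hi - lo) / bins for lo, hi in zip(lows, highs)]

    # Only occupied cells are stored, so memory grows with the data, not with bins**dims.
    counts = defaultdict(Counter)
    for p, y in zip(points, labels):
        counts[cell_of(p, lows, widths, bins)][y] += 1

    return {
        "lows": lows,
        "widths": widths,
        "bins": bins,
        "cells": {c: cnt.most_common(1)[0][0] for c, cnt in counts.items()},
        "default": Counter(labels).most_common(1)[0][0],  # fallback for empty cells
    }


def predict(model, point):
    """Classify a point by the label of its cell, not by comparing against data points."""
    c = cell_of(point, model["lows"], model["widths"], model["bins"])
    return model["cells"].get(c, model["default"])


points = [(0.1, 0.2), (0.2, 0.1), (0.15, 0.15), (0.9, 0.8), (0.8, 0.9)]
labels = ["a", "a", "a", "b", "b"]
model = fit_grid(points, labels, bins=2)
print(predict(model, (0.85, 0.85)))
```

In this sketch, prediction costs one cell lookup regardless of how many training points there are, which is the kind of saving the abstract attributes to working with cells rather than individual points.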



Saini, S., & Dua, S. (2010). A grid-based scalable classifier for high dimensional datasets. Communications in Computer and Information Science, 54, 404–415. https://doi.org/10.1007/978-3-642-12035-0_42
