The Challenges of Clustering High Dimensional Data

Michael Steinbach; Levent Ertöz; Vipin Kumar

Book Chapter

The Challenges of Clustering High Dimensional Data

Steinbach M
Ertöz L
Kumar V

Springer Berlin Heidelberg, (2004), 273-309

DOI: 10.1007/978-3-662-08968-2_16

N/ACitations

497Readers

Get full text

Abstract

Cluster analysis divides data into groups (clusters) for the purposes of summarization or improved understanding. For example, cluster analysis has been used to group related documents for browsing, to find genes and proteins that have similar functionality, or as a means of data compression. While clustering has a long history and a large number of clustering techniques have been developed in statistics, pattern recognition, data mining, and other fields, significant challenges still remain. In this chapter we provide a short introduction to cluster analysis, and then focus on the challenge of clustering high dimensional data. We present a brief overview of several recent techniques, including a more detailed description of recent work of our own which uses a concept-based clustering approach. 1

Cite

CITATION STYLE

APA

Steinbach, M., Ertöz, L., & Kumar, V. (2004). The Challenges of Clustering High Dimensional Data. In New Directions in Statistical Physics (pp. 273–309). Springer Berlin Heidelberg. https://doi.org/10.1007/978-3-662-08968-2_16

The Challenges of Clustering High Dimensional Data

Abstract

Cite

Register to see more suggestions