Linear storage and potentially constant time hierarchical clustering using the Baire metric and random spanning paths

4Citations
Citations of this article
4Readers
Mendeley users who have this article in their library.
Get full text

Abstract

We study how random projections can be used with large data sets in order (1) to cluster the data using a fast, binning approach which is characterized in terms of direct inducing of a hierarchy through use of the Bairemetric; and (2) based on clusters found, selecting subsets of the original data for further analysis. In this work, we focus on random projection that is used for processing high dimensional data. A random projection, outputting a random permutation of the observation set, provides a random spanning path. We show how a spanning path relates to contiguity- or adjacency-constrained clustering.We study performance properties of hierarchical clustering constructed from random spanning paths, and we introduce a novel visualization of the results.

Cite

CITATION STYLE

APA

Murtagh, F., & Contreras, P. (2016). Linear storage and potentially constant time hierarchical clustering using the Baire metric and random spanning paths. In Studies in Classification, Data Analysis, and Knowledge Organization (pp. 43–52). Kluwer Academic Publishers. https://doi.org/10.1007/978-3-319-25226-1_4

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free