Collective principal component analysis from distributed, heterogeneous data

30Citations
Citations of this article
30Readers
Mendeley users who have this article in their library.

Abstract

Principal component analysis (PCA) is a statistical technique to identify the dependency structure of multivariate stochastic observations. PCA is frequently used in data mining applications. This paper considers PCA in the context of the emerging network-based computing environments. It offers a technique to perform PCA from distributed and heterogeneous data sets with relatively small communication overhead. The technique is evaluated against different data sets, including a data set for a web mining application. This approach is likely to facilitate the development of distributed clustering, associative link analysis, and other heterogeneous data mining applications that frequently use PCA.

Cite

CITATION STYLE

APA

Kargupta, H., Huang, W., Sivakumar, K., Park, B. H., & Wang, S. (2000). Collective principal component analysis from distributed, heterogeneous data. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 1910, pp. 452–457). Springer Verlag. https://doi.org/10.1007/3-540-45372-5_50

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free