Collective principal component analysis from distributed, heterogeneous data

Hillol Kargupta; Weiyun Huang; Krishnamoorthy Sivakumar; Byung Hoon Park; Shuren Wang

Conference Proceedings, Open Access

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2000), vol. 1910, pp. 452–457

DOI: 10.1007/3-540-45372-5_50

Abstract

Principal component analysis (PCA) is a statistical technique to identify the dependency structure of multivariate stochastic observations. PCA is frequently used in data mining applications. This paper considers PCA in the context of the emerging network-based computing environments. It offers a technique to perform PCA from distributed and heterogeneous data sets with relatively small communication overhead. The technique is evaluated against different data sets, including a data set for a web mining application. This approach is likely to facilitate the development of distributed clustering, associative link analysis, and other heterogeneous data mining applications that frequently use PCA.
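
The abstract does not spell out the algorithm, so the following is only a minimal sketch of one way PCA could be approximated when the features of each observation are split across sites and raw data should not be shipped. It assumes two sites whose rows are aligned by a shared key, that each site sends only its top local component and one score per row, and that a central site combines them. All function names (`local_summary`, `combine`, and the helpers) and the synthetic data are hypothetical illustrations, not the authors' method.

```python
import math
import random


def center(rows):
    """Subtract the column means from every row."""
    n, d = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(d)]
    return [[r[j] - means[j] for j in range(d)] for r in rows]


def covariance(rows):
    """Sample covariance matrix of already-centered rows."""
    n, d = len(rows), len(rows[0])
    return [[sum(r[i] * r[j] for r in rows) / (n - 1) for j in range(d)]
            for i in range(d)]


def top_eigenvector(matrix, iterations=200):
    """Dominant eigenvector of a symmetric PSD matrix via power iteration."""
    v = [1.0] * len(matrix)
    for _ in range(iterations):
        w = [sum(m * x for m, x in zip(row, v)) for row in matrix]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    return v


def local_summary(rows):
    """Run at each site: top local component plus a 1-D score for every row."""
    centered = center(rows)
    direction = top_eigenvector(covariance(centered))
    scores = [sum(x * d for x, d in zip(r, direction)) for r in centered]
    return direction, scores


def combine(summaries):
    """Run centrally: PCA on the stacked local scores, then map the result
    back to the full feature space using each site's local direction."""
    directions = [d for d, _ in summaries]
    stacked = [list(row) for row in zip(*(s for _, s in summaries))]
    weights = top_eigenvector(covariance(center(stacked)))
    return [w * x for w, d in zip(weights, directions) for x in d]


# Synthetic, assumed data: one latent factor drives the features at both sites.
rng = random.Random(0)


def noise():
    return rng.gauss(0, 0.1)


latent = [rng.gauss(0, 1) for _ in range(200)]
site_a = [[t + noise(), 2 * t + noise()] for t in latent]
site_b = [[-t + noise(), 0.5 * t + noise(), 3 * t + noise()] for t in latent]

approx = combine([local_summary(site_a), local_summary(site_b)])

# Centralized reference, computed here only to check the approximation.
full = [a + b for a, b in zip(site_a, site_b)]
exact = top_eigenvector(covariance(center(full)))
agreement = abs(sum(x * y for x, y in zip(approx, exact)))
print(round(agreement, 3))
```

In this sketch each site transmits one number per row plus a short direction vector instead of its full feature block, which is the kind of communication saving the abstract refers to. Keeping a single component per site is a simplification that works here because the synthetic data has one dominant factor; real data would generally need more components per site.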

Citation (APA)

Kargupta, H., Huang, W., Sivakumar, K., Park, B. H., & Wang, S. (2000). Collective principal component analysis from distributed, heterogeneous data. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 1910, pp. 452–457). Springer Verlag. https://doi.org/10.1007/3-540-45372-5_50
