We propose a second-order accurate method to estimate the eigenvectors of extremely large matrices thereby addressing a problem of relevance to statisticians working in the analysis of very large datasets. More specifically, we show that averaging eigenvectors of randomly subsampled matrices efficiently approximates the true eigenvectors of the original matrix under certain conditions on the incoherence of the spectral decomposition. This incoherence assumption is typically milder than those made in matrix completion and allows eigenvectors to be sparse. We discuss applications to spectral methods in dimensionality reduction and information retrieval. © 2010, Institute of Mathematical Statistics. All rights reserved.
CITATION STYLE
El Karoui, N., & D’Aspremont, A. (2010). Second order accurate distributed eigenvector computation for extremely large matrices. Electronic Journal of Statistics, 4, 1345–1385. https://doi.org/10.1214/10-EJS577
Mendeley helps you to discover research relevant for your work.