Subspace perspective on canonical correlation analysis: Dimension reduction and minimax rates

8Citations
Citations of this article
17Readers
Mendeley users who have this article in their library.

Abstract

Canonical correlation analysis (CCA) is a fundamental statistical tool for exploring the correlation structure between two sets of random variables. In this paper, motivated by the recent success of applying CCA to learn low dimensional representations of high dimensional objects, we propose two losses based on the principal angles between the model spaces spanned by the sample canonical variates and their population correspondents, respectively. We further characterize the non-asymptotic error bounds for the estimation risks under the proposed error metrics, which reveal how the performance of sample CCA depends adaptively on key quantities including the dimensions, the sample size, the condition number of the covariance matrices and particularly the population canonical correlation coefficients. The optimality of our uniform upper bounds is also justified by lower-bound analysis based on stringent and localized parameter spaces. To the best of our knowledge, for the first time our paper separates p1 and p2 for the first order term in the upper bounds without assuming the residual correlations are zeros. More significantly, our paper derives (1 -λ2k )(1 -λ2k +1)/(λk -λk+1)2 for the first time in the non-asymptotic CCA estimation convergence rates, which is essential to understand the behavior of CCA when the leading canonical correlation coefficients are close to 1.

Cite

CITATION STYLE

APA

Ma, Z., & Li, X. (2020). Subspace perspective on canonical correlation analysis: Dimension reduction and minimax rates. Bernoulli, 26(1), 432–470. https://doi.org/10.3150/19-BEJ1131

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free