Probabilistic penalized principal component analysis

Chongsun Park; Morgan C. Wang; Eun Bi Mo

Journal ArticleOPEN ACCESS

Probabilistic penalized principal component analysis

Communications for Statistical Applications and Methods (2017) 24(2) 143-154

DOI: 10.5351/CSAM.2017.24.2.143

1Citations

152Readers

Abstract

A variable selection method based on probabilistic principal component analysis (PCA) using penalized likelihood method is proposed. The proposed method is a two-step variable reduction method. The first step is based on the probabilistic principal component idea to identify principle components. The penalty function is used to identify important variables in each component. We then build a model on the original data space instead of building on the rotated data space through latent variables (principal components) because the proposed method achieves the goal of dimension reduction through identifying important observed variables. Consequently, the proposed method is of more practical use. The proposed estimators perform as the oracle procedure and are root-n consistent with a proper choice of regularization parameters. The proposed method can be successfully applied to high-dimensional PCA problems with a relatively large portion of irrelevant variables included in the data set. It is straightforward to extend our likelihood method in handling problems with missing observations using EM algorithms. Further, it could be effectively applied in cases where some data vectors exhibit one or more missing values at random.

Author supplied keywords

Cite

CITATION STYLE

APA

Park, C., Wang, M. C., & Mo, E. B. (2017). Probabilistic penalized principal component analysis. Communications for Statistical Applications and Methods, 24(2), 143–154. https://doi.org/10.5351/CSAM.2017.24.2.143

Probabilistic penalized principal component analysis

Abstract

Author supplied keywords

Cite

Register to see more suggestions