High-dimensional correlation matrix estimation for general continuous data with Bagging technique

3Citations
Citations of this article
8Readers
Mendeley users who have this article in their library.

Abstract

High-dimensional covariance matrix estimation plays a central role in multivariate statistical analysis. It is well-known that the sample covariance matrix is singular when the sample size is smaller than the dimension of the variable, but the covariance estimate must be positive-definite. This motivates some modifications of the sample covariance matrix to preserve its efficient estimation of pairwise covariance. In this paper, we modify the sample correlation matrix using the Bagging technique. The proposed Bagging estimator is flexible for general continuous data. Under some mild conditions, we show theoretically that the Bagging estimator can ensure positive-definiteness with probability one in finite samples. We also prove the consistency of the bootstrap estimator of Pearson correlation and the consistency of our Bagging estimator when the dimension p is fixed. Simulation results and a real application are provided to demonstrate that our method strikes a better balance between RMSE and likelihood, and is more robust, than other existing estimators.

Cite

CITATION STYLE

APA

Wang, C., Du, J., & Fan, X. (2022). High-dimensional correlation matrix estimation for general continuous data with Bagging technique. Machine Learning, 111(8), 2905–2927. https://doi.org/10.1007/s10994-022-06138-3

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free