Partitioned approach for high-dimensional confidence intervals with large split sizes

Abstract

With the availability of massive data sets, making accurate inferences at low computational cost is key to improving scalability. When both the sample size and the dimensionality are large, naively applying the de-biasing idea to construct confidence intervals can be computationally inefficient or infeasible, because the de-biasing procedure increases the computational cost by an order of magnitude relative to the initial penalized estimation. We therefore suggest a split-and-conquer approach to improve the scalability of the de-biasing procedure, and show that the length of the resulting confidence interval is asymptotically the same as that obtained using all of the data at once. Moreover, we demonstrate a significant improvement in the largest allowable split size by separating the initial estimation and the relaxed projection steps, which reveals that the sample sizes these two steps require for statistical guarantees are different. Finally, a refined inference procedure is proposed to address the inflation issue in finite-sample performance when the split size becomes large. Numerical studies evidence both the computational advantage and the theoretical guarantees of the new methodology.
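To make the described procedure concrete, below is a minimal sketch of the split-and-conquer de-biasing idea, assuming a linear model y = X beta + noise, an initial lasso fit per split, and a nodewise-lasso relaxed projection for the coordinate of interest. The split count K, the tuning parameters, and the function debiased_lasso_ci are illustrative assumptions, not the authors' exact algorithm or the refined procedure mentioned in the abstract.

```python
# Sketch: split-and-conquer de-biased lasso confidence interval for one
# coordinate. All tuning choices here are illustrative assumptions.
import numpy as np
from scipy.stats import norm
from sklearn.linear_model import Lasso

def debiased_lasso_ci(X, y, j, K=4, alpha_lasso=0.1, level=0.95):
    """Split the n rows into K subsamples, de-bias the lasso estimate of
    coordinate j on each split, then average to form one confidence interval."""
    n, p = X.shape
    splits = np.array_split(np.random.permutation(n), K)
    estimates, variances = [], []
    for idx in splits:
        Xk, yk = X[idx], y[idx]
        nk = len(idx)
        # Step 1: initial penalized estimation on this split.
        beta_hat = Lasso(alpha=alpha_lasso).fit(Xk, yk).coef_
        # Step 2: relaxed projection via a nodewise lasso for column j.
        others = np.delete(np.arange(p), j)
        gamma = Lasso(alpha=alpha_lasso).fit(Xk[:, others], Xk[:, j]).coef_
        z = Xk[:, j] - Xk[:, others] @ gamma            # projection residual
        resid = yk - Xk @ beta_hat
        b_j = beta_hat[j] + z @ resid / (z @ Xk[:, j])  # de-biased estimate
        sigma2 = resid @ resid / nk                     # noise-variance estimate
        var_j = sigma2 * (z @ z) / (z @ Xk[:, j]) ** 2
        estimates.append(b_j)
        variances.append(var_j)
    # Aggregate: average the K split estimates; the variance of the
    # average of (approximately) independent estimates shrinks by 1/K.
    b_bar = np.mean(estimates)
    se = np.sqrt(np.mean(variances) / K)
    q = norm.ppf(0.5 + level / 2)
    return b_bar - q * se, b_bar + q * se
```

Because each split only runs the lasso and one nodewise regression on n/K observations, the de-biasing cost per machine drops roughly in proportion to K, which is the computational gain the abstract refers to; the abstract's point about separating the two steps is that the initial estimation and the relaxed projection tolerate different minimum split sample sizes.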

Citation (APA)

Zheng, Z., Zhang, J., Li, Y., & Wu, Y. (2020). Partitioned approach for high-dimensional confidence intervals with large split sizes. Statistica Sinica, 30(1). https://doi.org/10.5705/SS.202018.0379
