Algorithms for heavy-tailed statistics: Regression, covariance estimation, and beyond

21Citations
Citations of this article
22Readers
Mendeley users who have this article in their library.

Abstract

We study polynomial-time algorithms for linear regression and covariance estimation in the absence of strong (Gaussian) assumptions on the underlying distributions of samples, making assumptions instead about only finitely-many moments. We focus on how many samples are required to perform estimation and regression with high accuracy and exponentially-good success probability in the face of heavy-tailed data. For covariance estimation, linear regression, and several other problems in high-dimensional statistics, estimators have recently been constructed whose sample complexities and rates of statistical error match what is possible when the underlying distribution is Gaussian, but known algorithms for these estimators require exponential time. We narrow the gap between the Gaussian and heavy-tailed settings for polynomial-time estimators with: (a) a polynomial-time estimator which takes n samples from a d-dimensional random vector X with covariance ς and produces ς such that in spectral norm ||ς - ς ||2 ≤ Õ(d3/4/gn) w.p. 1-2-d where the information-theoretically optimal error bound is Õ(gd/n), while previous approaches to polynomial-time algorithms were stuck at Õ(d/gn) and (b) a polynomial-time algorithm which takes n samples (Xi,Yi) where Yi = u,Xi »+ i where both X and have a constant number of bounded moments and produces u such that the loss ||u - u||2 ≤ O(d/n) w.p. 1-2-d for any n ≥ d3/2 log(d). This (information-theoretically optimal) error is achieved by inefficient algorithms for any n ≫ d, while previous approaches to polynomial-time algorithms suffer loss ω(d2/n) and require n ≫ d2. Our algorithms make crucial use of degree-8 sum-of-squares semidefinite programs. Both apply to any X which has constantly-many certifiably hypercontractive moments. We offer preliminary evidence that improving on these rates of error in polynomial time is not possible in the median of means framework our algorithms employ. Our work introduces new techniques to high-probability estimation, and suggests numerous new algorithmic questions in the following vein: when is it computationally feasible to do statistics in high dimensions with Gaussian-style errors when data is far from Gaussian?

Cite

CITATION STYLE

APA

Cherapanamjeri, Y., Hopkins, S. B., Kathuria, T., Raghavendra, P., & Tripuraneni, N. (2020). Algorithms for heavy-tailed statistics: Regression, covariance estimation, and beyond. In Proceedings of the Annual ACM Symposium on Theory of Computing (pp. 601–609). Association for Computing Machinery. https://doi.org/10.1145/3357713.3384329

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free