Learning geometric concepts with nasty noise

Ilias Diakonikolas; Daniel M. Kane; Alistair Stewart

Conference Proceedings

Proceedings of the Annual ACM Symposium on Theory of Computing (2018) 1061-1073

DOI: 10.1145/3188745.3188754

Abstract

We study the efficient learnability of geometric concept classes, specifically low-degree polynomial threshold functions (PTFs) and intersections of halfspaces, when a fraction of the training data is adversarially corrupted. We give the first polynomial-time PAC learning algorithms for these concept classes with dimension-independent error guarantees in the presence of nasty noise under the Gaussian distribution. In the nasty noise model, an omniscient adversary can arbitrarily corrupt a small fraction of both the unlabeled data points and their labels. This model generalizes well-studied noise models, including the malicious noise model and the agnostic (adversarial label noise) model. Prior to our work, the only concept class for which efficient malicious learning algorithms were known was the class of origin-centered halfspaces. At the core of our results is an efficient algorithm to approximate the low-degree Chow parameters of any bounded function in the presence of nasty noise. Our robust approximation algorithm for the Chow parameters provides near-optimal error guarantees for a range of distribution families satisfying mild concentration bounds and moment conditions. At the technical level, this algorithm employs an iterative “spectral” technique for outlier detection and removal inspired by recent work in robust unsupervised learning, which makes essential use of low-degree multivariate polynomials. Our robust learning algorithm for low-degree PTFs provides dimension-independent error guarantees for a class of tame distributions, including Gaussians and, more generally, any logconcave distribution with (approximately) known low-degree moments. For linear threshold functions (LTFs) under the Gaussian distribution, using a refinement of the localization technique, we give a polynomial-time algorithm that achieves a near-optimal error of O(ε), where ε is the noise rate. Our robust learning algorithm for intersections of halfspaces proceeds by projecting down to an appropriate low-dimensional subspace. Its correctness makes essential use of a novel robust inverse independence lemma that is of independent interest.
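To make the idea of iterative spectral outlier removal concrete, the following is a minimal sketch for the simplest case only: the degree-1 Chow parameters E[f(x)·x] of a halfspace under the standard Gaussian. It is not the algorithm from the paper. The function names, the eigenvalue threshold of 1.5, the 2% drop fraction per round, and the specific adversary (which moves an ε fraction of points to one far-away location with label +1) are all assumptions chosen for illustration. The stopping rule relies on one fact: for a ±1-valued f and x ~ N(0, I), the vectors z = f(x)·x satisfy E[zzᵀ] = I, so an empirical covariance eigenvalue well above 1 signals corruption.

```python
import numpy as np


def empirical_chow(x, y):
    """Plain empirical degree-1 Chow parameters: the average of y_i * x_i."""
    return (y[:, None] * x).mean(axis=0)


def filtered_chow(x, y, threshold=1.5, drop_frac=0.02, max_rounds=100):
    """Simplified spectral filter (illustrative heuristic, not the paper's method).

    Repeatedly checks the top eigenvalue of the empirical covariance of
    z_i = y_i * x_i. While it exceeds `threshold` (an assumed constant),
    the points deviating most along the top eigenvector are discarded.
    """
    z = y[:, None] * x
    for _ in range(max_rounds):
        cov = np.cov(z, rowvar=False)
        eigvals, eigvecs = np.linalg.eigh(cov)  # eigenvalues in ascending order
        if eigvals[-1] <= threshold:
            break  # covariance looks consistent with clean Gaussian data
        proj = z @ eigvecs[:, -1]
        scores = (proj - np.median(proj)) ** 2
        n_drop = max(1, int(drop_frac * len(z)))
        keep = np.argsort(scores)[: len(z) - n_drop]  # drop the highest scores
        z = z[keep]
    return z.mean(axis=0)


def demo(n=5000, d=20, eps=0.1, seed=0):
    """Compare the naive and filtered estimates on synthetically corrupted data."""
    rng = np.random.default_rng(seed)
    x = rng.standard_normal((n, d))
    y = np.where(x[:, 0] >= 0, 1.0, -1.0)  # halfspace sign(x_1)

    # Exact degree-1 Chow parameters of sign(x_1) under N(0, I): E|x_1| * e_1.
    truth = np.zeros(d)
    truth[0] = np.sqrt(2.0 / np.pi)

    # Hypothetical adversary: overwrite an eps fraction of points and labels.
    k = int(eps * n)
    x[:k] = 0.0
    x[:k, 1] = 5.0
    y[:k] = 1.0

    naive_err = np.linalg.norm(empirical_chow(x, y) - truth)
    filtered_err = np.linalg.norm(filtered_chow(x, y) - truth)
    return naive_err, filtered_err


if __name__ == "__main__":
    naive_err, filtered_err = demo()
    print(f"naive error: {naive_err:.3f}, filtered error: {filtered_err:.3f}")
```

In this toy setting the corrupted points shift the naive average by roughly ε times their distance from the origin, while the filter removes most of them before averaging. A real adversary can be far subtler than the one assumed here, which is why the paper's guarantees require a more careful analysis and the use of low-degree polynomials rather than this fixed-fraction heuristic.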

Citation (APA)

Diakonikolas, I., Kane, D. M., & Stewart, A. (2018). Learning geometric concepts with nasty noise. In Proceedings of the Annual ACM Symposium on Theory of Computing (pp. 1061–1073). Association for Computing Machinery. https://doi.org/10.1145/3188745.3188754
