VC dimension and distribution-free sample-based testing

Eric Blais; Renato Ferreira Pinto; Nathaniel Harms

Conference Proceedings, Open Access

Proceedings of the Annual ACM Symposium on Theory of Computing (2021) 504-517

DOI: 10.1145/3406325.3451104

Abstract

We consider the problem of determining which classes of functions can be tested more efficiently than they can be learned, in the distribution-free sample-based model that corresponds to the standard PAC learning setting. Our main result shows that while VC dimension by itself does not always provide tight bounds on the number of samples required to test a class of functions in this model, it can be combined with a closely-related variant that we call "lower VC" (or LVC) dimension to obtain strong lower bounds on this sample complexity. We use this result to obtain strong and in many cases nearly optimal bounds on the sample complexity for testing unions of intervals, halfspaces, intersections of halfspaces, polynomial threshold functions, and decision trees. Conversely, we show that two natural classes of functions, juntas and monotone functions, can be tested with a number of samples that is polynomially smaller than the number of samples required for PAC learning. Finally, we also use the connection between VC dimension and property testing to establish new lower bounds for testing radius clusterability and testing feasibility of linear constraint systems.
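For readers unfamiliar with the central quantity in the abstract, the following sketch illustrates the standard definition of VC dimension only. It is not the paper's method and does not implement the LVC variant. It brute-forces the largest shattered subset of a small finite domain for unions of at most k intervals, one of the classes named above. The function names (`is_shattered`, `vc_dimension`, `interval_union_class`) and the six-point domain are assumptions chosen for illustration.

```python
from itertools import combinations, product


def is_shattered(points, hypotheses):
    """True if the hypotheses realize all 2^|points| labelings of `points`."""
    patterns = {tuple(h(x) for x in points) for h in hypotheses}
    return len(patterns) == 2 ** len(points)


def vc_dimension(domain, hypotheses):
    """Size of the largest subset of `domain` shattered by `hypotheses` (brute force)."""
    best = 0
    for d in range(1, len(domain) + 1):
        if any(is_shattered(s, hypotheses) for s in combinations(domain, d)):
            best = d
        else:
            # Subsets of shattered sets are shattered, so no larger set can work.
            break
    return best


def interval_union_class(domain, k):
    """Illustrative class: all 0/1 labelings of an ordered finite domain
    whose 1-labels form at most k contiguous runs (unions of <= k intervals)."""
    hypotheses = []
    for bits in product((0, 1), repeat=len(domain)):
        runs = sum(1 for i, b in enumerate(bits) if b and (i == 0 or not bits[i - 1]))
        if runs <= k:
            table = dict(zip(domain, bits))
            hypotheses.append(table.__getitem__)  # lookup function x -> label
    return hypotheses


domain = list(range(6))  # hypothetical six-point ordered domain
for k in (1, 2, 3):
    print(k, vc_dimension(domain, interval_union_class(domain, k)))
```

On this toy domain the computed value is 2k for each k tried, matching the familiar fact that unions of k intervals on the line have VC dimension 2k. The search is exponential in the domain size, so it is only suitable for small illustrations.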

Cite

Blais, E., Pinto, R. F., & Harms, N. (2021). VC dimension and distribution-free sample-based testing. In Proceedings of the Annual ACM Symposium on Theory of Computing (pp. 504–517). Association for Computing Machinery. https://doi.org/10.1145/3406325.3451104
