On lower bounds for statistical learning theory

Po-Ling Loh

Article (open access)

Entropy

DOI: 10.3390/e19110617

Abstract

In recent years, tools from information theory have played an increasingly prevalent role in statistical machine learning. In addition to developing efficient, computationally feasible algorithms for analyzing complex datasets, it is of theoretical importance to determine whether such algorithms are "optimal" in the sense that no other algorithm can lead to smaller statistical error. This paper provides a survey of various techniques used to derive information-theoretic lower bounds for estimation and learning. We focus on the settings of parameter and function estimation, community recovery, and online learning for multi-armed bandits. A common theme is that lower bounds are established by relating the statistical learning problem to a channel decoding problem, for which lower bounds may be derived involving information-theoretic quantities such as the mutual information, total variation distance, and Kullback-Leibler divergence. We close by discussing the use of information-theoretic quantities to measure independence in machine learning applications ranging from causality to medical imaging, and mention techniques for estimating these quantities efficiently in a data-driven manner.

Loh, P. L. (2017, November 1). On lower bounds for statistical learning theory. Entropy. MDPI AG. https://doi.org/10.3390/e19110617
