Let ℱ be a class of measurable functions f : S → [0, 1] defined on a probability space (5, A, P). Given a sample (X 1,..., X n) of i.i.d. random variables taking values in 5 with common distribution P, let P n denote the empirical measure based on (X 1,..., X n). We study an empirical risk minimization problem P nf → min, f ∈ ℱ. Given a solution f̂ n of this problem, the goal is to obtain very general upper bounds on its excess risk ε P(f̂ n):= P f̂ n - inf f ∈ℱ P f, expressed in terms of relevant geometric parameters of the class ℱ. Using concentration inequalities and other empirical processes tools, we obtain both distribution-dependent and data-dependent upper bounds on the excess risk that are of asymptotically correct order in many examples. The bounds involve localized sup-norms of empirical and Rademacher processes indexed by functions from the class. We use these bounds to develop model selection techniques in abstract risk minimization problems that can be applied to more specialized frameworks of regression and classification. © Institute of Mathematical Statistics, 2006.
CITATION STYLE
Koltchinskii, V. (2006). Local rademacher complexities and oracle inequalities in risk minimization. Annals of Statistics, 34(6), 2593–2656. https://doi.org/10.1214/009053606000001019
Mendeley helps you to discover research relevant for your work.