ON THE OPTIMALITY OF SAMPLE-BASED ESTIMATES OF THE EXPECTATION OF THE EMPIRICAL MINIMIZER

We study sample-based estimates of the expectation of the function produced by the empirical minimization algorithm, and investigate the extent to which the rate of convergence of the empirical minimizer can be estimated in a data-dependent manner. We establish three main results. First, we provide an algorithm that upper bounds the expectation of the empirical minimizer in a completely data-dependent manner. The bound is based on a structural result due to Bartlett and Mendelson that relates expectations to sample averages. Second, we show that these structural upper bounds can be significantly loose. In particular, we exhibit a class for which the expectation of the empirical minimizer decreases as O(1/n) with the sample size n, while the upper bound based on structural properties is Ω(1). Third, we show that this looseness of the bound is inevitable: we give an example showing that a sharp bound cannot, in general, be recovered from empirical data.

[1] P. Massart and É. Nédélec. Risk bounds for statistical learning, 2007, arXiv:math/0702683.

[2] M. Rudelson and R. Vershynin. Combinatorics of random processes and sections of convex bodies, 2004, arXiv:math/0404192.

[3] P. Massart. Some applications of concentration inequalities to statistics, 2000.

[4] S. Mendelson. A Few Notes on Statistical Learning Theory, 2002, Machine Learning Summer School.

[5] G. Blanchard, G. Lugosi and N. Vayatis. On the Rate of Convergence of Regularized Boosting Classifiers, 2003, J. Mach. Learn. Res.

[6] O. Bousquet. Concentration Inequalities and Empirical Processes Theory Applied to the Analysis of Learning Algorithms, 2002.

[7] S. van de Geer. A New Approach to Least-Squares Estimation, with Applications, 1986.

[8] M. Ledoux. The Concentration of Measure Phenomenon, 2001.

[9] M. Talagrand. New concentration inequalities in product spaces, 1996.

[10] B. Tarigan and S. van de Geer. Adaptivity of Support Vector Machines with ℓ1 Penalty, 2004.

[11] V. Koltchinskii. Local Rademacher complexities and oracle inequalities in risk minimization, 2006, arXiv:0708.0083.

[12] G. Lugosi and N. Vayatis. On the Bayes-risk consistency of regularized boosting methods, 2003.

[13] A. W. van der Vaart and J. A. Wellner. Weak Convergence and Empirical Processes: With Applications to Statistics, 1996.

[14] T. Klein. Une inégalité de concentration à gauche pour les processus empiriques [A left-sided concentration inequality for empirical processes], 2002.

[15] E. Rio. Inégalités de concentration pour les processus empiriques de classes de parties [Concentration inequalities for empirical processes over classes of sets], 2001.

[16] S. van de Geer. Empirical Processes in M-Estimation, 2000.

[17] P. L. Bartlett, O. Bousquet and S. Mendelson. Local Rademacher complexities, 2005, arXiv:math/0508275.

[18] S. Mendelson. Improving the sample complexity using global data, 2002, IEEE Trans. Inf. Theory.

[19] S. Boucheron, G. Lugosi and P. Massart. Concentration inequalities using the entropy method, 2002.

[20] R. M. Dudley. Uniform Central Limit Theorems, 1999.

[21] V. N. Vapnik and A. Ya. Chervonenkis. On the uniform convergence of relative frequencies of events to their probabilities, 1971.

[22] P. L. Bartlett and S. Mendelson. Empirical minimization, 2006.

[23] D. Haussler. Sphere Packing Numbers for Subsets of the Boolean n-Cube with Bounded Vapnik-Chervonenkis Dimension, 1995, J. Comb. Theory, Ser. A.

[24] G. Lugosi and M. Wegkamp. Complexity regularization via localized random penalties, 2004, arXiv:math/0410091.

[25] P. L. Bartlett, M. I. Jordan and J. D. McAuliffe. Convexity, Classification, and Risk Bounds, 2006.

[26] A. B. Tsybakov. Optimal aggregation of classifiers in statistical learning, 2003.

[27] M. Talagrand. Sharper Bounds for Gaussian and Empirical Processes, 1994.

[28] W. S. Lee, P. L. Bartlett and R. C. Williamson. The Importance of Convexity in Learning with Squared Loss, 1998, IEEE Trans. Inf. Theory.

[29] V. Koltchinskii. Rejoinder: Local Rademacher complexities and oracle inequalities in risk minimization, 2006, arXiv:0708.0135.

[30] V. Koltchinskii and D. Panchenko. Rademacher Processes and Bounding the Risk of Function Learning, 2004, arXiv:math/0405338.

[31] P. L. Bartlett and M. H. Wegkamp. Classification with a Reject Option using a Hinge Loss, 2008, J. Mach. Learn. Res.

[32] P. Massart. About the constants in Talagrand's concentration inequalities for empirical processes, 2000.