Oracle inequalities for computationally budgeted model selection

erieure Abstract We analyze general model selection procedures using penalized empirical loss minimiza- tion under computational constraints. While classical model selection approaches do not consider computational aspects of performing model selection, we argue that any practical model selection procedure must not only trade o estimation and approximation error, but also the eects of the computational eort required to compute empirical minimizers for dierent function classes. We provide a framework for analyzing such problems, and we give algorithms for model selection under a computational budget. These algorithms satisfy oracle inequalities that show that the risk of the selected model is not much worse than if we had devoted all of our computational budget to the best function class.

[1]  C. L. Mallows Some comments on C_p , 1973 .

[2]  P. Massart,et al.  Risk bounds for model selection via penalization , 1999 .

[3]  P. Massart,et al.  Concentration inequalities and model selection , 2007 .

[4]  Peter L. Bartlett,et al.  Rademacher and Gaussian Complexities: Risk Bounds and Structural Results , 2003, J. Mach. Learn. Res..

[5]  J. Rissanen A UNIVERSAL PRIOR FOR INTEGERS AND ESTIMATION BY MINIMUM DESCRIPTION LENGTH , 1983 .

[6]  Peter Auer,et al.  Finite-time Analysis of the Multiarmed Bandit Problem , 2002, Machine Learning.

[7]  R. Dudley Central Limit Theorems for Empirical Measures , 1978 .

[8]  Peter L. Bartlett,et al.  Model Selection and Error Estimation , 2000, Machine Learning.

[9]  T. L. Lai Andherbertrobbins Asymptotically Efficient Adaptive Allocation Rules , 2022 .

[10]  J. Lamperti ON CONVERGENCE OF STOCHASTIC PROCESSES , 1962 .

[11]  H. Akaike A new look at the statistical model identification , 1974 .

[12]  C. Mallows More comments on C p , 1995 .

[13]  Andrew R. Barron,et al.  Complexity Regularization with Application to Artificial Neural Networks , 1991 .

[14]  Alexander Shapiro,et al.  Stochastic Approximation approach to Stochastic Programming , 2013 .

[15]  Gábor Lugosi,et al.  Prediction, learning, and games , 2006 .

[16]  G. Lugosi,et al.  Complexity regularization via localized random penalties , 2004, math/0410091.

[17]  Peter Auer,et al.  The Nonstochastic Multiarmed Bandit Problem , 2002, SIAM J. Comput..

[18]  S. Geman,et al.  Nonparametric Maximum Likelihood Estimation by the Method of Sieves , 1982 .

[19]  R. Dudley The Sizes of Compact Subsets of Hilbert Space and Continuity of Gaussian Processes , 1967 .