Paper Information - Oracle-Based Robust Optimization via Online Learning

Robust optimization is a common framework for optimization under uncertainty, in which the problem parameters are unknown but are known to belong to a given uncertainty set. In this framework, a min-max problem is solved, wherein a solution is evaluated by its performance on the worst possible realization of the parameters. In many cases, a straightforward solution to a robust optimization problem of a certain type requires solving an optimization problem of a more complicated type, which may even be NP-hard. For example, solving a robust conic quadratic program, such as those arising in a robust support vector machine (SVM) with an ellipsoidal uncertainty set, leads in general to a semidefinite program. In this paper, we develop a method for approximately solving a robust optimization problem using tools from online convex optimization, where at every stage a standard (nonrobust) optimization program is solved. Our algorithms find an approximate robust solution usin...
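The idea described in the abstract, an online learner choosing the uncertain parameters while a standard (nonrobust) solver is called at every stage, can be illustrated with a small sketch. This is not the paper's algorithm: the toy problem (a linear cost over the probability simplex with an additive perturbation `u` confined to a Euclidean ball), the step size, and all function names are assumptions chosen for illustration. The adversary runs projected online gradient ascent on `u`, the nominal oracle solves the problem for the current `u`, and the averaged oracle outputs serve as the approximate robust solution.

```python
import math


def nominal_oracle(cost):
    """Solve the nonrobust problem: minimize cost . x over the probability simplex.

    A linear objective over the simplex is minimized at a vertex, so the
    oracle returns the indicator vector of the cheapest coordinate.
    """
    best = min(range(len(cost)), key=lambda i: cost[i])
    return [1.0 if i == best else 0.0 for i in range(len(cost))]


def project_to_ball(u, radius):
    """Euclidean projection of u onto the ball of the given radius."""
    norm = math.sqrt(sum(v * v for v in u))
    if norm <= radius:
        return list(u)
    return [v * radius / norm for v in u]


def robust_via_online_learning(c, radius, rounds):
    """Approximate min over x of max over ||u|| <= radius of (c + u) . x."""
    n = len(c)
    u = [0.0] * n                      # adversary's current perturbation
    x_sum = [0.0] * n                  # running sum of oracle solutions
    eta = radius / math.sqrt(rounds)   # assumed step size for this toy problem
    for _ in range(rounds):
        # Stage 1: solve the nominal problem for the current perturbation.
        x = nominal_oracle([ci + ui for ci, ui in zip(c, u)])
        x_sum = [s + xi for s, xi in zip(x_sum, x)]
        # Stage 2: the gradient of (c + u) . x with respect to u is x,
        # so the adversary takes a projected gradient ascent step.
        u = project_to_ball([ui + eta * xi for ui, xi in zip(u, x)], radius)
    # The average of the oracle solutions is the candidate robust solution.
    return [s / rounds for s in x_sum]


def worst_case_cost(c, radius, x):
    """Closed form of max over ||u|| <= radius of (c + u) . x."""
    nominal = sum(ci * xi for ci, xi in zip(c, x))
    return nominal + radius * math.sqrt(sum(xi * xi for xi in x))


c = [1.0, 1.0]
radius = 0.5
x_avg = robust_via_online_learning(c, radius, rounds=2000)
print(x_avg, worst_case_cost(c, radius, x_avg))
```

In this symmetric toy instance the robust optimum splits the weight evenly, with worst-case cost `1 + radius / sqrt(2)`, so the averaged solution can be checked against a known answer. Each individual oracle call returns a vertex of the simplex; only the average over rounds approaches the robust solution.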
