Stochastic Simultaneous Optimistic Optimization

We study the problem of global maximization of a function f given a finite number of evaluations perturbed by noise. We make a very weak assumption on the function, namely that it is locally smooth (in some precise sense) with respect to some semi-metric, around one of its global maxima. Compared to previous work on bandits in general spaces (Kleinberg et al., 2008; Bubeck et al., 2011a), our algorithm does not require knowledge of this semi-metric. Our algorithm, StoSOO, follows an optimistic strategy: it iteratively constructs upper confidence bounds over hierarchical partitions of the function domain to decide which point to sample next. A finite-time analysis of StoSOO shows that it performs almost as well as the best specifically-tuned algorithms even though the local smoothness of the function is not known.
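The optimistic strategy described in the abstract can be illustrated with a minimal sketch. This is not the authors' reference implementation; everything in it is an assumption made for illustration: the domain is fixed to [0, 1], cells are split into three equal parts, the confidence bonus is a Hoeffding-style term, and the names `stosoo_sketch`, `noisy_f`, `k`, `h_max`, and `delta` are hypothetical. At each depth of the partition the most optimistic leaf is either re-evaluated (if it has fewer than `k` evaluations) or expanded, and the center of the deepest expanded cell is returned.

```python
import math
import random


def stosoo_sketch(noisy_f, budget, k=10, h_max=6, delta=0.05):
    """Simplified StoSOO-style search for a maximizer of noisy_f on [0, 1]."""
    log_term = math.log(budget * k / delta)  # shared log factor of the bonus

    def b_value(cell):
        # Optimistic value: empirical mean plus a Hoeffding-style bonus
        # (assumed form; unsampled cells are treated as infinitely promising).
        total, n = cell[3], cell[4]
        if n == 0:
            return math.inf
        return total / n + math.sqrt(log_term / (2 * n))

    # A leaf cell is [depth, lo, hi, sum_of_values, n_evaluations].
    leaves = [[0, 0.0, 1.0, 0.0, 0]]
    best = (-1, -math.inf, 0.5)  # (depth, mean, center) of best expanded cell
    used = 0
    progressed = True
    while used < budget and progressed:
        progressed = False
        b_max = -math.inf
        for h in range(h_max + 1):
            candidates = [c for c in leaves if c[0] == h]
            if not candidates or used >= budget:
                continue
            cell = max(candidates, key=b_value)
            b = b_value(cell)
            if b < b_max:
                continue  # a shallower cell expanded in this pass looks better
            depth, lo, hi, total, n = cell
            center = (lo + hi) / 2
            if n < k:
                # Not enough evaluations yet: sample the cell center again.
                cell[3] += noisy_f(center)
                cell[4] += 1
                used += 1
            else:
                # Enough evaluations: split the cell into three children.
                leaves.remove(cell)
                third = (hi - lo) / 3
                for i in range(3):
                    child = [depth + 1, lo + i * third, lo + (i + 1) * third, 0.0, 0]
                    leaves.append(child)
                b_max = b
                best = max(best, (depth, total / n, center))
            progressed = True
    return best[2]


if __name__ == "__main__":
    rng = random.Random(0)
    # Hypothetical test function: a quadratic peaked at 0.3 plus Gaussian noise.
    estimate = stosoo_sketch(lambda x: -(x - 0.3) ** 2 + rng.gauss(0.0, 0.01), 600)
    print(estimate)
```

Note that the sketch never uses a smoothness constant or a semi-metric: the only inputs are the evaluation budget and the noisy function itself, which is the property the abstract emphasizes.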

Rémi Munos | Michal Valko | Alexandra Carpentier

[1] Jia Yuan Yu,et al. Lipschitz Bandits without the Lipschitz Constant , 2011, ALT.

[2] Rémi Munos,et al. Pure Exploration in Multi-armed Bandits Problems , 2009, ALT.

[3] Eli Upfal,et al. Multi-Armed Bandits in Metric Spaces , 2008 .

[4] A. Neumaier. Interval methods for systems of equations , 1990 .

[5] Aleksandrs Slivkins,et al. Multi-armed bandits on implicit metric spaces , 2011, NIPS.

[6] Csaba Szepesvári,et al. Bandit Based Monte-Carlo Planning , 2006, ECML.

[7] Michèle Sebag,et al. The grand challenge of computer Go , 2012, Commun. ACM.

[8] Adam D. Bull,et al. Convergence Rates of Efficient Global Optimization Algorithms , 2011, J. Mach. Learn. Res.

[9] Rémi Munos,et al. Optimistic Planning of Deterministic Systems , 2008, EWRL.

[10] Peter Auer,et al. Finite-time Analysis of the Multiarmed Bandit Problem , 2002, Machine Learning.

[11] R. B. Kearfott. Rigorous Global Search: Continuous Problems , 1996 .

[12] Csaba Szepesvári,et al. Online Optimization in X-Armed Bandits , 2008, NIPS.

[13] Rémi Munos,et al. Bandit Algorithms for Tree Search , 2007, UAI.

[14] W. J. Thron,et al. Encyclopedia of Mathematics and its Applications. , 1982 .

[15] Rémi Munos,et al. Optimistic Optimization of Deterministic Functions , 2011, NIPS.

[16] S. K. Mishra,et al. Nonconvex Optimization and Its Applications , 2008 .

[17] Y. D. Sergeyev,et al. Global Optimization with Non-Convex Constraints - Sequential and Parallel Algorithms (Nonconvex Optimization and Its Applications, Volume 45) , 2000 .

[18] Michael A. Osborne. Bayesian Gaussian processes for sequential prediction, optimisation and quadrature , 2010 .

[19] C. D. Perttunen,et al. Lipschitzian optimization without the Lipschitz constant , 1993 .

[20] J D Pinter,et al. Global Optimization in Action—Continuous and Lipschitz Optimization: Algorithms, Implementations and Applications , 2010 .

[21] Andreas Krause,et al. Information-Theoretic Regret Bounds for Gaussian Process Optimization in the Bandit Setting , 2009, IEEE Transactions on Information Theory.

[22] G. William Walster,et al. Global Optimization Using Interval Analysis: Revised and Expanded , 2007 .

[23] Csaba Szepesvári,et al. X-armed Bandits , 2022 .