Exploiting correlation and budget constraints in Bayesian multi-armed bandit optimization
暂无分享,去创建一个
[1] Rémi Munos,et al. Pure Exploration in Multi-armed Bandits Problems , 2009, ALT.
[2] Nando de Freitas,et al. Portfolio Allocation for Bayesian Optimization , 2010, UAI.
[3] Aurélien Garivier,et al. On Bayesian Upper Confidence Bounds for Bandit Problems , 2012, AISTATS.
[4] Nando de Freitas,et al. Adaptive MCMC with Bayesian Optimization , 2012, AISTATS.
[5] Nando de Freitas,et al. Active Policy Learning for Robot Planning and Exploration under Uncertainty , 2007, Robotics: Science and Systems.
[6] Nando de Freitas,et al. Bayesian optimization in high dimensions via random embeddings , 2013, IJCAI 2013.
[7] Nando de Freitas,et al. A Tutorial on Bayesian Optimization of Expensive Cost Functions, with Application to Active User Modeling and Hierarchical Reinforcement Learning , 2010, ArXiv.
[8] Nando de Freitas,et al. Self-Avoiding Random Dynamics on Integer Complex Systems , 2011, TOMC.
[9] Andrew W. Moore,et al. Hoeffding Races: Accelerating Model Selection Search for Classification and Function Approximation , 1993, NIPS.
[10] Eric Walter,et al. An informational approach to the global optimization of expensive-to-evaluate functions , 2006, J. Glob. Optim..
[11] Alan Fern,et al. Budgeted Optimization with Concurrent Stochastic-Duration Experiments , 2011, NIPS.
[12] Sylvain Arlot,et al. A survey of cross-validation procedures for model selection , 2009, 0907.4728.
[13] Rémi Munos,et al. Stochastic Simultaneous Optimistic Optimization , 2013, ICML.
[14] H. Robbins. Some aspects of the sequential design of experiments , 1952 .
[15] Alexander J. Smola,et al. Exponential Regret Bounds for Gaussian Process Bandits with Deterministic Observations , 2012, ICML.
[16] Shipra Agrawal,et al. Thompson Sampling for Contextual Bandits with Linear Payoffs , 2012, ICML.
[17] Kevin P. Murphy,et al. Machine learning - a probabilistic perspective , 2012, Adaptive computation and machine learning series.
[18] J. Mockus,et al. The Bayesian approach to global optimization , 1989 .
[19] Ron Kohavi,et al. Controlled experiments on the web: survey and practical guide , 2009, Data Mining and Knowledge Discovery.
[20] Kevin Leyton-Brown,et al. Auto-WEKA: combined selection and hyperparameter optimization of classification algorithms , 2012, KDD.
[21] Thomas P. Hayes,et al. Stochastic Linear Optimization under Bandit Feedback , 2008, COLT.
[22] Andreas Krause,et al. Information-Theoretic Regret Bounds for Gaussian Process Optimization in the Bandit Setting , 2009, IEEE Transactions on Information Theory.
[23] Yoshua Bengio,et al. Algorithms for Hyper-Parameter Optimization , 2011, NIPS.
[24] Rémi Munos,et al. Optimistic Optimization of Deterministic Functions , 2011, NIPS 2011.
[25] Alessandro Lazaric,et al. Best Arm Identification: A Unified Approach to Fixed Budget and Fixed Confidence , 2012, NIPS.
[26] Gábor Lugosi,et al. Prediction, learning, and games , 2006 .
[27] J. Bather,et al. Multi‐Armed Bandit Allocation Indices , 1990 .
[28] Donald R. Jones,et al. A Taxonomy of Global Optimization Methods Based on Response Surfaces , 2001, J. Glob. Optim..
[29] Nando de Freitas,et al. Adaptive Hamiltonian and Riemann manifold Monte Carlo samplers , 2013, ICML 2013.
[30] Jasper Snoek,et al. Practical Bayesian Optimization of Machine Learning Algorithms , 2012, NIPS.
[31] Lihong Li,et al. An Empirical Evaluation of Thompson Sampling , 2011, NIPS.
[32] W. R. Thompson. ON THE LIKELIHOOD THAT ONE UNKNOWN PROBABILITY EXCEEDS ANOTHER IN VIEW OF THE EVIDENCE OF TWO SAMPLES , 1933 .
[33] Alessandro Lazaric,et al. Multi-Bandit Best Arm Identification , 2011, NIPS.
[34] Steven L. Scott,et al. A modern Bayesian look at the multi-armed bandit , 2010 .
[35] Ryan P. Adams,et al. Opportunity Cost in Bayesian Optimization , 2011 .
[36] Peter Auer,et al. Finite-time Analysis of the Multiarmed Bandit Problem , 2002, Machine Learning.
[37] D. Lizotte,et al. An experimental methodology for response surface optimization methods , 2012, J. Glob. Optim..
[38] Benjamin Van Roy,et al. Learning to Optimize via Posterior Sampling , 2013, Math. Oper. Res..
[39] Philipp Hennig,et al. for Information-Ecie nt Global Optimization , 2012 .
[40] Kevin Leyton-Brown,et al. Sequential Model-Based Optimization for General Algorithm Configuration , 2011, LION.
[41] Rémi Munos,et al. Thompson Sampling: An Asymptotically Optimal Finite-Time Analysis , 2012, ALT.
[42] P. Burman. A comparative study of ordinary cross-validation, v-fold cross-validation and the repeated learning-testing methods , 1989 .
[43] Nando de Freitas,et al. Active Preference Learning with Discrete Choice Data , 2007, NIPS.
[44] Nando de Freitas,et al. A Bayesian interactive optimization approach to procedural animation design , 2010, SCA '10.