Mistake bounds on the noise-free multi-armed bandit game
暂无分享,去创建一个
[1] F. R. Rosendaal,et al. Prediction , 2015, Journal of thrombosis and haemostasis : JTH.
[2] Peter Auer,et al. The Nonstochastic Multiarmed Bandit Problem , 2002, SIAM J. Comput..
[3] Atsuyoshi Nakamura,et al. Noise Free Multi-armed Bandit Game , 2015, LATA.
[4] Jean-Yves Audibert,et al. Minimax Policies for Adversarial and Stochastic Bandits. , 2009, COLT 2009.
[5] Gábor Lugosi,et al. Prediction, learning, and games , 2006 .
[6] Sébastien Bubeck,et al. Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems , 2012, Found. Trends Mach. Learn..