Experts in a Markov Decision Process
[1] Y. Freund, et al. Adaptive game playing using multiplicative weights, 1999.
[2] Johan Håstad, et al. Some optimal inapproximability results, 2001, JACM.
[3] Sham M. Kakade, et al. On the sample complexity of reinforcement learning, 2003.
[4] Avrim Blum, et al. Planning in the Presence of Cost Functions Controlled by an Adversary, 2003, ICML.
[5] Michael Kearns, et al. Near-Optimal Reinforcement Learning in Polynomial Time, 1998, Machine Learning.
[6] Santosh S. Vempala, et al. Efficient algorithms for online decision problems, 2005, Journal of Computer and System Sciences.