The Empirical Bayes Envelope and Regret Minimization in Competitive Markov Decision Processes
暂无分享,去创建一个
[1] L. Shapley,et al. Stochastic Games* , 1953, Proceedings of the National Academy of Sciences.
[2] D. Blackwell. An analog of the minimax theorem for vector payoffs. , 1956 .
[3] J. Stoer,et al. Convexity and Optimization in Finite Dimensions I , 1970 .
[4] Pravin Varaiya,et al. Stochastic Systems: Estimation, Identification, and Adaptive Control , 1986 .
[5] A. Shwartz,et al. Guaranteed performance regions in Markovian systems with competing decision makers , 1993, IEEE Trans. Autom. Control..
[6] Martin L. Puterman,et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .
[7] Nicolò Cesa-Bianchi,et al. Gambling in a rigged casino: The adversarial multi-armed bandit problem , 1995, Proceedings of IEEE 36th Annual Foundations of Computer Science.
[8] D. Fudenberg,et al. Consistency and Cautious Fictitious Play , 1995 .
[9] Vladimir Vovk,et al. A game of prediction with expert advice , 1995, COLT '95.
[10] Andrew W. Moore,et al. Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..
[11] John N. Tsitsiklis,et al. Neuro-Dynamic Programming , 1996, Encyclopedia of Machine Learning.
[12] J. Filar,et al. Competitive Markov Decision Processes , 1996 .
[13] Dimitri P. Bertsekas,et al. Stochastic shortest path games: theory and algorithms , 1997 .
[14] S. Hart,et al. A simple adaptive procedure leading to correlated equilibrium , 2000 .
[15] O. J. Vrieze,et al. Simplifying Optimal Strategies in Stochastic Games , 1998 .
[16] Neri Merhav,et al. Universal Prediction , 1998, IEEE Trans. Inf. Theory.
[17] Prakash Narayan,et al. Reliable Communication Under Channel Uncertainty , 1998, IEEE Trans. Inf. Theory.
[18] D. Fudenberg,et al. The Theory of Learning in Games , 1998 .
[19] Y. Freund,et al. Adaptive game playing using multiplicative weights , 1999 .
[20] A. Rustichini. Minimizing Regret : The General Case , 1999 .
[21] S. Hart,et al. A General Class of Adaptive Strategies , 1999 .
[22] D. Bertsekas,et al. Stochastic Shortest Path Games , 1999 .
[23] Andreu Mas-Colell,et al. A General Class of Adaptive Strategies , 1999, J. Econ. Theory.