Percentile optimization in uncertain Markov decision processes with application to efficient exploration
[1] L. Ghaoui, et al. Robust Markov decision processes with uncertain transition matrices, 2004.
[2] Michael L. Littman, et al. A theoretical analysis of Model-Based Interval Estimation, 2005, ICML.
[3] J. Figueira, et al. Stochastic Programming, 1998, J. Oper. Res. Soc.
[4] Michael Kearns, et al. Near-Optimal Reinforcement Learning in Polynomial Time, 2002, Machine Learning.
[5] Martin L. Puterman, et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming, 1994.
[6] Alexander Shapiro, et al. Convex Approximations of Chance Constrained Programs, 2006, SIAM J. Optim.
[7] Laurent El Ghaoui, et al. Robust Control of Markov Decision Processes with Uncertain Transition Matrices, 2005, Oper. Res.
[8] D. Krass, et al. Percentile performance criteria for limiting average Markov decision processes, 1995, IEEE Trans. Autom. Control.
[9] Garud Iyengar, et al. Robust Dynamic Programming, 2005, Math. Oper. Res.
[10] John N. Tsitsiklis, et al. Bias and variance in value function estimation, 2004, ICML.
[11] Ronald A. Howard, et al. Information Value Theory, 1966, IEEE Trans. Syst. Sci. Cybern.
[12] E. Silver. Markovian Decision Processes with Uncertain Transition Probabilities or Rewards, 1963.
[13] Arkadi Nemirovski, et al. Robust Convex Optimization, 1998, Math. Oper. Res.
[14] David Andre, et al. Model based Bayesian Exploration, 1999, UAI.
[15] G. Calafiore, et al. On Distributionally Robust Chance-Constrained Linear Programs, 2006.
[16] Jerry Nedelman, et al. Book review: “Bayesian Data Analysis,” Second Edition, by A. Gelman, J.B. Carlin, H.S. Stern, and D.B. Rubin, Chapman & Hall/CRC, 2004, 2005, Comput. Stat.
[17] Ronen I. Brafman, et al. R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning, 2001, J. Mach. Learn. Res.
[18] Robert Givan, et al. Bounded-parameter Markov decision processes, 2000, Artif. Intell.
[19] John N. Tsitsiklis, et al. Bias and Variance Approximation in Value Function Estimates, 2007, Manag. Sci.