Paper information - Fast reachability analysis for uncertain SSPs


Stochastic Shortest Path problems (SSPs) can be solved efficiently with the Real-Time Dynamic Programming algorithm (RTDP). Yet RTDP requires that a goal state always be reachable. This can be checked easily for a certain SSP, but requires a more complex algorithm for an uncertain SSP, i.e. one where only an interval of possible values is known for each transition probability. This paper gives a simplified description of these two procedures and demonstrates how the time-consuming uncertain analysis can be sped up dramatically. The main improvement still needed is a symbolic analysis, which would avoid a complete state-space enumeration.
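The reachability check for a certain SSP can be illustrated with a backward graph search from the goal states. The sketch below is a minimal illustration, not the paper's algorithm: the function names, the dictionary layout (`state -> action -> {next_state: probability}`), and the interval layout (`next_state: (lower, upper)`) are all assumptions made for this example. The second helper is a deliberately conservative simplification for the uncertain case: it keeps only transitions whose lower bound is strictly positive, which is a sufficient condition for the transition to exist in every possible model but is not the more complex analysis the abstract refers to.

```python
from collections import deque


def states_reaching_goal(transitions, goals):
    """Return the set of states from which some goal state is reachable.

    transitions: dict mapping state -> action -> {next_state: probability}
    (hypothetical layout chosen for this sketch).
    """
    # Build the reverse graph: s_next -> {s} whenever some action
    # moves s to s_next with strictly positive probability.
    predecessors = {}
    for s, actions in transitions.items():
        for dist in actions.values():
            for s_next, p in dist.items():
                if p > 0:
                    predecessors.setdefault(s_next, set()).add(s)

    # Breadth-first search backwards from the goal states.
    reached = set(goals)
    queue = deque(goals)
    while queue:
        s = queue.popleft()
        for pred in predecessors.get(s, ()):
            if pred not in reached:
                reached.add(pred)
                queue.append(pred)
    return reached


def lower_bound_model(interval_transitions):
    """Conservative simplification for an uncertain SSP (an assumption of
    this sketch): keep only transitions whose lower bound is > 0."""
    return {
        s: {
            a: {s2: lo for s2, (lo, hi) in dist.items() if lo > 0}
            for a, dist in actions.items()
        }
        for s, actions in interval_transitions.items()
    }
```

A state missing from the returned set cannot reach a goal under any action, so the RTDP precondition fails there. Note that the search enumerates the whole state space, which is the cost a symbolic analysis would aim to avoid.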

Olivier Buffet
