Solving POMDPs Using Selected Past Events
暂无分享,去创建一个
[1] Karl Johan Åström,et al. Optimal control of Markov processes with incomplete state information , 1965 .
[2] Leslie Pack Kaelbling,et al. Input Generalization in Delayed Reinforcement Learning: An Algorithm and Performance Comparisons , 1991, IJCAI.
[3] Martin L. Puterman,et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .
[4] Michael I. Jordan,et al. Learning Without State-Estimation in Partially Observable Markovian Decision Processes , 1994, ICML.
[5] M. Littman. The Witness Algorithm: Solving Partially Observable Markov Decision Processes , 1994 .
[6] Ben J. A. Kröse,et al. Learning from delayed rewards , 1995, Robotics Auton. Syst..
[7] Andrew McCallum,et al. Learning to Use Selective Attention and Short-Term Memory in Sequential Tasks , 1996 .
[8] Maja J. Matarić,et al. Learning to Use Selective Attention and Short-Term Memory in Sequential Tasks , 1996 .
[9] A. Cassandra,et al. Exact and approximate algorithms for partially observable markov decision processes , 1998 .
[10] Alain Dutech. Apprentissage d'environnement : approches cognitives et comportementales , 1999 .