[1] Edward J. Sondik, et al. The Optimal Control of Partially Observable Markov Processes over the Infinite Horizon: Discounted Costs, 1978, Oper. Res.
[2] John N. Tsitsiklis, et al. The Complexity of Markov Decision Processes, 1987, Math. Oper. Res.
[3] John N. Tsitsiklis, et al. Parallel and distributed computation, 1989.
[4] Thomas M. Cover, et al. Elements of Information Theory, 2005.
[5] Peter Norvig, et al. Artificial Intelligence: A Modern Approach, 1995.
[6] Michael L. Littman, et al. Algorithms for Sequential Decision Making, 1996.
[7] Xavier Boyen, et al. Tractable Inference for Complex Stochastic Processes, 1998, UAI.
[8] Richard S. Sutton, et al. Introduction to Reinforcement Learning, 1998.