Learning Factored Representations for Partially Observable Markov Decision Processes
暂无分享,去创建一个
[1] Edward J. Sondik,et al. The Optimal Control of Partially Observable Markov Processes over a Finite Horizon , 1973, Oper. Res..
[2] Edward J. Sondik,et al. The Optimal Control of Partially Observable Markov Processes over the Infinite Horizon: Discounted Costs , 1978, Oper. Res..
[3] Biing-Hwang Juang,et al. Mixture autoregressive hidden Markov models for speech signals , 1985, IEEE Trans. Acoust. Speech Signal Process..
[4] L. Rabiner,et al. An introduction to hidden Markov models , 1986, IEEE ASSP Magazine.
[5] Judea Pearl,et al. Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.
[6] Keiji Kanazawa,et al. A model for reasoning about persistence and causation , 1989 .
[7] Gregory F. Cooper,et al. The Computational Complexity of Probabilistic Inference Using Bayesian Belief Networks , 1990, Artif. Intell..
[8] Radford M. Neal. Connectionist Learning of Belief Networks , 1992, Artif. Intell..
[9] Lonnie Chrisman,et al. Reinforcement Learning with Perceptual Aliasing: The Perceptual Distinctions Approach , 1992, AAAI.
[10] Leslie Pack Kaelbling,et al. Learning Policies for Partially Observable Environments: Scaling Up , 1997, ICML.
[11] Michael I. Jordan,et al. Mean Field Theory for Sigmoid Belief Networks , 1996, J. Artif. Intell. Res..
[12] Andrew McCallum,et al. Reinforcement learning with selective perception and hidden state , 1996 .
[13] Thomas G. Dietterich. What is machine learning? , 2020, Archives of Disease in Childhood.
[14] Xavier Boyen,et al. Tractable Inference for Complex Stochastic Processes , 1998, UAI.
[15] Michael I. Jordan,et al. An Introduction to Variational Methods for Graphical Models , 1999, Machine-mediated learning.
[16] Daphne Koller,et al. Computing Factored Value Functions for Policies in Structured MDPs , 1999, IJCAI.
[17] Daphne Koller,et al. Reinforcement Learning Using Approximate Belief States , 1999, NIPS.