Evolutionary Search, Stochastic Policies with Memory, and Reinforcement Learning with Hidden State
暂无分享,去创建一个
[1] Lawrence J. Fogel,et al. Artificial Intelligence through Simulated Evolution , 1966 .
[2] Terrence J. Sejnowski,et al. A Learning Algorithm for Boltzmann Machines , 1985, Cognitive Sciences.
[3] Jeffrey L. Elman,et al. Finding Structure in Time , 1990, Cogn. Sci..
[4] Tom M. Mitchell,et al. Reinforcement learning with hidden states , 1993 .
[5] Astro Teller,et al. The evolution of mental models , 1994 .
[6] Michael I. Jordan,et al. Learning Without State-Estimation in Partially Observable Markovian Decision Processes , 1994, ICML.
[7] Peter J. Angeline,et al. An evolutionary algorithm that constructs recurrent neural networks , 1994, IEEE Trans. Neural Networks.
[8] J. K. Kinnear,et al. Advances in Genetic Programming , 1994 .
[9] Leslie Pack Kaelbling,et al. Learning Policies for Partially Observable Environments: Scaling Up , 1997, ICML.
[10] Andrew W. Moore,et al. Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..
[11] Andrew McCallum,et al. Learning to Use Selective Attention and Short-Term Memory in Sequential Tasks , 1996 .
[12] Andrew McCallum,et al. Reinforcement learning with selective perception and hidden state , 1996 .
[13] A. McCallum. Efficient Exploration in Reinforcement Learning with Hidden State , 1997 .
[14] Jürgen Schmidhuber,et al. Reinforcement Learning with Self-Modifying Policies , 1998, Learning to Learn.
[15] John Loch,et al. Using Eligibility Traces to Find the Best Memoryless Policy in Partially Observable Markov Decision Processes , 1998, ICML.
[16] Andrew W. Moore,et al. Gradient Descent for General Reinforcement Learning , 1998, NIPS.
[17] Katia P. Sycara,et al. Evolution of Goal-Directed Behavior from Limited Information in a Complex Environment , 1999, GECCO.
[18] John J. Grefenstette,et al. Evolutionary Algorithms for Reinforcement Learning , 1999, J. Artif. Intell. Res..