Transfer of learning by composing solutions of elemental sequential tasks
暂无分享,去创建一个
[1] Richard O. Duda,et al. Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.
[2] Peter E. Hart,et al. Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.
[3] Richard E. Korf,et al. Macro-Operators: A Weak Method for Learning , 1985, Artif. Intell..
[4] Dimitri P. Bertsekas,et al. Dynamic Programming: Deterministic and Stochastic Models , 1987 .
[5] Richard S. Sutton,et al. Sequential Decision Problems and Neural Networks , 1989, NIPS 1989.
[6] C. Watkins. Learning from delayed rewards , 1989 .
[7] David S. Touretzky,et al. Advances in neural information processing systems 2 , 1989 .
[8] Glenn A. Iba,et al. A heuristic approach to the discovery of macro-operators , 2004, Machine Learning.
[9] Rodney A. Brooks,et al. A robot that walks; emergent behaviors from a carefully evolved network , 1989, Proceedings, 1989 International Conference on Robotics and Automation.
[10] Michael I. Jordan,et al. A Competitive Modular Connectionist Architecture , 1990, NIPS.
[11] Richard S. Sutton,et al. Integrated Architectures for Learning, Planning, and Reacting Based on Approximating Dynamic Programming , 1990, ML.
[12] Dana H. Ballard,et al. Active Perception and Reinforcement Learning , 1990, Neural Computation.
[13] Andrew G. Barto,et al. On the Computational Economics of Reinforcement Learning , 1991 .
[14] Michael I. Jordan,et al. Task Decomposition Through Competition in a Modular Connectionist Architecture: The What and Where Vision Tasks , 1990, Cogn. Sci..
[15] Geoffrey E. Hinton,et al. Adaptive Mixtures of Local Experts , 1991, Neural Computation.
[16] Sridhar Mahadevan,et al. Automatic Programming of Behavior-Based Robots Using Reinforcement Learning , 1991, Artif. Intell..
[17] Leslie Pack Kaelbling,et al. Learning in embedded systems , 1993 .
[18] Peter Dayan,et al. Q-learning , 1992, Machine Learning.
[19] Richard S. Sutton,et al. Learning to predict by the methods of temporal differences , 1988, Machine Learning.