Incremental self-improvement for life-time multi-agent reinforcement learning
暂无分享,去创建一个
[1] Pravin Varaiya,et al. Stochastic Systems: Estimation, Identification, and Adaptive Control , 1986 .
[2] P. W. Jones,et al. Bandit Problems, Sequential Allocation of Experiments , 1987 .
[3] Jürgen Schmidhuber,et al. Reinforcement Learning in Markovian and Non-Markovian Environments , 1990, NIPS.
[4] J. Bather,et al. Multi‐Armed Bandit Allocation Indices , 1990 .
[5] Stuart J. Russell,et al. Principles of Metareasoning , 1989, Artif. Intell..
[6] Steven Douglas Whitehead,et al. Reinforcement learning for the adaptive control of perception and action , 1992 .
[7] Andrew McCallum,et al. Overcoming Incomplete Perception with Utile Distinction Memory , 1993, ICML.
[8] Mark S. Boddy,et al. Deliberation Scheduling for Problem Solving in Time-Constrained Environments , 1994, Artif. Intell..
[9] Juergen Schmidhuber,et al. On learning how to learn learning strategies , 1994 .
[10] Michael I. Jordan,et al. Reinforcement Learning Algorithm for Partially Observable Markov Decision Problems , 1994, NIPS.
[11] Mark B. Ring. Continual learning in reinforcement environments , 1995, GMD-Bericht.
[12] Corso Elvezia. Discovering Solutions with Low Kolmogorov Complexity and High Generalization Capability , 1995 .
[13] Jürgen Schmidhuber. Discovering Solutions with Low Kolmogorov Complexity and High Generalization Capability , 1995, ICML.
[14] Corso Elvezia,et al. Environment-independent Reinforcement Acceleration , 1995 .
[15] Leslie Pack Kaelbling,et al. Learning Policies for Partially Observable Environments: Scaling Up , 1997, ICML.
[16] Russell Greiner,et al. PALO: A Probabilistic Hill-Climbing Algorithm , 1996, Artif. Intell..
[17] Jürgen Schmidhuber,et al. Solving POMDPs with Levin Search and EIRA , 1996, ICML.
[18] Juergen Schmidhuber,et al. A General Method For Incremental Self-Improvement And Multi-Agent Learning In Unrestricted Environme , 1999 .
[19] Leslie Pack Kaelbling,et al. Planning and Acting in Partially Observable Stochastic Domains , 1998, Artif. Intell..
[20] C A Nelson,et al. Learning to Learn , 2017, Encyclopedia of Machine Learning and Data Mining.