Paper information - Generalization in Reinforcement Learning and the Use of Observations-Based - 字舞流文

Generalization in Reinforcement Learning and the Use of Observations-Based

Martin A. Riedmiller | L. Lauer

[1] Kee-Eung Kim, et al. Learning Finite-State Controllers for Partially Observable Environments, 1999, UAI.

[2] Michael L. Littman, et al. Memoryless policies: theoretical limitations and practical results, 1994.

[3] Andrew W. Moore, et al. Generalization in Reinforcement Learning: Safely Approximating the Value Function, 1994, NIPS.

[4] Michael I. Jordan, et al. Learning Without State-Estimation in Partially Observable Markovian Decision Processes, 1994, ICML.

[5] Terrence J. Sejnowski, et al. A Parallel Network that Learns to Play Backgammon, 1989, Artif. Intell.