A Taxonomy for Abstract Environments.

AbstractA wide range of problems can be abstractly expressed in terms of two systems inter-acting over time with the second system providing feedback to the flrst on its successin achieving some goal. The agent{environment model in reinforment learning and thecontroller{plant model from control theory are both examples of this framework. Problemswhich can be expressed in terms of this framework include statistical sampling processes,prediction problems, classiflcation problems, decision problems, games and function opti-misation problems. While these problem classes are very common and strong relationshipsbetween them are apparent, they are not usually considered in toto . Here we formallydeflne many of these common problem classes as environments using the chronological sys-tem formalism. From this we establish the elementary relationships between the problemclasses giving a taxonomy of abstract environments. 1 Agent-Environment Model 1.1 Introduction The agent-environment model is very simple and yet surprisingly general. It consists of twoentities called the

[1]  John N. Tsitsiklis,et al.  Neuro-Dynamic Programming , 1996, Encyclopedia of Machine Learning.

[2]  Marcus Hutter Optimal Sequential Decisions based on Algorithmic Probability , 2003, ArXiv.

[3]  Ming Li,et al.  An Introduction to Kolmogorov Complexity and Its Applications , 1997, Texts in Computer Science.

[4]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[5]  J. Doob Stochastic processes , 1953 .

[6]  Cristian S. Calude Information and Randomness , 1994, Monographs in Theoretical Computer Science An EATCS Series.

[7]  Sean R Eddy,et al.  What is dynamic programming? , 2004, Nature Biotechnology.

[8]  Marcus Hutter,et al.  Towards a Universal Theory of Artificial Intelligence Based on Algorithmic Probability and Sequential Decisions , 2000, ECML.

[9]  Leslie Pack Kaelbling,et al.  Planning and Acting in Partially Observable Stochastic Domains , 1998, Artif. Intell..