论文信息 - A Formal Measure of Machine Intelligence

A Formal Measure of Machine Intelligence

Abstract A fundamental problem in artiﬁcial intelligence isthat nobody really knows what intelligence is. Theproblem is especially acute when we need to con-sider artiﬁcial systems which are signiﬁcantly dif-ferent to humans. In this paper we approach thisproblem in the following way: We take a numberof well known informal deﬁnitions of human intelli-gence that have been given by experts, and extracttheir essential features. These are then mathemat-ically formalised to produce a general measure ofintelligence for arbitrary machines. We believe thatthis measure formally captures the concept of ma-chine intelligence in the broadest reasonable sense. 1 Introduction Most of us think that we recogniseintelligence whenwe see it, but we are not really sure how to pre-cisely deﬁne or measure it. We informally judgethe intelligence of others by relying on our past ex-periences in dealing with people. Naturally, thisnaive approach is highly subjective and imprecise.A more principled approach would be to use oneof the many standard intelligence tests that areavailable. Contrary to popular wisdom, these tests,when correctly applied by a professional, deliverstatisticallyconsistentresults andhaveconsiderablepower to predict the future performance of individ-uals in many mentally demanding tasks. However,while these tests work well for humans, if we wishto measure the intelligence of other things, perhapsof a monkey or a new machine learning algorithm,they are clearly inappropriate.One response to this problem might be to de-velop speciﬁc kinds of tests for speciﬁc kinds of en-tities; just as intelligence tests for children diﬀerto intelligence tests for adults. While this workswell when testing humans of diﬀerent ages, it comesundone when we need to measure the intelligenceof entities which are profoundly diﬀerent to eachother in terms of their cognitive capacities, speed,senses, environments in which they operate, and soon. To measure the intelligence of such diverse sys-tems in a meaningful way we must step back fromthe speciﬁcs of particular systems and establish theunderlying fundamentals of what it is that we arereally trying to measure. That is, we need to estab-lish a notion of intelligence that goes beyond thespeciﬁcs of particular kinds of systems.The diﬃculty of doing this is readily apparent.Consider, for example, the memory and numericalcomputation tasks that appear in some intelligencetests and which were once regarded as deﬁning hall-marks of human intelligence. We now know thatthese tasks are absolutely trivial for a machine andthus do not test the machine’s intelligence. Indeedeven the mentally demanding task of playing chesshas been largely reduced to brute force search. Astechnology advances, our concept of what intelli-gence is continues to evolve with it.How then are we to develop a concept of intelli-gence that is applicable to all kinds ofsystems? Anyproposed deﬁnition must encompass the essence ofhuman intelligence, as well as other possibilities, ina consistent way. It should not be limited to anyparticular set of senses, environments or goals, norshould it be limited to any speciﬁc kind of hard-ware, such as silicon or biological neurons. It shouldbe based on principles which are suﬃciently funda-mental so as to be unlikely to alter over time. Fur-thermore, the intelligence measure should ideally beformally expressed, objective, and practically real-isable.This paper approaches this problem in the fol-lowing way. In Section 2 we consider a range of def-initions of human intelligence that have been putforward by well known psychologists. From thesewe extract the most common and essential featuresand use them to create an informal deﬁnition ofintelligence. Section 3 then introduces the frame-1

Shane Legg | Marcus Hutter | S. Legg | Marcus Hutter

[1] Ming Li,et al. An Introduction to Kolmogorov Complexity and Its Applications , 2019, Texts in Computer Science.

[2] Shane Legg,et al. A Universal Measure of Intelligence for Artificial Agents , 2005, IJCAI.

[3] Marcus Hutter,et al. Universal Artificial Intellegence - Sequential Decisions Based on Algorithmic Probability , 2005, Texts in Theoretical Computer Science. An EATCS Series.

[4] A. M. Turing,et al. Computing Machinery and Intelligence , 1950, The Philosophy of Artificial Intelligence.

[5] F. L. Wells. The Measurement and Appraisal of Adult Intelligence , 1958 .

[6] J. Pickova. The Measurement and Appraisal of Adult Intelligence , 1959 .

[7] R. Sternberg,et al. Handbook of Intelligence , 2000 .

[8] W. V. Bingham. Aptitudes and aptitude testing , 1937 .

[9] Ofi rNw8x'pyzm,et al. The Speed Prior: A New Simplicity Measure Yielding Near-Optimal Computable Predictions , 2002 .

[10] Robert J. Sternberg,et al. Dynamic Testing: The Nature and Measurement of Learning Potential , 2001 .

[11] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[12] W. Lewis Johnson. Needed: a new test of intelligence , 1992, SGAR.

[13] Matthew V. Mahoney,et al. Text Compression as a Test for Artificial Intelligence , 1999, AAAI/IAAI.

[14] José Hernández-Orallo,et al. Beyond the Turing Test , 2000, J. Log. Lang. Inf..

[15] Dr. Marcus Hutter,et al. Universal artificial intelligence , 2004 .

[16] L. Gottfredson. Mainstream science on intelligence: An editorial with 52 signatories, history, and bibliography , 1997 .