A Formal Measure of Machine Intelligence

Abstract A fundamental problem in artificial intelligence isthat nobody really knows what intelligence is. Theproblem is especially acute when we need to con-sider artificial systems which are significantly dif-ferent to humans. In this paper we approach thisproblem in the following way: We take a numberof well known informal definitions of human intelli-gence that have been given by experts, and extracttheir essential features. These are then mathemat-ically formalised to produce a general measure ofintelligence for arbitrary machines. We believe thatthis measure formally captures the concept of ma-chine intelligence in the broadest reasonable sense. 1 Introduction Most of us think that we recogniseintelligence whenwe see it, but we are not really sure how to pre-cisely define or measure it. We informally judgethe intelligence of others by relying on our past ex-periences in dealing with people. Naturally, thisnaive approach is highly subjective and imprecise.A more principled approach would be to use oneof the many standard intelligence tests that areavailable. Contrary to popular wisdom, these tests,when correctly applied by a professional, deliverstatisticallyconsistentresults andhaveconsiderablepower to predict the future performance of individ-uals in many mentally demanding tasks. However,while these tests work well for humans, if we wishto measure the intelligence of other things, perhapsof a monkey or a new machine learning algorithm,they are clearly inappropriate.One response to this problem might be to de-velop specific kinds of tests for specific kinds of en-tities; just as intelligence tests for children differto intelligence tests for adults. While this workswell when testing humans of different ages, it comesundone when we need to measure the intelligenceof entities which are profoundly different to eachother in terms of their cognitive capacities, speed,senses, environments in which they operate, and soon. To measure the intelligence of such diverse sys-tems in a meaningful way we must step back fromthe specifics of particular systems and establish theunderlying fundamentals of what it is that we arereally trying to measure. That is, we need to estab-lish a notion of intelligence that goes beyond thespecifics of particular kinds of systems.The difficulty of doing this is readily apparent.Consider, for example, the memory and numericalcomputation tasks that appear in some intelligencetests and which were once regarded as defining hall-marks of human intelligence. We now know thatthese tasks are absolutely trivial for a machine andthus do not test the machine’s intelligence. Indeedeven the mentally demanding task of playing chesshas been largely reduced to brute force search. Astechnology advances, our concept of what intelli-gence is continues to evolve with it.How then are we to develop a concept of intelli-gence that is applicable to all kinds ofsystems? Anyproposed definition must encompass the essence ofhuman intelligence, as well as other possibilities, ina consistent way. It should not be limited to anyparticular set of senses, environments or goals, norshould it be limited to any specific kind of hard-ware, such as silicon or biological neurons. It shouldbe based on principles which are sufficiently funda-mental so as to be unlikely to alter over time. Fur-thermore, the intelligence measure should ideally beformally expressed, objective, and practically real-isable.This paper approaches this problem in the fol-lowing way. In Section 2 we consider a range of def-initions of human intelligence that have been putforward by well known psychologists. From thesewe extract the most common and essential featuresand use them to create an informal definition ofintelligence. Section 3 then introduces the frame-1

[1]  Ming Li,et al.  An Introduction to Kolmogorov Complexity and Its Applications , 2019, Texts in Computer Science.

[2]  Shane Legg,et al.  A Universal Measure of Intelligence for Artificial Agents , 2005, IJCAI.

[3]  Marcus Hutter,et al.  Universal Artificial Intellegence - Sequential Decisions Based on Algorithmic Probability , 2005, Texts in Theoretical Computer Science. An EATCS Series.

[4]  A. M. Turing,et al.  Computing Machinery and Intelligence , 1950, The Philosophy of Artificial Intelligence.

[5]  F. L. Wells The Measurement and Appraisal of Adult Intelligence , 1958 .

[6]  J. Pickova The Measurement and Appraisal of Adult Intelligence , 1959 .

[7]  R. Sternberg,et al.  Handbook of Intelligence , 2000 .

[8]  W. V. Bingham Aptitudes and aptitude testing , 1937 .

[9]  Ofi rNw8x'pyzm,et al.  The Speed Prior: A New Simplicity Measure Yielding Near-Optimal Computable Predictions , 2002 .

[10]  Robert J. Sternberg,et al.  Dynamic Testing: The Nature and Measurement of Learning Potential , 2001 .

[11]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[12]  W. Lewis Johnson Needed: a new test of intelligence , 1992, SGAR.

[13]  Matthew V. Mahoney,et al.  Text Compression as a Test for Artificial Intelligence , 1999, AAAI/IAAI.

[14]  José Hernández-Orallo,et al.  Beyond the Turing Test , 2000, J. Log. Lang. Inf..

[15]  Dr. Marcus Hutter,et al.  Universal artificial intelligence , 2004 .

[16]  L. Gottfredson Mainstream science on intelligence: An editorial with 52 signatories, history, and bibliography , 1997 .