您的当前位置：首页 A Formal Definition of Intelligence for Artificial Systems

A Formal Definition of Intelligence for Artificial Systems

来源：画鸵萌宠网

AFormalDeﬁnitionofIntelligenceforArtiﬁcialSystems

ShaneLeggandMarcusHutter

IDSIA,Galleria2,Manno-Lugano6928,Switzerland

{shane,marcus}@idsia.ch

Afundamentaldifﬁcultyinartiﬁcialintelligenceisthatnobodyreallyknowswhatintelli-genceis,especiallyforsystemswithsenses,environments,motivationsandcognitivecapacitieswhichareverydifferenttoourown.Inourworkwetakeamainstreaminformalperspectiveonintelligenceandformaliseandgeneralisethisusingthereinforcementlearningframeworkandal-gorithmiccomplexitytheory.Theresultingformaldeﬁnitionofintelligencehasmanyinterestingpropertiesandhasreceivedattentioninboththeacademic[4,5]andpopularpress[2,1].

Althoughthereisnostrictconsensusamongexpertsoverthedeﬁnitionofintelligenceforhu-mans,mostdeﬁnitionssharemanykeyfeatures.Inallcases,intelligenceisapropertyofanentity,whichwewillcalltheagent,thatinteractswithanexternalproblemorsituation,whichwewillcalltheenvironment.Anagent’sintelligenceistypicallyrelatedtoitsabilitytosucceedwithre-specttooneormoreobjectives,whichwewillcallthegoal.Theemphasisonlearning,adaptationandﬂexibilitycommontomanydeﬁnitionsimpliesthattheenvironmentisnotfullyknowntotheagent.Thustrueintelligencerequirestheabilitytodealwithawiderangeofpossibilities,notjustafewspeciﬁcsituations.Puttingthesethingstogethergivesusourinformaldeﬁnition:Intelligencemeasuresanagent’sgeneralabilitytoachievegoalsinawiderangeofenvironments.Weareconﬁdentthatthisdeﬁnitioncapturestheessenceofmanycommonperspectivesonintelligence.Italsodescribeswhatwewouldliketoachieveinmachines:Averygeneralcapacitytoadaptandperformwellinawiderangeofsituations.

Toformalisethiswecombinetheextremelyﬂexiblereinforcementlearningframeworkwithalgorithmiccomplexitytheory.Inreinforcementlearningtheagentsendsitsactionstotheenvi-ronmentandreceivesobservationsandrewardsback.Theagenttriestomaximisetheamountofrewarditreceivesbylearningaboutthestructureoftheenvironmentandthegoalsitneedstoac-complishinordertoreceiverewards.Todenotesymbolsbeingsentwewillusethelowercasevari-ablenameso,randaforobservations,rewardsandactionsrespectively.Theprocessofinteractionproducesanincreasinghistoryofobservations,rewardsandactions,o1r1a1o2r2a2o3r3a3o4....Theagentissimplyafunction,denotedbyπ,whichisaprobabilitymeasureoveractionscon-ditionedonthecurrenthistory,forexample,π(a3|o1r1a1o2r2).Howtheagentgeneratesthisdistributionoveractionsisleftcompletelyopen,forexample,agentsarenotrequiredtobeTuringcomputable.

Theenvironment,denotedµ,issimilarlydeﬁned:∀k∈Ntheprobabilityofokrk,giventhecurrenthistoryisµ(okrk|o1r1a1o2r2a2...ok−1rk−1ak−1).Aswedesireanextremelygeneraldeﬁnitionofintelligenceforarbitrarysystems,ourspaceofenvironmentsshouldbeaslargeaspossible.Anobviouschoiceisthespaceofallprobabilitymeasures,howeverthiscausesseriousproblemsaswecannotevendescribesomeofthesemeasuresinaﬁniteway.Thesolutionistorequirethemeasurestobecomputable.Thisallowsforaninﬁnitespaceofpossibleenvironmentswithnoboundontheircomplexity.Italsopermitsenvironmentswhicharenon-deterministicasitisonlytheirprobabilitydistributionswhichneedtobecomputable.Additionallyweboundthe󰀂∞πtotalrewardtobe1toensurethatthefuturevalueVµ:=Ei=1riisﬁnite.Thisspace,denotedE,appearstobethelargestusefulspaceofenvironments.

Wewanttocomputethegeneralperformanceofanagentinunknownenvironments.Asthereareaninﬁnitenumberofenvironments,wecannotsimplytakeanexpectedvaluewithrespecttoauniformdistribution—wemustweightsomeenvironmentsmoreheavilythanothers.Ifweconsidertheagent’sperspectiveontheproblem,itisthesameasasking:Givenseveraldifferenthypotheseswhichareconsistentwiththeobservations,whichhypothesisshouldbeconsideredthemostlikely?ThisisafundamentalproblemininductiveinferenceforwhichthestandardsolutionistoinvokeOccam’srazor:Givenmultiplehypotheseswhichareconsistentwiththedata,the

simplestshouldbepreferred.Asthisisgenerallyconsideredthemostintelligentthingtodo,weshouldtestagentsinsuchawaythattheyare,atleastonaverage,rewardedforcorrectlyapplyingOccam’srazor.Thismeansthatouraprioridistributionoverenvironmentsshouldbeweightedtowardssimplerenvironments.

Aseachenvironmentisdescribedbyacomputablemeasure,wecanmeasurethecomplexityoftheseinthestandardwaybyconsideringtheirKolmogorovcomplexity.Speciﬁcally,ifUisapreﬁxuniversalTuringmachinethentheKolmogorovcomplexityofanenvironmentµisthelengthoftheshortestprogramonUthatcomputesµ,formallyK(µ):=minp{l(p):U(p)=µ}.Wecannowdeﬁnetheuniversalintelligenceofanagentπtosimplybeitsexpectedperformance,

󰀁

Υ(π):=2−K(µ)Vµ.

µ∈E

Itisclearbyconstructionthatuniversalintelligencemeasuresthegeneralabilityofanagent

toperformwellinaverywiderangeofenvironments,asrequiredbyourinformaldeﬁnitionofintelligencegivenearlier.Thedeﬁnitionplacesnorestrictionsontheinternalworkingsoftheagent;itonlyrequiresthattheagentiscapableofgeneratingoutputandreceivinginputwhichincludesarewardsignal.UniversalintelligencealsoreﬂectsOccam’srazorinanaturalway;likestandardintelligencetestsforhumanswhichdeﬁnethecorrectanswertoaquestiontobethesimplestconsistentwiththegiveninformation.

foranumberofbasicenvironments,suchassmallMDPs,andagentsByconsideringVµ

withsimplebutverygeneraloptimisationstrategies,itisclearthatΥcorrectlyorderstherelativeintelligenceoftheseagentsinanaturalway.Ifweconsiderahighlyspecialisedagent,forexampleIBM’sDeepBluechesssupercomputer,thenwecanseethatthisagentwillbeineffectiveoutsideofoneveryspeciﬁcenvironment,andthuswouldhavealowuniversalintelligencevalue.Thisisconsistentwithourviewofintelligenceasbeingahighlyadaptableandgeneralability.

AveryhighvalueofΥwouldimplythatanagentisabletoperformwellinmanyenviron-ments.Suchamachinewouldobviouslybeoflargepracticalsigniﬁcance.ThemaximalagentwithrespecttoΥisthetheoreticalAIXIagentwhichhasbeenshowntohavemanystrongoptimalityproperties,includingbeingself-optimisinginallenvironmentsinwhichthisisatallpossibleforageneralagent[3].Suchresultsconﬁrmthefactthatagentswithhighuniversalintelligenceareverypowerfulandadaptable.

UniversalintelligencespanssimpleadaptiveagentsrightuptosuperintelligentagentslikeAIXI,unlikethepass-failTuringtestwhichisusefulonlyforagentswithnearhumanintelligence.Furthermore,theTuringtestcannotbefullyformalisedasitisbasedonsubjectivejudgements.PerhapsanevenbiggerproblemisthattheTuringtestishighlyanthropocentric,indeedmanyhavesuggestedthatitisreallyatestofhumannessratherthanintelligence.Universalintelligencedoesnothavetheseproblemsasitisformallyspeciﬁedintermsofthemorefundamentalconceptofcomplexity.

References[1]C.Fi´evet.Mesurerl’intelligenced’unemachine.InLeMondedel’intelligence,volume1,

pages42–45,Paris,November2005.Mondeopublishing.

[2]D.Graham-Rowe.Spottingthebotswithbrains.InNewScientistmagazine,volume2512,

page27,13August2005.

[3]M.Hutter.UniversalArtiﬁcialIntelligence:SequentialDecisionsbasedonAlgorithmicProb-ability.Springer,Berlin,2004.300pages,http://www.idsia.ch/∼marcus/ai/uaibook.htm.[4]S.LeggandM.Hutter.Auniversalmeasureofintelligenceforartiﬁcialagents.InProc.21st

InternationalJointConf.onArtiﬁcialIntelligence(IJCAI-2005),Edinburgh,2005.

[5]S.LeggandM.Hutter.Aformalmeasureofmachineintelligence.InProc.Annualmachine

learningconferenceofBelgiumandTheNetherlands(Benelearn-2006),Ghent,2006.

因篇幅问题不能全部显示，请点此查看更多更全内容

查看全文