STUART RUSSELL COMPUTER SCIENCE DIVISION UC...

RATIONALITY AND INTELLIGENCE

STUART RUSSELL

COMPUTER SCIENCE DIVISION

UC BERKELEY

Joint work with Eric Wefald, DevikaSubramanian, Shlomo Zilberstein, OtharHansson, Andrew Mayer, Gary Ogasawara,Tim Huang, Ron Parr, Keiji Kanazawa,Daphne Koller, Jonathan Tash, Peter Norvig,and Jeff Forbes.

Includes ideas by Eric Horvitz, Michael Fehling,Jack Breese, Michael Bratman, Tom Dean,Martha Pollack, and others.

Outline1. Constructive definitions of Intelligence

2. Some silly old definitions

3. A silly new definition

Three kinds of AIModelling human cognition

“Look! My model of humans is accurate!”

Building useful artifacts“Look! PBTS made a small fortune!”

Creating Intelligence“Look! My system is Intelligent!!”“No it isn’t!” “Yes it is!” etc.

Why constructive de�nitions?Avoid silly arguments, G & T.Need a formal relationship between

input/structure/output and Intelligencewhile avoiding overly narrow definitions thatlead to sterile and irrelevant research!

Constructive de�nitions . . .

Suppose a definition Int is proposed

“Look! My system is Int!”Is the claim interesting?Is the claim sometimes true?What research do we do on Int?

Candidates for Int

And the candidates for Best Formal Definitionof Intelligence are as follows:} Int1: Perfect rationality} Int2: Calculative rationality} Int3: Metalevel rationality} Int4: Bounded optimality

Agents and environmentsBUY SELL BUYBUY SELL

Agents perceive O and act A in environment EAn agent function f : O�! A

specifies an act for any percept sequence

Global measure V(f, E) evaluates f in E

Int1 = perfect rationalityAgent fopt is perfectly rational:

fopt = argmaxf V(f, E)i.e., the best possible behaviour

“Look! My system is perfectly rational!”Very interesting claimVERY seldom possibleResearch relates global measure tolocal constraints, e.g., maximizing utility

Machines and programsBUY SELL(hold) (hold) (hold)

Agent is a machine M running a program pThis defines an agent function f = Agent(p, M)

Int2 = calculative rationalityp is calculatively rational if Agent(p, M) = fopt

when M is infinitely fasti.e., p eventually computes the best action

“Look! My system is calculatively rational!”Useless in real-time* worldsQuite often trueResearch on calculative tools, e.g.logical planners, influence diagrams

The calculative toolboxThe toolbox is almost empty!!Need tools for

learning, modelling,deciding, compiling

in environments that are(non)deterministic,(partially) observable,discrete/continuous, static/dynamic

ComplexityCalculative rationality describes“in principle” capability

NP/PSPACE-completeness) trade off decisionquality for computation

Int3: metalevel rationalityAgent(p, M) is metalevelly rational if it controls

its computations optimally

“Look! My system is metalevelly rational!”Very interesting claimVERY seldom possibleResearch on rational metareasoning

Rational metareasoningDo the Right Thinking:} Computations are actions} Cost=time Benefit=better decisions} Value� benefit minus cost

General agent program:Repeat until no computation has value > 0:

Do the best computationDo the current best action

Anytime algorithmsDecision quality that improves over time

quality

benefit

costRational metareasoning applies triviallyAnytime tools!

Fine-grained metareasoningExplicit model of effects of computations

? ? ? ?

) selection as well as termination

Compiled into efficient formulafor value of computation

Applications in search, games, MDPs showimprovement over standard algorithms

Algorithms in AIMetareasoning replaces clever algorithms!

ALGORITHMS

Int4: bounded optimalityAgent(popt, M) is bounded-optimal iff

popt = argmaxpV(Agent(p, M), E)i.e., the best program given M.

Look! My system is bounded-optimal!Very interesting claimAlways possibleResearch on all sorts of things

Nonlocal constraints!Translates into nonlocal constraints on action) Optimize over programs, not actions

Similar conclusions reached in other fields:Economics: Herb Simon and othersGame theory: Prisoners’ Dilemma

Robert Aumann, Wed. 10.30 a.m.Philosophy: Dennett’s Moral First-Aid ManualPolitics: Toffler’s* Creating a New Civilization

Example: Sorting mailreject

mail sortcamera

Probability

E: Letters arrive at random timesM: Runs one or more neural networks

popt is a sequence of networkscomputable from arrival distribution

Asymptotic bounded optimalityStrict bounded optimality is too fragile

p is asymptotically bounded-optimal (ABO) iff9k V(Agent(p, kM), E) � V(Agent(popt, M), E)I.e., speeding up M by k compensates

for p’s inefficiency

Worst-case ABO and average-case ABOgeneralize classical complexity

Complex real-time systemsLet pi be ABO for a fixed deadline at t = 2i�

p0 ppp 321

Sequence is ABO for any deadline distributionAs good as knowing the deadline in advance!

Complex systems contd.Use the doubling construction to buildcomposite anytime systems

Fixed deadline) allocation to components is easy) “compiler” for complex systems

Metalevel reinforcement learningObject-level reinforcement learning:

learn long-term rewards for actions fromshort-term rewards

Metalevel reinforcement learning:learn long-term rewards for computations

Criterion for “valid” update rules:convergence to bounded optimality

What next?} Prove convergence to bounded optimalitywithin fixed software architectures} Prove dominance between architectures} Develop a “grammar” of AI architectures} Learning and bounded optimality U.C. BERKELEY

Bounded optimal solutionsWOW !!

Conclusions} Computational limitationsBrains cause minds} Tools in, algorithms out (eventually)} Bounded optimality:Fits intuitive idea of IntelligenceA bridge between theory and practice} Crisis: LAP–FOPLBMLDTHTNPOPMEA

STUART RUSSELL COMPUTER SCIENCE DIVISION UC...

Documents

Transcript of STUART RUSSELL COMPUTER SCIENCE DIVISION UC...

Les Barricades Mysterieuses - David Russell

Indigenous Cultural Rights and Identity Politics in Canada 1...2019/08/04 · 92 Volume 18, Issue 1, 2013 Indigenous Cultural Rights and Identity Politics in Canada came to be known

Bakhshðlı - Freemisraim3.free.fr/divers2/histoire_des_nombres.pdf8 D. FRAWLEY, Gods, Sages and Kings. Vedic secrets of ancient civilization, Motilal Banarsidass Publishers, Delhi,

Le culot de Russell Banks - infocom.ulg.ac.be · Le Monde des livres Le culot de Russell Banks Supplément Vendredi 23 mars 2012 - 68e année - N°2o8g3 - 1,50 € - France métropolitaine

Christopher Hookway* RUSSELL ET LA POSSIBILITÉ DU ...

Le nom propre et la nomination : Russell et Gardiner avec ... · UNIVERSITÉ DE PARIS VIII, ST. - DENIS Département de psychanalyse LE NOM PROPRE ET LA NOMINATION: RUSSELL ET GARDINER

Polluting Politics

Zizek Lacan Marx Psychcology and Politics

RUSSell Hobbs Prenez I'É coté HAPPY DAYS Jouez et Gagnez ...imagesdescriptif.auchan.fr/docs/doc92796.pdfPour profiter de cette offre spéciale et recevoir votre 2ème produit Russell

WALTER et Lao RUSSELL - AbundantHope

Les Jack Russell Terriers. Definition Jack Russell veut dire un petit chien ou un chien qui travail.

Fair Politics - Cercle de coopérationcercle.lu/wp-content/uploads/2020/02/FAIR_POLITICS_2017... · 2020. 2. 25. · Baromètre 2017 « Fair Politics – Pour une meilleure cohérence

Catalogo Russell FR

Loeillet Russell

Kurt Russell - data.bnf.fr

D Russell Arr. Loeillet Suite

Etude Russell Reynolds sur les conseil d'administration

Is Economics Desanchanting Politics Pre-Revolution ...

Inventions for many voices: poetry, music and politics in ...

James Smalls - The Colonial Politics of Ethnographic Sculpture, 1850-1880