CS486 Lecture Notes - Lecture 9: Markov Decision Process, Discounting


Document Summary

The Markov decision process (10.29/31.18): a complete Markov decision process represents a sequential decision problem with the following qualities. The states, actions, transition model, and reward function are defined, and all transitions are Markovian (the future is independent of the past given the present). The horizon is either finite (a fixed amount of time left) or infinite (no end time or deadline); a finite horizon makes the problem non-stationary and harder to model. The utility of a sequence of states can be calculated with additive rewards, U(s0, s1, s2, ...) = R(s0) + R(s1) + R(s2) + ..., or with discounted rewards, U(s0, s1, s2, ...) = R(s0) + γR(s1) + γ²R(s2) + ..., where the discount factor γ < 1 reflects the chance that tomorrow may not come. With an infinite sequence of states the total additive reward can be infinite, whereas the total discounted reward is finite. Given U(s), the optimal policy is determined by U(s) = R(s) + γ max_a Σ_{s'} P(s' | s, a) U(s'), choosing in each state the action that maximizes the expected utility of the next state; the first term is the immediate reward of reaching state s. A small sketch of these two ideas follows.
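The Python sketch below is not from the lecture notes; it is a minimal illustration of the two formulas above on a hypothetical two-state MDP (the states, transition probabilities, utilities, and γ = 0.9 are all made-up values). discounted_utility computes U(s0, s1, ...) with a discount factor, and greedy_policy extracts a policy from given utilities U(s) by one-step lookahead.

    GAMMA = 0.9  # assumed discount factor for illustration

    def discounted_utility(rewards, gamma=GAMMA):
        # U(s0, s1, s2, ...) = R(s0) + gamma*R(s1) + gamma^2*R(s2) + ...
        return sum((gamma ** t) * r for t, r in enumerate(rewards))

    def greedy_policy(states, actions, T, U):
        # pi(s) = argmax_a sum_{s'} P(s' | s, a) * U(s')
        policy = {}
        for s in states:
            best_action, best_value = None, float("-inf")
            for a in actions:
                value = sum(p * U[s2] for s2, p in T[(s, a)].items())
                if value > best_value:
                    best_action, best_value = a, value
            policy[s] = best_action
        return policy

    # Hypothetical two-state MDP used only to exercise the functions above.
    states = ["A", "B"]
    actions = ["stay", "move"]
    T = {
        ("A", "stay"): {"A": 1.0},
        ("A", "move"): {"B": 0.8, "A": 0.2},
        ("B", "stay"): {"B": 1.0},
        ("B", "move"): {"A": 0.8, "B": 0.2},
    }
    U = {"A": 1.0, "B": 5.0}  # assumed utilities, e.g. from value iteration

    print(discounted_utility([0, 0, 1]))            # 0.81
    print(greedy_policy(states, actions, T, U))     # {'A': 'move', 'B': 'stay'}

Note that the discount factor cancels out of the argmax, so policy extraction only needs the transition model and the utilities; γ matters when computing U itself.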
