.pf{position:relative;background-color:#fff;overflow:hidden;margin:0;border:0}.pc{position:absolute;border:0;padding:0;margin:0;top:0;left:0;width:100%;height:100%;overflow:hidden;display:block;transform-origin:0 0;-ms-transform-origin:0 0;-webkit-transform-origin:0 0}.bi{position:absolute;border:0;margin:0}.c{position:absolute;border:0;padding:0;margin:0;overflow:hidden;display:block}.t{position:absolute;white-space:pre;font-size:1px;transform-origin:0 100%;-ms-transform-origin:0 100%;-webkit-transform-origin:0 100%;unicode-bidi:bidi-override;-moz-font-feature-settings:"liga" 0}.t:after{content:''}.t:before{content:'';display:inline-block}.t span{position:relative;unicode-bidi:bidi-override}._{display:inline-block;color:transparent;z-index:-1}.pi{display:none}@media screen{.pf{margin:13px auto;box-shadow:1px 1px 3px 1px #333;border-collapse:separate}}.ff1{font-family:ff1;line-height:.893555;font-style:normal;font-weight:400;visibility:visible}.ff2{font-family:ff2;line-height:.910156;font-style:normal;font-weight:400;visibility:visible}.ff3{font-family:ff3;line-height:.871094;font-style:normal;font-weight:400;visibility:visible}.ff4{font-family:ff4;line-height:.666504;font-style:normal;font-weight:400;visibility:visible}.ff5{font-family:ff5;line-height:.861816;font-style:normal;font-weight:400;visibility:visible}.ff6{font-family:ff6;line-height:.682617;font-style:normal;font-weight:400;visibility:visible}.m0{transform:matrix(.320260,0,0,.320260,0,0);-ms-transform:matrix(.320260,0,0,.320260,0,0);-webkit-transform:matrix(.320260,0,0,.320260,0,0)}.ls7{letter-spacing:-.216203px}.ls6{letter-spacing:0}.ls4{letter-spacing:.096090px}.ls5{letter-spacing:.152143px}.ls2{letter-spacing:.200188px}.ls0{letter-spacing:.412387px}.ls3{letter-spacing:29.683883px}.ls1{letter-spacing:43.697046px}.sc0{text-shadow:-.015em 0 transparent,0 .015em transparent,.015em 0 transparent,0 -.015em transparent}@media screen and (-webkit-min-device-pixel-ratio:0){.sc0{-webkit-text-stroke:.015em transparent;text-shadow:none}}.ws0{word-spacing:-10.641996px}.ws1{word-spacing:0}._0{margin-left:-1.008418px}._1{width:1.086023px}._2{width:2.728552px}.fc1{color:transparent}.fc0{color:#000}.fs0{font-size:48.045131px}.y2c{bottom:-381.874481px}.y2b{bottom:-363.275375px}.y2a{bottom:-344.336729px}.y29{bottom:-325.4236px}.y28{bottom:-306.831033px}.y27{bottom:-287.917904px}.y26{bottom:-269.325336px}.y25{bottom:-250.373612px}.y24{bottom:-231.473562px}.y23{bottom:-212.867916px}.y22{bottom:-194.916471px}.y21{bottom:-176.323904px}.y20{bottom:-157.385258px}.y1f{bottom:-138.792691px}.y1e{bottom:-119.879562px}.y1d{bottom:-101.928118px}.y1c{bottom:-83.33555px}.y1b{bottom:-64.742983px}.y1a{bottom:-45.791258px}.y19{bottom:-27.198691px}.y18{bottom:-8.285562px}.y0{bottom:0}.y17{bottom:10.627567px}.y16{bottom:29.220134px}.y15{bottom:48.15878px}.y14{bottom:66.751347px}.y13{bottom:85.664476px}.y12{bottom:104.257044px}.y11{bottom:122.208488px}.y10{bottom:140.801055px}.yf{bottom:159.739701px}.ye{bottom:178.332268px}.yd{bottom:197.245397px}.yc{bottom:216.158526px}.yb{bottom:234.751094px}.ya{bottom:253.702818px}.y9{bottom:272.295386px}.y8{bottom:290.24683px}.y7{bottom:308.839397px}.y6{bottom:327.752526px}.y5{bottom:346.345094px}.y4{bottom:365.283739px}.y3{bottom:383.235184px}.y2{bottom:400.866066px}.y1{bottom:507.293741px}.h3{height:32.561837px}.h5{height:33.289082px}.h4{height:33.359461px}.h2{height:507.295022px}.h1{height:510.495064px}.h0{height:1014.588763px}.w1{width:783.997438px}.w0{width:784px}.x0{left:0}.x1{left:91.986025px}.x2{left:115.066454px}.x3{left:138.178938px}.x4{left:161.259495px}

CIS 140 Lecture 8: L8- reinforcement learning

Negative punishment- take away something good (time out) Conditioned stimulus (cs) -&gt; unconditioned stimulus (us)-&gt; unconditioned response (ur) Like operant conditioning, enables behavior based on prediction. Conditioned stimulus (cs) -&gt; conditioned response (cr) Ex: pavlov"s dogs- learned to associate sound with food. Often, ur and cr are same action. Ex: immune sys, pancreas- respond based on predictions from environment. Ex: advertising- babies and tires and happiness. Involved placing neutral signal before reflex (ur: focuses on involuntary, automatic behaviors, helps predict when reflex (ur) will be useful, stimulus -&gt; reaction. Operant: applying reinforcement or punishment after behavior, strengthens or weakens voluntary behaviors, helps predict which behaviors will be rewarded, behavior -&gt; consequence. Mutant flies- some better with classical, some better with operant (double association) double association checking- two measures and two manipulations. Evidence that two measures are mediated by separate processes modeling classical conditioning typical learning curve. Association strength grows more slowly as number of trials increase, and reaches an asymptote.

United States

Introduction to Cognitive Science

Computer & Information Science

brainar

University of Pennsylvania

Introduction to Computer Systems

Mathematical Foundations of Computer Science

Introduction to Computer Programming

Calculus, Part II

General Chemistry I

Atoms, Bits, Circuits and Systems

Programming Languages and Techniques I

Programming Languages and Technigues II

Automata, Computability, and Complexity

Japanese Popular Culture

Engineering Ethics

Introduction to Linguistics

Calculus, Part III

Linear Algebra

Engineering Electromagnetics

Elementary Korean I

Elementary Korean II

PSYC 101 Lecture Notes - Lecture 15: Wechsler Adult Intelligence Scale, Classical Conditioning, Reinforcement

PSY 101 Lecture Notes - Lecture 8: Operant Conditioning, Slot Machine, Reinforcement

PSYCO333 Chapter Notes - Chapter 10: Classical Conditioning, Reinforcement, Stimulus Control

CIS 140 Lecture 8: L8- reinforcement learning

Document Summary

Get access

Related Documents

PSYC 101 Lecture Notes - Lecture 15: Wechsler Adult Intelligence Scale, Classical Conditioning, Reinforcement

PSY 101 Lecture Notes - Lecture 8: Operant Conditioning, Slot Machine, Reinforcement

PSYCO333 Chapter Notes - Chapter 10: Classical Conditioning, Reinforcement, Stimulus Control