37
Instrumental Conditioning II

Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward

Embed Size (px)

Citation preview

Page 1: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward

Instrumental Conditioning II

Page 2: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward

Delay of Reinforcement

StartDelayChoice

Correct

Incorrect

Grice (1948)

Goal

Reward or No Reward

Page 3: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward

Grice (1948) Results

2030405060708090

10025 100

175

250

325

400

475

550

625

700

Trials

Per

cen

t C

orr

ect

0s

5s

2s1.2s

0.5s

10s

Page 4: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward

Overcoming the effects of delay

• Secondary reinforcers

• “Marking” procedure

Page 5: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward

Lieberman, McIntosh & Thomas (1979)

Page 6: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward

Reinforcement Punishment

Positive contingency

Negative contingency

Chocolate Bar Electric Shock

Excused from Chores

No TV privileges

Effect on Rate Behavior

Page 7: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward
Page 8: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward
Page 9: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward
Page 10: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward

Professor Drew

Page 11: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward

Anticipatory Contrast - Crespi (1942)

00.5

11.5

22.5

33.5

44.5

2 4 6 8 10 12 14 16 18 20 2 4 6 8

Trials

Run

ning

Spe

ed (f

t/se

c)

256-16 Pellets16-16 Pellets1 - 16 Pellets

Rats run down maze to find food pellets in goal arm.

Page 12: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward

What is a reinforcer?

Operational Definition (behaviorists): That which increases the probability of the response that preceded it.

Thorndike: A stimulus that produces a “satisfying state of affairs”

Page 13: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward

Drive Reduction Theory

Amt of H2O in body

Compare with Set Point

Seek water/ don’t seek water

drives

Page 14: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward

Drive Reduction Considered: Are reinforcers necessary for survival?

– Eating to excess

– Drugs of Abuse

– “Pleasure centers” of the brain

Page 15: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward
Page 16: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward

Behavioral Regulation View: The Premack Principle

• Behaviors are reinforcing, not stimuli

• To predict what will be reinforcing, observe the baseline frequency of different behaviors

• Highly probable behaviors will reinforce less probable behaviors

Page 17: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward

Premack Revised: The Response Deprivation Hypothesis

• Low frequency behaviors can reinforce high frequency behaviors (and vice versa)

• All behaviors have a preferred frequency = the behavioral bliss point

• Deprivation below that frequency is aversive, and organisms will work to remedy this

Timberlake & Allison (1974)

Page 18: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward

Response deprivation hypothesis

.25 .5 .75

The ice cream scale (in pints)

1.0 1.25 1.5 1.75 2.0 2.25 2.5

Bliss point

(1.0 pints/night)

Will work to avoid ice creamWill work to obtain

Page 19: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward

Contiguity versus Contingency in operant conditioning

Page 20: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward

Degraded Contingency Effect

= bar press = food

Perfect contingency

Strong Responding

Degraded contingency

Weak Responding

Page 21: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward

G.V. Thomas (1983)

Contiguity pitted against contingency

“Free” reinforcers given every 20s

Lever press advances delivery of pellet, but cancels pellet for next 20-s interval

So if you press at second 2, you get a pellet immediately, but you get no pellet during seconds 3-20 and 21-40.

20s 40s 60s

Page 22: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward

G.V. Thomas (1983)

Contiguity pitted against contingency

So if you press at second 2, you get a pellet immediately, but you get no pellet during seconds 3-20 and 21-40.

20s 40s 60s

Lever press here

Lose this pellet

Page 23: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward

“Superstitious Behavior”

• Suggested that temporal contiguity more important than contingency

• 15-s FT, no response requirement

• “adventitious reinforcement”

“In 6 out of 8 cases the resulting responses were so clearly defined that two observers could agree perfectly in counting instances. One bird was conditioned to turn counter-clockwise about the cage, making 2 or 3 turns between reinforcements. Another repeatedly thrust its head into one of the upper corners of the cage….”

Page 24: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward

Orienting toward feeder

Pecking near feeder

Moving along wall

¼ turn

Page 25: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward

“Misbehavior” and the limits of operant conditioning

Page 26: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward

Limits of Operant Conditioning

• Some behaviors can’t be conditioned– Yawning– Scratching

• Belongingness– Presentation of a female won’t reinforce biting

• “Misbehavior”

Page 27: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward

Marian Breland Bailey – How to train a chicken

Page 28: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward

The famous dancing chicken

Page 29: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward

What is learned in operant conditioning?

Page 30: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward

S R

What is learned?

Edwin Guthrie: mere contiguity of a stimulus and a behavior stamps in that S-R; reinforcement is not necessary

Page 31: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward

S R

What is learned?

Thorndike:Reinforcement “stamps in” this connection

Page 32: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward

S R

O

What is learned?

?

Page 33: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward

S R O

2-Process Theory

operant

Pavlovian

Page 34: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward

S R

CR

2-Process Theory

operant

Pavlovian

Page 35: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward

Evidence for 2-process theoryPavlovian-Instrumental Transfer

Phase 1 Phase 2 Test

LeverFood LightFood Light: #Presses?No Light: #Presses?

# Presses

Light No CS

The presence of the CS intensifies operant responding

Page 36: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward

S R O?

?

What is learned?

Does the Pavlovian S-O association activate a vague emotional state or a specific mental representation of the outcome?

Page 37: Instrumental Conditioning II. Delay of Reinforcement Start DelayChoice Correct Incorrect Grice (1948) Goal Reward or No Reward

Specific Outcome RepresentationsTrapold

Phase 1 Phase 2 Test

(operant) (classical)

R LeverPellet TonePellet Tone:Left? Right?

L LeverSucrose LightSucrose Light:Left? Right?

# Presses

Light Noise

Left

Right