42
True and Error Models of Response Variation in Judgment and Decision Tasks Michael H. Birnbaum

True and Error Models of Response Variation in Judgment and Decision Tasks Michael H. Birnbaum

Embed Size (px)

Citation preview

Page 1: True and Error Models of Response Variation in Judgment and Decision Tasks Michael H. Birnbaum

True and Error Models of Response Variation in

Judgment and Decision Tasks

Michael H. Birnbaum

Page 2: True and Error Models of Response Variation in Judgment and Decision Tasks Michael H. Birnbaum

OverviewI review three papers that are available at my Website that involve application and evaluation of models of variability in response to choice problems.•Two papers are co-authored with Jeff Bahra on with tests of transitivity, stochastic dominance, and restricted branch independence. •Our findings consistently rule out assumptions of iid that are required in certain models, such as the approach of Regenwetter, Dana, and Davis-Stober (2011) Psych Review.•Transitivity is often satisfied, but a few show evidence of intransitive preferences.•No individual satisfied CPT or the priority heuristic.

Page 3: True and Error Models of Response Variation in Judgment and Decision Tasks Michael H. Birnbaum

Testing Algebraic Models with Error-Filled Data

• Algebraic models assume or imply formal properties such as stochastic dominance, coalescing, transitivity, gain-loss separability, etc.

• But these properties will not hold if data contain “error.”

Page 4: True and Error Models of Response Variation in Judgment and Decision Tasks Michael H. Birnbaum

Some Proposed Solutions

• Neo-Bayesian approach (Myung, Karabatsos, & Iverson.

• Cognitive process approach (Busemeyer)• “Error” Theory (“Error Story”) approach

(Thurstone, Luce) combined with algebraic models.

• Random preference model: choices independent; no errors: variability due to iid sampling from mixture. Loomes & Sugden.

Page 5: True and Error Models of Response Variation in Judgment and Decision Tasks Michael H. Birnbaum

Variations of Error Models

• Thurstone, Luce: errors related to separation between subjective values. Case V: SST (scalability).

• Harless & Camerer: errors assumed to be equal for certain choices.

• Sopher & Gigliotti: Allow each choice to have a different rate of error, assumed transitivity.

• Birnbaum proposed using repetitions within block as estimates of error rates. Birnbaum & Gutierrez, 2007; Birnbaum & Schmidt, 2008.

Page 6: True and Error Models of Response Variation in Judgment and Decision Tasks Michael H. Birnbaum

Basic Assumptions of TE model (2 errors model)

• Each choice in an experiment has a true choice probability, p, and an error rate, e.

• The error rate is estimated from (and is the “reason” given for) inconsistency of response to the same choice by same person over repetitions

Page 7: True and Error Models of Response Variation in Judgment and Decision Tasks Michael H. Birnbaum

One Choice, Two Repetitions

A B

A

B

pe2 (1 p)(1 e)2 p(1 e)e (1 p)(1 e)e

p(1 e)e (1 p)(1 e)e

p(1 e)2 (1 p)e2

Page 8: True and Error Models of Response Variation in Judgment and Decision Tasks Michael H. Birnbaum

Choices are not Independent

• In this model, choices are not independent, in general.

• If there is a mixture of true preferences, there will be violations of independence.

• This contrasts with the assumption of iid used by Regenwetter and colleagues.

Page 9: True and Error Models of Response Variation in Judgment and Decision Tasks Michael H. Birnbaum

Solution for e

• The proportion of preference reversals between repetitions allows an estimate of e.

• Both off-diagonal entries should be equal, and are equal to:

(1 e)e

Page 10: True and Error Models of Response Variation in Judgment and Decision Tasks Michael H. Birnbaum

Estimating eProbability of Reversals in Repeated Choice

0

0.1

0.2

0.3

0.4

0.5

0 0.1 0.2 0.3 0.4 0.5

Error Rate (e)

Page 11: True and Error Models of Response Variation in Judgment and Decision Tasks Michael H. Birnbaum

Estimating p

Observed = P(1 - e)(1 - e)+(1 - P)ee

0.00

0.20

0.40

0.60

0.80

1.00

0.00 0.20 0.40 0.60 0.80 1.00

True Choice Probabiity, P

Error Rate = 0Error Rate = .02Error Rate = .04Error Rate = .06Error Rate = .08Error Rate = .10Error Rate = .12Error Rate = .14Error Rate = .16Error Rate = .18Error Rate = .20Error Rate = .22Error Rate = .24Error Rate = .26Error Rate = .28Error Rate = .30Error Rate = .32Error Rate = .34Error Rate = .36Error Rate = .38Error Rate = .40Error Rate = .42Error Rate = .44Error Rate = .46Error Rate = .48Error Rate = .50

Page 12: True and Error Models of Response Variation in Judgment and Decision Tasks Michael H. Birnbaum

Testing if p = 0

Test if P = 0

0

0.1

0.2

0 0.1 0.2 0.3 0.4 0.5

Probability of Reversals 2e(1 - e)

Page 13: True and Error Models of Response Variation in Judgment and Decision Tasks Michael H. Birnbaum

True and Error Model to Individuals

• When applied to individuals, it is assumed that each person has a “true” set of preferences within a trial block.

• True preferences might differ between blocks, if the person has a mixture. If so, violates independence.

• A mixture could arise if a person’s parameters change in response to experience.

Page 14: True and Error Models of Response Variation in Judgment and Decision Tasks Michael H. Birnbaum

By Testing Individuals…• we can tailor the experiment (or

devise a fish net) to “catch” violations that otherwise slip through a static study of a property.

• we can see if people “learn” from internal feedback and change their behavior.

• we can test if a model that holds for one property can predict the results of other tests.

Page 15: True and Error Models of Response Variation in Judgment and Decision Tasks Michael H. Birnbaum

Recent Studies with Jeffrey Bahra

• Tested participants with many replications.

• Tests of transitivity• Basic tests of critical properties,

including stochastic dominance, coalescing, LCI, UCI, and others.

• Today: RBI and SD (+ transitivity)

Page 16: True and Error Models of Response Variation in Judgment and Decision Tasks Michael H. Birnbaum

Stochastic Dominance

A: 10 tickets to win $10 5 tickets to win $90 85 tickets to win $98

B: 5 tickets to win $10 5 tickets to win $12 90 tickets to win $99Each person received two such problems in

each repetition block of 107 choice problems. Blocks were separated by at least 50 unrelated choice problems.

Page 17: True and Error Models of Response Variation in Judgment and Decision Tasks Michael H. Birnbaum

SD is a critical property

• This test of stochastic dominance lies outside the probability simplex on three branch gambles.

• CPT with ANY strictly monotonic utility function and decumulative weighting function must satisfy stochastic dominance in this choice.

• We don’t need to estimate any parameters or assume any particular functions to refute CPT.

Page 18: True and Error Models of Response Variation in Judgment and Decision Tasks Michael H. Birnbaum

Non-nested Models

Page 19: True and Error Models of Response Variation in Judgment and Decision Tasks Michael H. Birnbaum

Testing CPT

• Coalescing• Stochastic

Dominance• Lower Cum.

Independence• Upper

Cumulative Independence

• Upper Tail Independence

• Gain-Loss Separability

TAX:Violations of:

Page 20: True and Error Models of Response Variation in Judgment and Decision Tasks Michael H. Birnbaum

Testing TAX Model

• 4-Distribution Independence (RS’)

• 3-Lower Distribution Independence

• 3-2 Lower Distribution Independence

• 3-Upper Distribution Independence (RS’)

• Res. Branch Indep (RS’)

CPT: Violations of:

Page 21: True and Error Models of Response Variation in Judgment and Decision Tasks Michael H. Birnbaum

Critical Tests: LS Models

• CPT and TAX satisfy transitivity• LS Models violate transitivity (includes PH)• LS Models satisfy priority dominance,

integrative independence, and interactive independence. This properties systematically violated. See my JMP 2010 article.

• PH satisfies SD in these tests and it also violates RBI in the same way as CPT.

Page 22: True and Error Models of Response Variation in Judgment and Decision Tasks Michael H. Birnbaum

Restricted Branch Independence

• Weaker version of Savage’s “sure thing” ax.

• 3 equally likely events: slips in urn.• (x, y, z) := prizes x, y, or z, x < y < z

• RBI: (x, y, z) f (x', y', z)

(x, y, z') f (x', y', z')

Page 23: True and Error Models of Response Variation in Judgment and Decision Tasks Michael H. Birnbaum

TAX, CPT, PH Violate RBI

• 0 < z < x' < x < y < y' < z'• (x, y) is “Safe”, S• (x', y') is “Risky”, R• (z, x, y) f (z, x', y') wLu(z) + wMu(x) + wHu(y) >

wLu(z) + wMu(x') + wHu(y')

Page 24: True and Error Models of Response Variation in Judgment and Decision Tasks Michael H. Birnbaum

TAX: SR' Violations

wLwM

< u(y') - u(y)u(x) - u(x')

< wMwH

SR': S = (x, y, z) R = (x', y', z) AND

S' = (x, y, z') R' = (x', y', z')

Page 25: True and Error Models of Response Variation in Judgment and Decision Tasks Michael H. Birnbaum

S = (z, x, y) vs R = (z, 5, 95)

Page 26: True and Error Models of Response Variation in Judgment and Decision Tasks Michael H. Birnbaum

RBI distinguishes models of RDM

•EU and SWU (Edwards, 54) imply RBI•Original prospect theory implies RBI•Cumulative Prospect theory violates RBI•CPT with inverse-S weighting function implies RS’ pattern of violations• TAX model violates RBI with SR’ pattern

Page 27: True and Error Models of Response Variation in Judgment and Decision Tasks Michael H. Birnbaum
Page 28: True and Error Models of Response Variation in Judgment and Decision Tasks Michael H. Birnbaum
Page 29: True and Error Models of Response Variation in Judgment and Decision Tasks Michael H. Birnbaum

Testing iid Assumptions

Each person’s data: rows represent trial blocks and the columns represent choice problems. Smith & Batchelder (2008) technique: random permutations within columns for each person. We then calculate two statistics on original data and on 10,000 simulations of the data. Variance of preference reversals and correlation between mean preference reversals and difference between repetitions.

Page 30: True and Error Models of Response Variation in Judgment and Decision Tasks Michael H. Birnbaum

A Test of Independence & Stationarity

• Within each subject, calculate the number of reversals between each pair of repetitions. 107 choices.

• 20 reps, so 20*19/2 = 190 pairs of reps.• Correlate the average no. reversals with

the difference in reps. For 59 participants, mean r = 0.69. Only 6 were negative. Similar results for Regenwetter et al data.

• Second test: Variance of no. reversals.

Page 31: True and Error Models of Response Variation in Judgment and Decision Tasks Michael H. Birnbaum

Tests of Independence

Page 32: True and Error Models of Response Variation in Judgment and Decision Tasks Michael H. Birnbaum

Transitivity Findings• Results were surprisingly transitive.• True and Error Model Fit data fairly well.• Data violate independence of choices.• People differ in true preferences and

people differ in “noise” levels.• No one satisfied the priority heuristic• One person satisfied linked viols of

transitivity consistent with LS models.

Page 33: True and Error Models of Response Variation in Judgment and Decision Tasks Michael H. Birnbaum

Very Few Intransitive Cases

• No one showed pattern predicted by PH in all three designs.

• Of the 59*3 = 177 Matrices, perhaps 4 show credible evidence of intransitivity.

• This change of procedure did not produce the higher rates of intransitivity conjectured.

Page 34: True and Error Models of Response Variation in Judgment and Decision Tasks Michael H. Birnbaum

Allais CC Paradox (JMP’04)• Choose Between: A = ($40, 0.2; $2, 0.8) B = ($98, 0.1; $2, 0.9)• Choose Between: C = ($98, 0.8; $40, 0.2) D = ($98, 0.9; $2, 0.1)• Many Choose B > A and C > D

Page 35: True and Error Models of Response Variation in Judgment and Decision Tasks Michael H. Birnbaum

Analysis of the Paradox: ($40, .2; $2, .8) f ($98, .1; $2, .9)

(Coalescing)

($40, .1;$40, .1;$2, .8) f ($98, .1;$2,.1;$2, .8)

(RBI)

($40, .1;$40, .1;$98, .8) f ($98, .1;$2,.1;$98, .8)

(Coalescing)

($98, .8; $40, 0.2) f ($98, .9; $2, .1)

Page 36: True and Error Models of Response Variation in Judgment and Decision Tasks Michael H. Birnbaum

Four Theories Compared

RBI holds (**cancellation)

RBI fails

Coalescing holds (*combination)

EU, CPT**, OPT*

CPTInverse-S => RS’

Coalescing fails

OPT TAX, RAMSR’

Page 37: True and Error Models of Response Variation in Judgment and Decision Tasks Michael H. Birnbaum

Error Models & EU

• One error model: P(SR’) = P(RS’)• Two error model allows P(RS’)>P(SR’)• Two error model: P(R’) > 1/2 implies P(R) > 1/2• Four error model allows above• All models imply P(SR) = P(RS);

P(SS,SR’)=P(SR’,SS), etc.

Page 38: True and Error Models of Response Variation in Judgment and Decision Tasks Michael H. Birnbaum

Results (refute 4 error EU)

SS’ SR’ RS’ RR’

SS’ 36 7 21 3

SR’ 5 4 1 3

RS’ 18 4 69 11

RR’ 1 2 12 14

Page 39: True and Error Models of Response Variation in Judgment and Decision Tasks Michael H. Birnbaum

Results (split form RBI)

SS’ SR’ RS’ RR’

SS’ 42 24 4 1

SR’ 17 21 2 16

RS’ 7 4 14 7

RR’ 1 6 6 39

Page 40: True and Error Models of Response Variation in Judgment and Decision Tasks Michael H. Birnbaum

Summary

• Most people conform to transitivity• A very small number do not, but not

consistently in all three designs as predicted by LS models.

• Data appear to violate the assumptions of random utility model; in particular, they show evidence against the assumptions of i.i.d.

Page 41: True and Error Models of Response Variation in Judgment and Decision Tasks Michael H. Birnbaum

Summary (cont’d)

• No one satisfied CPT, except those consistent with EU

• No one satisfied PH• People appear to change between

blocks. We suspect that parameters change systematically during the study.

• Suspect TE model oversimplified and incomplete.

Page 42: True and Error Models of Response Variation in Judgment and Decision Tasks Michael H. Birnbaum

Available at my Website

• Tests of LS models (JMP 2010)• Psych Review Comment RP Model (2011)• Reanalysis of Regenwetter et al data

testing iid, in JDM.• Birnbaum & Bahra: Testing transitivity in

Linked Designs (submitted)• Birnbaum & Bahra: Testing SD and RBI

(submitted)• Soon: Birnbaum & Schmidt Allais

paradoxes with 4 errors model