49
Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis Conclusions Predicting Human Behavior In Games James Wright March 18, 2014 1

Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

  • Upload
    lamdan

  • View
    213

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis Conclusions

Predicting Human Behavior In Games

James Wright

March 18, 2014

1

Page 2: Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis Conclusions

Behavioral Game Theory

• Many of game theory’s recommendations are verycounter-intuitive.

• Do people actually follow them?

• No. A large body of experiments demonstrates otherwise.

• Behavioral game theory: Aims to model actual humanbehavior in games.

2

Page 3: Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis Conclusions

Behavioral Game Theory

• Many of game theory’s recommendations are verycounter-intuitive.

• Do people actually follow them?

• No. A large body of experiments demonstrates otherwise.

• Behavioral game theory: Aims to model actual humanbehavior in games.

2

Page 4: Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis Conclusions

Fun Game: Traveler’s Dilemma

..

2

.

3

.

4

.

96

.

97

.

98

.

99

.

100

....

. . .

....

. 100.

100

. 96 + 2 = 98.

96− 2 = 94

.

100

. 100.

99− 2 = 97

. 99 + 2 = 101.

98 + 2 = 100

. 98− 2 = 96.

2

.2

• Two players pick a number (2-100) simultaneously.

• If they pick the same number, that is their payoff.

• If they pick different numbers:• Lower player gets lower number, plus bonus of 2.• Higher player gets lower number, minus penalty of 2.

3

Page 5: Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis Conclusions

Fun Game: Traveler’s Dilemma

..

2

.

3

.

4

.

96

.

97

.

98

.

99

.

100

....

. . .

..... 100.

100

. 96 + 2 = 98.

96− 2 = 94

.

100

. 100.

99− 2 = 97

. 99 + 2 = 101.

98 + 2 = 100

. 98− 2 = 96.

2

.2

• Two players pick a number (2-100) simultaneously.

• If they pick the same number, that is their payoff.

• If they pick different numbers:• Lower player gets lower number, plus bonus of 2.• Higher player gets lower number, minus penalty of 2.

3

Page 6: Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis Conclusions

Fun Game: Traveler’s Dilemma

..

2

.

3

.

4

.

96

.

97

.

98

.

99

.

100

....

. . .

....

. 100.

100

. 96 + 2 = 98.

96− 2 = 94

.

100

. 100.

99− 2 = 97

. 99 + 2 = 101.

98 + 2 = 100

. 98− 2 = 96.

2

.2

• Two players pick a number (2-100) simultaneously.

• If they pick the same number, that is their payoff.

• If they pick different numbers:• Lower player gets lower number, plus bonus of 2.• Higher player gets lower number, minus penalty of 2.

3

Page 7: Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis Conclusions

Fun Game: Traveler’s Dilemma

..

2

.

3

.

4

.

96

.

97

.

98

.

99

.

100

....

. . .

....

. 100.

100

. 96 + 2 = 98.

96− 2 = 94

.

100

. 100.

99− 2 = 97

. 99 + 2 = 101.

98 + 2 = 100

. 98− 2 = 96.

2

.2

• Two players pick a number (2-100) simultaneously.

• If they pick the same number, that is their payoff.

• If they pick different numbers:• Lower player gets lower number, plus bonus of 2.• Higher player gets lower number, minus penalty of 2.

• Give this game a try. Play any opponent only once.

3

Page 8: Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis Conclusions

Fun Game: Traveler’s Dilemma

..

2

.

3

.

4

.

96

.

97

.

98

.

99

.

100

....

. . .

....

. 100.

100

. 96 + 2 = 98.

96− 2 = 94

.

100

. 100.

99− 2 = 97

. 99 + 2 = 101.

98 + 2 = 100

. 98− 2 = 96.

2

.2

• Two players pick a number (2-100) simultaneously.

• If they pick the same number, that is their payoff.

• If they pick different numbers:• Lower player gets lower number, plus bonus of 30.• Higher player gets lower number, minus penalty of 30.

• Now play a different opponent with a larger penalty.

3

Page 9: Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis Conclusions

Fun Game: Traveler’s Dilemma

..

2

.

3

.

4

.

96

.

97

.

98

.

99

.

100

.

100

....

. . .

....

. 100.

100

. 96 + 2 = 98.

96− 2 = 94

.

100

. 100

.

99− 2 = 97

. 99 + 2 = 101.

98 + 2 = 100

. 98− 2 = 96.

2

.2

• Two players pick a number (2-100) simultaneously.

• If they pick the same number, that is their payoff.

• If they pick different numbers:• Lower player gets lower number, plus bonus of 30.• Higher player gets lower number, minus penalty of 30.

• Traveler’s Dilemma has a unique Nash equilibrium.

3

Page 10: Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis Conclusions

Fun Game: Traveler’s Dilemma

..

2

.

3

.

4

.

96

.

97

.

98

.

99

.

100

....

. . .

....

. 100.

100

. 96 + 2 = 98.

96− 2 = 94

.

100

. 100

.

99− 2 = 97

. 99 + 2 = 101

.

98 + 2 = 100

. 98− 2 = 96.

2

.2

• Two players pick a number (2-100) simultaneously.

• If they pick the same number, that is their payoff.

• If they pick different numbers:• Lower player gets lower number, plus bonus of 30.• Higher player gets lower number, minus penalty of 30.

• Traveler’s Dilemma has a unique Nash equilibrium.

3

Page 11: Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis Conclusions

Fun Game: Traveler’s Dilemma

..

2

.

3

.

4

.

96

.

97

.

98

.

98

.

99

.

100

....

. . .

....

. 100.

100

. 96 + 2 = 98.

96− 2 = 94

.

100

. 100.

99− 2 = 97

. 99 + 2 = 101

.

98 + 2 = 100

. 98− 2 = 96

.

2

.2

• Two players pick a number (2-100) simultaneously.

• If they pick the same number, that is their payoff.

• If they pick different numbers:• Lower player gets lower number, plus bonus of 30.• Higher player gets lower number, minus penalty of 30.

• Traveler’s Dilemma has a unique Nash equilibrium.

3

Page 12: Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis Conclusions

Fun Game: Traveler’s Dilemma

..

2

.

3

.

4

.

96

.

97

.

98

.

99

.

100

....

. . .

....

. 100.

100

. 96 + 2 = 98.

96− 2 = 94

.

100

. 100.

99− 2 = 97

. 99 + 2 = 101.

98 + 2 = 100

. 98− 2 = 96

.

2

.2

• Two players pick a number (2-100) simultaneously.

• If they pick the same number, that is their payoff.

• If they pick different numbers:• Lower player gets lower number, plus bonus of 30.• Higher player gets lower number, minus penalty of 30.

• Traveler’s Dilemma has a unique Nash equilibrium.

3

Page 13: Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis Conclusions

Comparing Behavioral Models[Wright & Leyton-Brown 2010]

100

1010

1020

1030

1040

1050

1060

COMBO9 SW94 SW95 CGCB98 GH01 HSW01 CVH03 HS07 SH08 RPC09

Like

lihoo

d im

prov

emen

t ove

r uni

form

dis

tribu

tion

Dataset

LkPoisson-CH

QREQLk

• Many behavioral models have been proposed.• First study to compare prediction performance of several atonce.

• One model performed clearly better than the others.

4

Page 14: Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis Conclusions

Comparing Behavioral Models[Wright & Leyton-Brown 2010]

100

1010

1020

1030

1040

1050

1060

COMBO9 SW94 SW95 CGCB98 GH01 HSW01 CVH03 HS07 SH08 RPC09

Like

lihoo

dim

prov

emen

tove

runi

form

dist

ribut

ion

Dataset

LkPoisson-CH

QREQLk

• Many behavioral models have been proposed.• First study to compare prediction performance of several atonce.

• One model performed clearly better than the others.4

Page 15: Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis Conclusions

Two main ideas

..1 Quantal utility maximization instead of utility maximization.

..2 Iterative reasoning instead of equilibrium.

5

Page 16: Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis Conclusions

Two main ideas

..1 Quantal utility maximization instead of utility maximization.

..2 Iterative reasoning instead of equilibrium.

5

Page 17: Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis Conclusions

Iterative reasoning

Level 0

0

1

A B C

??

1

A B C

6

Page 18: Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis Conclusions

Iterative reasoning

Level 0

$??? $???? $????

0

1

A B C

??$??? $???? $????

1

A B C

6

Page 19: Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis Conclusions

Iterative reasoning

Level 0

$??? $???? $????

0

1

A B C

??$??? $???? $????

0

1

A B C

6

Page 20: Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis Conclusions

Iterative reasoning

Level 0

$??? $???? $????

0

1

A B C

??$??? $???? $????

0

1

A B C

Level 1

0

1

A B C

Level 0

$??? $???? $????

0

1

A B C

??$??? $???? $????

0

1

A B C

6

Page 21: Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis Conclusions

Iterative reasoning

Level 0

$??? $???? $????

0

1

A B C

??$??? $???? $????

0

1

A B C

Level 1

$4.01 $4.00 $0.25

0

1

A B C

Level 0

$??? $???? $????

0

1

A B C

??$??? $???? $????

0

1

A B C

6

Page 22: Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis Conclusions

Iterative reasoning

Level 0

$??? $???? $????

0

1

A B C

??$??? $???? $????

0

1

A B C

Level 1

$4.01 $4.00 $0.25

0

1

A B C

Level 0

$??? $???? $????

0

1

A B C

??$??? $???? $????

0

1

A B C

6

Page 23: Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis Conclusions

Iterative reasoning

Level 0

$??? $???? $????

0

1

A B C

??$??? $???? $????

0

1

A B C

Level 2

$7.54 $3.25 $0.05

0

1

A B C

Level 1

$4.01 $4.00 $0.25

0

1

A B C

Level 0

$??? $???? $????

0

1

A B C

??$??? $???? $????

0

1

A B C

Level 1

$4.01 $4.00 $0.25

0

1

A B C

Level 0

$??? $???? $????

0

1

A B C

??$??? $???? $????

0

1

A B C

6

Page 24: Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis Conclusions

Level-0

Level 0

$??? $???? $????

0

1

A B C

??$??? $???? $????

0

1

A B C

• Level-0 agents’ actions influence the behavior of every otherlevel.

• Predictions of iterative models can change dramatically iflevel-0 predictions change.

• It is unlikely that anyone actually picks actions uniformly.• Not knowing expected value is different from knowing nothing.• Level-0 agents could use all sorts of heuristics.

• Can we do a better job of predicting level-0 actions?

7

Page 25: Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis Conclusions

Level-0

Level 0

$??? $???? $????

0

1

A B C

??$??? $???? $????

0

1

A B C

• Level-0 agents’ actions influence the behavior of every otherlevel.

• Predictions of iterative models can change dramatically iflevel-0 predictions change.

• It is unlikely that anyone actually picks actions uniformly.• Not knowing expected value is different from knowing nothing.• Level-0 agents could use all sorts of heuristics.

• Can we do a better job of predicting level-0 actions?7

Page 26: Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis Conclusions

Level-0 meta-model[Wright & Leyton-Brown, 2014 (submitted)]

• Define a “meta-model” that predicts a distribution of level-0actions.

• Based on features of the actions that don’t require beliefsabout the other agents’ actions.

• Use an existing iterative model (quantal cognitive hierarchy)on top of the improved level-0 prediction to make predictions.

8

Page 27: Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis Conclusions

Level-0 meta-model[Wright & Leyton-Brown, 2014 (submitted)]

• Define a “meta-model” that predicts a distribution of level-0actions.

• Based on features of the actions that don’t require beliefsabout the other agents’ actions.

• Use an existing iterative model (quantal cognitive hierarchy)on top of the improved level-0 prediction to make predictions.

8

Page 28: Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis Conclusions

Features

Five binary features:

..1 Minmin Unfairness

..2 Maxmax payoff (“Optimistic”)

..3 Maxmin payoff (“Pessimistic”)

..4 Minimax regret

..5 Efficiency (Total payoffs)

9

Page 29: Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis Conclusions

Linear model

For each action, compute weighted sum of informative features,plus a noise weight:

w0 +∑f ∈F

wf I (f )f (ai )

(An action is informative if it can distinguish at least one pair ofactions.)Predict each action w.p. proportional to its weighted sum.

10

Page 30: Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis Conclusions

Linear model

For each action, compute weighted sum of informative features,plus a noise weight:

w0 +∑f ∈F

wf I (f )f (ai )

(An action is informative if it can distinguish at least one pair ofactions.)

Predict each action w.p. proportional to its weighted sum.

10

Page 31: Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis Conclusions

Linear model

For each action, compute weighted sum of informative features,plus a noise weight:

w0 +∑f ∈F

wf I (f )f (ai )

(An action is informative if it can distinguish at least one pair ofactions.)Predict each action w.p. proportional to its weighted sum.

10

Page 32: Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis Conclusions

Example

A B CX 100, 20 10, 67 30, 40Y 40, 35 50, 49 90, 70Z 41, 21 42, 22 40, 23

• Minimax regret is not informative (all have max-regret 60)

• 50, 49 is the fairest outcome, so Y is minmin unfairness.

• Y and Z have min payoff 40 (vs. 10 for X )

• Y leads to the best total utility (90 + 70 = 160)

• X has the highest best-case utility (100)

Action X ’s weight: w0 + wmaxmax

Action Y ’s weight: w0 + wminmin + wtotal + wfairness

Action Z ’s weight: w0 + wminmin

11

Page 33: Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis Conclusions

Example

A B CX 100, 20 10, 67 30, 40Y 40, 35 50, 49 90, 70Z 41, 21 42, 22 40, 23

• Minimax regret is not informative (all have max-regret 60)

• 50, 49 is the fairest outcome, so Y is minmin unfairness.

• Y and Z have min payoff 40 (vs. 10 for X )

• Y leads to the best total utility (90 + 70 = 160)

• X has the highest best-case utility (100)

Action X ’s weight: w0 + wmaxmax

Action Y ’s weight: w0 + wminmin + wtotal + wfairness

Action Z ’s weight: w0 + wminmin

11

Page 34: Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis Conclusions

Example

A B CX 100, 20 10, 67 30, 40Y 40, 35 50, 49 90, 70Z 41, 21 42, 22 40, 23

• Minimax regret is not informative (all have max-regret 60)

• 50, 49 is the fairest outcome, so Y is minmin unfairness.

• Y and Z have min payoff 40 (vs. 10 for X )

• Y leads to the best total utility (90 + 70 = 160)

• X has the highest best-case utility (100)

Action X ’s weight: w0 + wmaxmax

Action Y ’s weight: w0 + wminmin + wtotal + wfairness

Action Z ’s weight: w0 + wminmin

11

Page 35: Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis Conclusions

Example

A B CX 100, 20 10, 67 30, 40Y 40, 35 50, 49 90, 70Z 41, 21 42, 22 40, 23

• Minimax regret is not informative (all have max-regret 60)

• 50, 49 is the fairest outcome, so Y is minmin unfairness.

• Y and Z have min payoff 40 (vs. 10 for X )

• Y leads to the best total utility (90 + 70 = 160)

• X has the highest best-case utility (100)

Action X ’s weight: w0 + wmaxmax

Action Y ’s weight: w0 + wminmin + wtotal + wfairness

Action Z ’s weight: w0 + wminmin

11

Page 36: Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis Conclusions

Example

A B CX 100, 20 10, 67 30, 40Y 40, 35 50, 49 90, 70Z 41, 21 42, 22 40, 23

• Minimax regret is not informative (all have max-regret 60)

• 50, 49 is the fairest outcome, so Y is minmin unfairness.

• Y and Z have min payoff 40 (vs. 10 for X )

• Y leads to the best total utility (90 + 70 = 160)

• X has the highest best-case utility (100)

Action X ’s weight: w0 + wmaxmax

Action Y ’s weight: w0 + wminmin + wtotal + wfairness

Action Z ’s weight: w0 + wminmin

11

Page 37: Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis Conclusions

Example

A B CX 100, 20 10, 67 30, 40Y 40, 35 50, 49 90, 70Z 41, 21 42, 22 40, 23

• Minimax regret is not informative (all have max-regret 60)

• 50, 49 is the fairest outcome, so Y is minmin unfairness.

• Y and Z have min payoff 40 (vs. 10 for X )

• Y leads to the best total utility (90 + 70 = 160)

• X has the highest best-case utility (100)

Action X ’s weight: w0 + wmaxmax

Action Y ’s weight: w0 + wminmin + wtotal + wfairness

Action Z ’s weight: w0 + wminmin

11

Page 38: Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis Conclusions

Example

A B CX 100, 20 10, 67 30, 40Y 40, 35 50, 49 90, 70Z 41, 21 42, 22 40, 23

• Minimax regret is not informative (all have max-regret 60)

• 50, 49 is the fairest outcome, so Y is minmin unfairness.

• Y and Z have min payoff 40 (vs. 10 for X )

• Y leads to the best total utility (90 + 70 = 160)

• X has the highest best-case utility (100)

Action X ’s weight: w0 + wmaxmax

Action Y ’s weight: w0 + wminmin + wtotal + wfairness

Action Z ’s weight: w0 + wminmin

11

Page 39: Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis Conclusions

Data & Parameters

Name Source Games n

SW94 [Stahl and Wilson, 1994] 10 4005SW95 [Stahl and Wilson, 1995] 12 576CGCB98 [Costa-Gomes et al., 1998] 18 15662GH01 [Goeree and Holt, 2001] 10 500CVH03 [Cooper and Van Huyck, 2003] 8 2992HSW01 [Haruvy et al., 2001] 15 869HS07 [Haruvy and Stahl, 2007] 20 2940SH08 [Stahl and Haruvy, 2008] 18 1288

Combo8 400 samples from each 111 3200

• Set parameters (weights, level frequencies, etc.) and evaluatedperformance using cross validation on combined dataset:

..1 Divide data into 10 equal-sized random folds

..2 At step t: Choose maximum-likelihood parameters for datasetminus fold t (training folds) and compute likelihood of fittedmodel on fold t (test folds).

• Report sum of likelihoods of test folds.

12

Page 40: Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis Conclusions

Performance results

100

105

1010

1015

1020

1025

1030

1035

1040

1045

QCHLk CH

Like

lihood im

pro

vem

ent

over

u.a

.r.

Uniform L0Ordered Binary

Weighted Linear

Three iterative models:

..1 Quantal Cognitive Hierarchy

..2 Level-k

..3 Cognitive Hierarchy

Three level-0 meta-models:

..1 Uniform L0

..2 Ordered Binary

..3 Weighted Linear13

Page 41: Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis Conclusions

Parameter analysis

• Maximum likelihood fits do not tell us how important oridentified each feature is.

• The models produce probabilistic predictions.

• So we can compute a posterior distribution over parameters:

Pr(. . . ,w0,wfairness,wmaxmax, . . . | D)

• Distribution tells us how important and/or identifiedparameters are.

14

Page 42: Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis Conclusions

Parameter analysis

• Maximum likelihood fits do not tell us how important oridentified each feature is.

• The models produce probabilistic predictions.

• So we can compute a posterior distribution over parameters:

Pr(. . . ,w0,wfairness,wmaxmax, . . . | D)

• Distribution tells us how important and/or identifiedparameters are.

14

Page 43: Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis Conclusions

Parameter analysis: Weights

0

0.2

0.4

0.6

0.8

1

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7

cum

ulat

ive

prob

abilit

y

feature weight

fairnessmaxmaxmaxmin

regrettotal

• Fairness is by far the highest weighted feature.

• All the features seem reasonably well identified.

15

Page 44: Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis Conclusions

Parameter analysis: Levels

0

0.2

0.4

0.6

0.8

1

0 0.1 0.2 0.3 0.4 0.5 0.6

Cum

ula

tive p

robabili

ty

L0

Uniform L0Ordered veto

Weighted linear

0

0.2

0.4

0.6

0.8

1

0 0.05 0.1 0.15 0.2 0.25

Cum

ula

tive p

robabili

ty

L1

0

0.2

0.4

0.6

0.8

1

0 0.05 0.1 0.15 0.2 0.25

Cum

ula

tive p

robabili

ty

L2

• Weighted linear =⇒ lower variance estimates• ∼Half the population is level-0!

16

Page 45: Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis Conclusions

Conclusions

• Weighted linear meta-model for level-0 agents dramaticallyimproved the performance of all three iterative models.

• Almost erases the difference between the models themselves.

• Strong evidence for the existence of level-0 agents.• For any meta-model, including uniform!• Contrary to conventional wisdom.

17

Page 46: Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis Conclusions

Thanks!

• Weighted linear meta-model for level-0 agents dramaticallyimproved the performance of all three iterative models.

• Almost erases the difference between the models themselves.

• Strong evidence for the existence of level-0 agents.• For any meta-model, including uniform!• Contrary to conventional wisdom.

18

Page 47: Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis ConclusionsBibliography

Cooper, D. and Van Huyck, J. (2003).Evidence on the equivalence of the strategic and extensiveform representation of games.JET, 110(2):290–308.

Costa-Gomes, M., Crawford, V., and Broseta, B. (1998).Cognition and behavior in normal-form games: anexperimental study.Discussion paper 98-22, UCSD.

Goeree, J. K. and Holt, C. A. (2001).Ten little treasures of game theory and ten intuitivecontradictions.AER, 91(5):1402–1422.

Haruvy, E. and Stahl, D. (2007).Equilibrium selection and bounded rationality in symmetricnormal-form games.JEBO, 62(1):98–119.

18

Page 48: Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis Conclusions

Haruvy, E., Stahl, D., and Wilson, P. (2001).Modeling and testing for heterogeneity in observed strategicbehavior.Review of Economics and Statistics, 83(1):146–157.

Rogers, B. W., Palfrey, T. R., and Camerer, C. F. (2009).Heterogeneous quantal response equilibrium and cognitivehierarchies.JET, 144(4):1440–1467.

Stahl, D. and Haruvy, E. (2008).Level-n bounded rationality and dominated strategies innormal-form games.JEBO, 66(2):226–232.

Stahl, D. and Wilson, P. (1994).Experimental evidence on players’ models of other players.JEBO, 25(3):309–327.

Stahl, D. and Wilson, P. (1995).

18

Page 49: Predicting Human Behavior In Games - UBC …kevinlb/teaching/cs532l - 2013-14/Lectures... · Predicting Human Behavior In Games James Wright ... 3. Overview Comparing ... Level-0

Overview Comparing models Iterative models Meta-models Evaluation Parameter analysis ConclusionsOn players’ models of other players: Theory and experimentalevidence.GEB, 10(1):218–254.

18