Lecture 20 Zero Sum Games p2 14482
Announcements
Hw 6 due Friday
Exam 2 March 22 7pm
Exam 2 proTaeme
Readings
Vanderbei
Lecter GM 2.5
Finite choice 2 player Zero sum games
Payoff matrix
M_ij = amount Alice (rows) wins if
Alice plays pure strategy i, Bob plays pure strategy j
Expected payoff for mixed strategies x, y: x^T M y
p(x) = \min_y x^T M y given fixed x
q(y) = \max_x x^T M y given fixed y
Alice wants to maximize payoff, Bob wants to minimize
\bar{x} is worst case optimal for Alice if p(\bar{x}) = \max_x p(x)
\bar{y} is worst case optimal for Bob if q(\bar{y}) = \min_y q(y)
Goal: use LP to find \bar{x}, \bar{y} given M
Lemma (last time):
If Alice's mixed strategy is fixed
Bob's best response is a pure strategy
worst case payoff for Alice's x is \min_j x^T M e_j
Alice wants to maximize the worst case
payoff over all possible mixed strategies (probability distributions)
\max_x \min_j x^T M e_j
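A minimal sketch of the two quantities above, the expected payoff x^T M y and the worst case payoff \min_j x^T M e_j. The 2x2 matrix (matching pennies) and the names `payoff` and `worst_case` are illustrative assumptions, not from the lecture.

```python
from fractions import Fraction as F

# Hypothetical 2x2 payoff matrix (matching pennies); M[i][j] = amount Alice wins.
M = [[1, -1],
     [-1, 1]]

def payoff(x, M, y):
    """Expected payoff x^T M y for mixed strategies x (Alice) and y (Bob)."""
    return sum(x[i] * M[i][j] * y[j]
               for i in range(len(x)) for j in range(len(y)))

def worst_case(x, M):
    """p(x) = min_j x^T M e_j: Bob's best pure response to a fixed x."""
    n = len(M[0])
    return min(sum(x[i] * M[i][j] for i in range(len(x))) for j in range(n))

x_uniform = [F(1, 2), F(1, 2)]
x_skewed = [F(3, 4), F(1, 4)]
print(worst_case(x_uniform, M))  # 0
print(worst_case(x_skewed, M))   # -1/2
```

In this toy game the uniform strategy has the better worst case, which is what the max-min objective rewards.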
Unfortunately this isn't a linear program
Trick for maximizing a minimum:
Introduce new variable u
maximize u, where u is smaller than
each option
TRICK
\max_x \min\{ f_1(x), ..., f_m(x) \} subj. to x \in P   (f_i functions)
has the same optimum as
\max u subject to
u \le f_1(x), ..., u \le f_m(x), x \in P
Our problem becomes
\max u subject to u \le x^T M e_j for all j   (x \in R^m, u \in R)
\sum_i x_i = 1, x_i \ge 0
Let 1 = (1, ..., 1) be the all-ones vector
In vector form:
\max u subj. to
u 1^T - x^T M \le 0^T
1^T x = 1
x \ge 0
Similarly Bob wants to minimize q(y):
\min v subj. to
1 v - M y \ge 0, 1^T y = 1, y \ge 0
Exercise: check these are dual
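One possible way to do this check is a weak-duality argument, sketched here under the assumption that Alice's LP is written with variables (x, u) and Bob's with (y, v), as above. Take any feasible (x, u) for Alice and any feasible (y, v) for Bob:

```latex
\begin{align*}
u &\le u + (x^T M - u\,\mathbf{1}^T)\,y + v\,(1 - \mathbf{1}^T x)
   && \text{since } x^T M - u\,\mathbf{1}^T \ge 0,\ y \ge 0,\ \mathbf{1}^T x = 1 \\
  &= u\,(1 - \mathbf{1}^T y) + x^T (M y - v\,\mathbf{1}) + v \\
  &\le v
   && \text{since } \mathbf{1}^T y = 1,\ M y - v\,\mathbf{1} \le 0,\ x \ge 0 .
\end{align*}
```

The multipliers y \ge 0 (one per inequality constraint) and v (free, for the equality constraint) are exactly Bob's variables, and the conditions needed for the bound are exactly Bob's constraints.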
By strong duality p(\bar{x}) = q(\bar{y})
Definition: The value of the game is the optimal payoff
p(\bar{x}) = q(\bar{y})
A game with value 0 is called fair
Example: Rock-Paper-Scissors
\bar{x} = (1/3, 1/3, 1/3)
\bar{y} = (1/3, 1/3, 1/3)
LP for Bob: what is a basic solution? \min v subj. to
1 v - M y \ge 0
y_1 + y_2 + y_3 = 1 is always tight
y_i \ge 0
Need 2 more
Check
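A small check of the uniform strategies for Rock-Paper-Scissors, assuming rows and columns are ordered Rock, Paper, Scissors and that M[i][j] is the amount Alice wins (the sign convention is an assumption). The idea: if (x, u) is feasible for Alice's LP, (y, v) is feasible for Bob's LP, and u = v, then both are optimal.

```python
from fractions import Fraction as F

# Assumed convention: rows/columns ordered Rock, Paper, Scissors,
# M[i][j] = amount Alice wins when she plays i and Bob plays j.
M = [[0, -1, 1],
     [1, 0, -1],
     [-1, 1, 0]]

third = F(1, 3)
x, u = [third] * 3, F(0)   # candidate for Alice's LP: max u
y, v = [third] * 3, F(0)   # candidate for Bob's LP:   min v

xTM = [sum(x[i] * M[i][j] for i in range(3)) for j in range(3)]
My = [sum(M[i][j] * y[j] for j in range(3)) for i in range(3)]

# Alice's constraints: u <= (x^T M)_j for all j, sum(x) = 1, x >= 0.
alice_feasible = all(u <= c for c in xTM) and sum(x) == 1 and min(x) >= 0
# Bob's constraints: v >= (M y)_i for all i, sum(y) = 1, y >= 0.
bob_feasible = all(v >= r for r in My) and sum(y) == 1 and min(y) >= 0

# Feasible points of the two LPs with equal objectives are both optimal.
print(alice_feasible, bob_feasible, u == v)  # True True True
```

Under this convention the value of the game is 0, so Rock-Paper-Scissors is fair.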
Poker: Is bluffing useful?
ante: each player puts in 1
deck: 3 cards (1, 2, 3)
deal 1 card to each
bidding: A starts; each player either bets (adds 1 to the pot) or passes
stops when:
bet followed by bet: best card wins
pass followed by pass: best card wins
bet followed by pass: player who bet wins
5 things can happen
A pass, B pass: 1 to highest card
A pass, B bet, A pass: 1 to B
A pass, B bet, A bet: 2 to highest card
A bet, B pass: 1 to A
A bet, B bet: 2 to highest card
Pure strategies
A can:
pass; if B bets, pass
pass; if B bets, bet
bet
B can: pass no matter what, do what A did, do opposite of what A did, bet no matter what
Each player chooses which line of betting to use for each card held: 1, 2, or 3
A has 3^3 = 27 pure strategies
B has 4^3 = 64 pure strategies
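The counts can be sketched by enumeration: a pure strategy picks one line of betting for each card a player might hold. The string labels below are hypothetical names for the lines of betting listed above.

```python
from itertools import product

# Hypothetical labels for the lines of betting described in the notes.
A_LINES = ["pass, then pass if B bets", "pass, then bet if B bets", "bet"]
B_LINES = ["always pass", "copy A", "opposite of A", "always bet"]
CARDS = [1, 2, 3]

# A pure strategy assigns one line of betting to each possible card.
a_strategies = list(product(A_LINES, repeat=len(CARDS)))
b_strategies = list(product(B_LINES, repeat=len(CARDS)))
print(len(a_strategies), len(b_strategies))  # 27 64
```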
Eliminate bad choices
If holding 3 don't pass; If holding 1 don't bet
Assume other player is smart
If A holds 2 and B bets
A knows B doesn't have 1 (B won't bet on 1), etc.
Shrink to 8 pure strategies for A, 4 for B
Solve LP: Yes bluffing is useful
Other kinds of games
Bimatrix games: Alice and Bob get own payoff matrix A, B
Zero sum: A = -B
Example: Prisoner's Dilemma
2 convicts, choice: rat or stay silent
both stay silent: Small punishment
both rat: both get medium punishment
one rats, one silent: rat goes free,
silent gets big punishment
Mixed Nash equilibrium: pair \bar{x}, \bar{y} where each is best response
against each other
\bar{x}^T A \bar{y} = \max_x x^T A \bar{y},   \bar{x}^T B \bar{y} = \max_y \bar{x}^T B y
E.g. rat is best strategy against rat
For a zero sum game the mixed Nash equilibrium is just pair of worst case optimal solutions
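A minimal sketch of the best-response check for the Prisoner's Dilemma, restricted to pure strategies. The payoff numbers (negative years of punishment) are hypothetical; only their ordering matches the description above.

```python
RAT, SILENT = 0, 1

# Hypothetical payoffs (negative years of punishment).
# A[i][j] = Alice's payoff, B[i][j] = Bob's payoff,
# when Alice plays i and Bob plays j.
A = [[-5, 0],
     [-10, -1]]
B = [[-5, -10],
     [0, -1]]

def is_pure_nash(i, j):
    """True if i is a best response to j for Alice and j is one to i for Bob."""
    alice_ok = A[i][j] == max(A[k][j] for k in range(2))
    bob_ok = B[i][j] == max(B[i][k] for k in range(2))
    return alice_ok and bob_ok

equilibria = [(i, j) for i in range(2) for j in range(2) if is_pure_nash(i, j)]
print(equilibria)  # [(0, 0)]
```

With these numbers the only pure equilibrium is (RAT, RAT), even though both players would do better if both stayed silent.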
Office hours (not recorded)
Facts: payoff = x^T M y
Lemma: For any mixed strategy x for Alice
p(x) = \min_j x^T M e_j, i.e. p(x) is the minimum over pure strategies e_j for Bob
Proof using LP facts:
If we fix x, can use LP to find the worst case payoff p(x):
\min_y (x^T M) y subj. to \sum_j y_j = 1, y_j \ge 0   (x^T M is a fixed vector)
optimum is exactly p(x); know that some optimum will be at a corner of the polyhedron
\{ y \in R^n | \sum_j y_j = 1, y \ge 0 \}
corners are bfs
which correspond to pure strategies
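A quick numerical illustration of the lemma (not a proof): for a fixed x, no randomly sampled mixed strategy y for Bob gets below the best pure response. The 2x3 matrix, the choice of x, and the helper `random_mixed` are hypothetical.

```python
import random

# Hypothetical 2x3 payoff matrix and a fixed mixed strategy for Alice.
M = [[2, -1, 0],
     [-3, 1, 4]]
x = [0.5, 0.5]

# c = x^T M is a fixed vector once x is fixed.
c = [sum(x[i] * M[i][j] for i in range(2)) for j in range(3)]
best_pure = min(c)  # min_j x^T M e_j

rng = random.Random(0)

def random_mixed(n):
    """Sample a probability vector of length n by normalizing random weights."""
    w = [rng.random() for _ in range(n)]
    s = sum(w)
    return [wi / s for wi in w]

# Gap between the payoff against a mixed y and the best pure response.
gaps = []
for _ in range(1000):
    y = random_mixed(3)
    gaps.append(sum(c[j] * y[j] for j in range(3)) - best_pure)

print(min(gaps) >= -1e-12)  # True
```

Each sampled payoff is a convex combination of the entries of c, so it cannot be smaller than the smallest entry, which is attained at a corner e_j.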