Upload
maili
View
46
Download
2
Tags:
Embed Size (px)
DESCRIPTION
Characterizing distribution rules for cost sharing games. Thesis defense May 28, 2013. by Raga Gopalakrishnan Computing and Mathematical Sciences, Caltech. Distributed resource allocation problems. - PowerPoint PPT Presentation
Citation preview
Characterizing distribution rules for cost sharing games
Thesis defenseMay 28,
2013byRaga Gopalakrishnan
Computing and Mathematical Sciences, Caltech
Distributed resource allocation problems• Allocate scarce resources to distributed
agents, such that some objective function is optimized.
• Several examples throughout this talk…
Example: Network formation
S1
S2
D1
D2
61
6
1
1
6
1
Problems:• NP-hard, not scalable,
not reliable, …• Agents are people; not
everyone may be happy with the outcome.
Centralized optimization:• Build the optimal
network (cost 10).• Recover this cost
from S1 and S2.
[ Jackson 2003 ]
Example: Network formation
S1
S2
D1
D2
61
6
1
1
6
1
Distributed solution:• Let sources play a
noncooperative game by choosing the edges they want and pay for them.
• Q: How to share the cost of the common edge?
Outcome depends on
how this cost is shared!
[ Jackson 2003 ]
Example: Network formation game
S1
S2
D1
D2
61
3+3
1
1
6
1
S1
S2
D1
D2
61
5+1
1
1
6
1
optimal network is a Nash
equilibrium
sub-optimal Nash
equilibrium
Q: How to share the cost of the common edge?
Option 1: S1 pays 5 S2 pays 1
Option 2: S1 pays 3 S2 pays 3
[ Anshelevich et al. 2004 ]
Cost sharing games
Network formation
Facility location
Multi-project management
S1
S2
D1
D2
? ??
????
??
?
Common Feature:The distribution rules used
to share the global cost/revenue determine the
players’ local utility functions,
[ Anshelevich et al. 2004 ]
[ Vetta 2002 ]
Cost sharing games
Network formation
Facility location
Multi-project management
S1
S2
D1
D2
? ??
????
??
?
Common Feature:The distribution rules used
to share the global “welfare” determine the
players’ local utility functions,
and hence the outcome!
[ Anshelevich et al. 2004 ]
[ Vetta 2002 ]
Cost sharing games
Network formation
Facility location
Multi-project management
S1
S2
D1
D2
? ??
????
??
?
Goal:Design distribution
rules that result in a “desirable” outcome.
[ Anshelevich et al. 2004 ]
[ Vetta 2002 ]
Goal of the thesis:
Network formation
Facility locationMulti-projectmanagement
S1
S2
D1
D2
? ??
????
???
Characterize the space of all distribution rules
that result in a “desirable” outcomefor a broad class of cost sharing games.
[ G., Marden, Wierman. NetGCooP 2011. ][ G., Marden, Wierman. EC 2013. ] [ full version under submission to MOR. ]
Goal of the thesis:
Network formation
Facility locationMulti-projectmanagement
S1
S2
D1
D2
? ??
????
???
Characterize the space of all distribution rules
that result in a stable outcomefor a broad class of cost sharing games.
[ G., Marden, Wierman. NetGCooP 2011. ][ G., Marden, Wierman. EC 2013. ] [ full version under submission to MOR. ]
“equilibrium”+efficiency,
+tractability, …
Scenario: Heterogeneous multi-server system
lFCFS
scheduling
m1
m2
mm
?
Variations – distributed resource allocationScheduling games in service systems
1Focus: Designing dispatch policies[ G., Doroudi, Wierman. MAMA 2011. ]
Staffing games in service systems
2Focus: Investigating square-root staffing policy for large systems[ Working paper, in preparation for submission to OR. ]Joint work with Ward and Wierman.
staffing
?
Variations – distributed resource allocation
? ? ?
? ? ?
? ? ?
Scenario:Content distribution
network
Content caching and replicationFocus: Investigating for and computing pure Nash equilibria[ G., Kanoulas, Karuturi, Rangan, Rajaraman, Sundaram. LATIN 2012. ]
Variations – distributed resource allocation
Scenario: Distributed control
Sensor coverage
?
Wireless frequency selection
F1 F2 F3 F1 F2 F3
?
frequency
frequency
Wireless access point assignment
?
• No people involved! Objective function is not cost/revenue.
• pretend that agents are people, model them as players in a “cost” sharing game.
Game theoretic controlFocus: Designing the entire game[ G., Marden, Wierman. HotMetrics 2010. ]
Variations – distributed resource allocation
VM
VM
VM
VM
VM
VM
VM
VM?
??
?
?
Scenario:Bandwidth allocation in
datacenter networks
Bandwidth allocation in multi-tenant datacentersFocus: Designing a robust bandwidth allocation scheme[ working paper ]Joint work with Attar, Jeyakumar, Narayana, Prabhakar.
Goal of the thesis:
Network formation
Facility locationMulti-projectmanagement
S1
S2
D1
D2
? ??
????
???
Characterize the space of all distribution rules
that result in a stable outcomefor a broad class of cost sharing games.
[ G., Marden, Wierman. NetGCooP 2011. ][ G., Marden, Wierman. EC 2013. ] [ full version under submission to MOR. ]
Model
players (researchers)
𝑮=(𝑵 ,𝑹 , {𝓐𝒊 }𝒊∈𝑵 , {𝑾 𝒓 }𝒓∈𝑹 , { 𝒇 𝒓 }𝒓∈𝑹)
𝟒𝟑
𝟏 𝟐
𝟓 𝟔
𝟒𝟑
𝟏 𝟐
𝟓 𝟔
Model
players (researchers) resources (projects)
𝑮=(𝑵 ,𝑹 , {𝓐𝒊 }𝒊∈𝑵 , {𝑾 𝒓 }𝒓∈𝑹 , { 𝒇 𝒓 }𝒓∈𝑹)
𝒓𝟏
𝒓𝟑
𝒓𝟐
𝒓𝟒
𝟒𝟑
𝟏 𝟐
𝟓 𝟔
Model 𝑮=(𝑵 ,𝑹 , {𝓐𝒊 }𝒊∈𝑵 , {𝑾 𝒓 }𝒓∈𝑹 , { 𝒇 𝒓 }𝒓∈𝑹)
𝒓𝟏
𝒓𝟑
𝒓𝟐
𝒓𝟒
𝓐𝟒= {{𝒓 𝟏 } , {𝒓𝟑 ,𝒓𝟒 }}
𝒂𝒊∈𝓐𝒊⊆𝟐𝑹
𝒂𝟒=¿
𝓐𝟔
𝓐𝟓
𝓐𝟒
𝓐𝟑
𝓐𝟐
𝓐𝟏
𝟒𝟑
𝟏 𝟐
𝟓 𝟔
Model 𝑮=(𝑵 ,𝑹 , {𝓐𝒊 }𝒊∈𝑵 , {𝑾 𝒓 }𝒓∈𝑹 , { 𝒇 𝒓 }𝒓∈𝑹)
𝒓𝟏
𝒓𝟑
𝒓𝟐
𝒓𝟒
𝑾 𝒓 :𝟐𝑵→ℝ
𝑾 𝒓𝟑( {𝟒 ,𝟔} )
Denote the set of all (distinct) local welfare functions:𝕎𝑮=¿𝒓∈𝑹 {𝑾 𝒓 }
Global welfare:
𝓦 (𝒂 )=∑𝒓∈𝑹
𝑾 𝒓 ({𝒂}𝒓 )
𝟒𝟑
𝟏 𝟐
𝟓 𝟔
Model 𝑮=(𝑵 ,𝑹 , {𝓐𝒊 }𝒊∈𝑵 , {𝑾 𝒓 }𝒓∈𝑹 , { 𝒇 𝒓 }𝒓∈𝑹)
𝒓𝟏
𝒓𝟑
𝒓𝟐
𝒓𝟒
𝑾 𝒓𝟑( {𝟒 ,𝟔} )
𝒇 𝒓 :𝑵 ×𝟐𝑵→ℝ
𝒇𝒓𝟑 (𝟒 , {𝟒 ,𝟔 })
𝒇 𝒓 𝟑(𝟔 , {𝟒 ,𝟔} )
Scalability assumption:𝑾 𝒓 𝒊=𝑾 𝒓 𝒋
⟹ 𝒇 𝒓 𝒊= 𝒇 𝒓 𝒋
Denote by𝒇 𝒓 𝒇𝑾 𝒓
𝟒𝟑
𝟏 𝟐
𝟓 𝟔
Model 𝑮=(𝑵 ,𝑹 , {𝓐𝒊 }𝒊∈𝑵 , {𝑾 𝒓 }𝒓∈𝑹 , { 𝒇𝑾 }𝑾 ∈𝕎𝑮
)
𝒓𝟏
𝒓𝟑
𝒓𝟐
𝒓𝟒
𝑾 𝒓𝟑( {𝟒 ,𝟔} )
𝒇𝑾 :𝑵 ×𝟐𝑵→ℝ
𝒇𝒓𝟑 (𝟒 , {𝟒 ,𝟔 })
𝒇 𝒓 𝟑(𝟔 , {𝟒 ,𝟔} )
Scalability assumption:𝑾 𝒓 𝒊=𝑾 𝒓 𝒋
⟹ 𝒇 𝒓 𝒊= 𝒇 𝒓 𝒋
Denote by𝒇 𝒓 𝒇𝑾 𝒓
𝟒𝟑
𝟏 𝟐
𝟓 𝟔
Model 𝑮=(𝑵 ,𝑹 , {𝓐𝒊 }𝒊∈𝑵 , {𝑾 𝒓 }𝒓∈𝑹 , 𝒇𝕎𝑮)
𝒓𝟏
𝒓𝟑
𝒓𝟐
𝒓𝟒
𝑾 𝒓𝟑( {𝟒 ,𝟔} )
𝒇𝑾 :𝑵 ×𝟐𝑵→ℝ
𝒇𝒓𝟑 (𝟒 , {𝟒 ,𝟔 })
𝒇 𝒓 𝟑(𝟔 , {𝟒 ,𝟔} )
Scalability assumption:𝑾 𝒓 𝒊=𝑾 𝒓 𝒋
⟹ 𝒇 𝒓 𝒊= 𝒇 𝒓 𝒋
Denote by𝒇 𝒓 𝒇𝑾 𝒓
𝟒𝟑
𝟏 𝟐
𝟓 𝟔
Model𝒓𝟏
𝒓𝟑
𝒓𝟐
𝒓𝟒 𝟏
𝟒
𝟐𝟒
𝟔
𝟐
𝟑
𝟑
𝟔
𝟓An allocation𝒂= (𝒂𝟏 ,𝒂𝟐 ,⋯ ,𝒂𝒏 )
Players’ overall utility is the sum of their shares across all the resources
they choose𝓤𝒊 (𝒂)=∑𝒓∈𝒂 𝒊
𝒇 𝒓 (𝒊 , {𝒂}𝒓 )
Example: 𝒂𝟒={𝒓𝟑 ,𝒓 𝟒 }𝓤𝟒 (𝒂)= 𝒇 𝒓𝟑(𝟒 , {𝟒 ,𝟔 } )+ 𝒇 𝒓𝟒
(𝟒 , {𝟏 ,𝟐 ,𝟒 } )𝟒
𝓐𝟔
𝓐𝟓
𝓐𝟒
𝓐𝟑
𝓐𝟐
𝓐𝟏
𝑮=(𝑵 ,𝑹 , {𝓐𝒊 }𝒊∈𝑵 , {𝑾 𝒓 }𝒓∈𝑹 , 𝒇𝕎𝑮)
Model𝒓𝟏
𝒓𝟑
𝒓𝟐
𝒓𝟒 𝟏
𝟒
𝟐𝟒
𝟔
𝟐
𝟑
𝟑
𝟔
𝟓An allocation𝒂= (𝒂𝟏 ,𝒂𝟐 ,⋯ ,𝒂𝒏 )
Players’ overall utility is the sum of their shares across all the resources
they choose𝓤𝒊 (𝒂)=∑𝒓∈𝒂 𝒊
𝒇 𝒓 (𝒊 , {𝒂}𝒓 )
Example: 𝒂𝟒={𝒓𝟑 ,𝒓 𝟒 }𝓤𝟒 (𝒂)= 𝒇 𝒓𝟑(𝟒 , {𝟒 ,𝟔 } )+ 𝒇 𝒓𝟒
(𝟒 , {𝟏 ,𝟐 ,𝟒 } )𝟒
Design!
𝑮=(𝑵 ,𝑹 , {𝓐𝒊 }𝒊∈𝑵 , {𝑾 𝒓 }𝒓∈𝑹 , 𝒇𝕎𝑮)
Model
a broad model that includes:
• Multi-project management
• Network formation games
• Facility location games• Multicast games• Congestion games• Routing games• Coverage games• …
𝑮=(𝑵 ,𝑹 , {𝓐𝒊 }𝒊∈𝑵 , {𝑾 𝒓 }𝒓∈𝑹 , 𝒇𝕎𝑮)
[ Anshelevich et al. 2004 ]
[ Vetta 2002 ]
[ Chekuri et al. 2007 ]
[ Rosenthal 1973 ]
[ Roughgarden and Tardos 2002 ]
[ Marden and Wierman 2008,2013 ]
𝟒𝟑
𝟏 𝟐
𝟓 𝟔 𝓐𝟔
𝓐𝟓
𝓐𝟒
𝓐𝟑
𝓐𝟐
𝓐𝟏 𝒓𝟏
𝒓𝟑
𝒓𝟐
𝒓𝟒
Goal:
𝟒𝟑
𝟏 𝟐
𝟓 𝟔
Given the set of players
𝓐𝟔
𝓐𝟓
𝓐𝟒
𝓐𝟑
𝓐𝟐
𝓐𝟏 𝒓𝟏
𝒓𝟑
𝒓𝟐
𝒓𝟒
Goal:
𝟒𝟑
𝟏 𝟐
𝟓 𝟔
Given the set of players and the local welfare functions,
𝑾 𝒓𝟑( ∙ ) 𝑾 𝒓 𝟒
( ∙ )
𝑾 𝒓𝟏( ∙ ) 𝑾 𝒓𝟐
( ∙ )
𝓐𝟔
𝓐𝟓
𝓐𝟒
𝓐𝟑
𝓐𝟐
𝓐𝟏
Goal:
𝟒𝟑
𝟏 𝟐
𝟓 𝟔
Given the set of players and the local welfare functions, design local distribution rules
𝒇𝑾 𝒓 𝟏 ( ∙ ,∙ )
𝑾 𝒓𝟑( ∙ ) 𝑾 𝒓 𝟒
( ∙ )
𝒇𝑾 𝒓 𝟐 ( ∙ ,∙ )
𝒇𝑾 𝒓 𝟑 ( ∙ ,∙ ) 𝒇𝑾 𝒓 𝟒 ( ∙ , ∙ )
𝑾 𝒓𝟏( ∙ ) 𝑾 𝒓𝟐
( ∙ )
𝓐𝟔
𝓐𝟓
𝓐𝟒
𝓐𝟑
𝓐𝟐
𝓐𝟏
Goal:
𝟒𝟑
𝟏 𝟐
𝟓 𝟔
Given the set of players and the local welfare functions, design local distribution rules
that result in a “desirable” outcome
𝓐𝟔
𝓐𝟓
𝓐𝟒
𝓐𝟑
𝓐𝟐
𝓐𝟏
Goal:
𝒇𝑾 𝒓 𝟏 ( ∙ ,∙ )
𝑾 𝒓𝟑( ∙ ) 𝑾 𝒓 𝟒
( ∙ )
𝒇𝑾 𝒓 𝟐 ( ∙ ,∙ )
𝒇𝑾 𝒓 𝟑 ( ∙ ,∙ ) 𝒇𝑾 𝒓 𝟒 ( ∙ , ∙ )
𝑾 𝒓𝟏( ∙ ) 𝑾 𝒓𝟐
( ∙ )
𝟒𝟑
𝟏 𝟐
𝟓 𝟔
Given the set of players and the local welfare functions, design local distribution rules
that result in a “desirable” outcomeregardless of the set of resources and players’
action sets.
𝓐𝟔
𝓐𝟓
𝓐𝟒
𝓐𝟑
𝓐𝟐
𝓐𝟏
Goal:
𝒇𝑾 𝒓 𝟏 ( ∙ ,∙ )
𝑾 𝒓𝟑( ∙ ) 𝑾 𝒓 𝟒
( ∙ )
𝒇𝑾 𝒓 𝟐 ( ∙ ,∙ )
𝒇𝑾 𝒓 𝟑 ( ∙ ,∙ ) 𝒇𝑾 𝒓 𝟒 ( ∙ , ∙ )
𝑾 𝒓𝟏( ∙ ) 𝑾 𝒓𝟐
( ∙ )
𝓖 (𝑵 ,𝕎 , 𝒇𝕎 )Class of all games
with set of players ,set of local welfare functions ,
corresponding local distribution rules
𝑮=(𝑵 ,𝑹 , {𝓐𝒊 }𝒊∈𝑵 , {𝑾 𝒓 }𝒓∈𝑹 , 𝒇𝕎𝑮)
Given the set of players and the local welfare functions, design local distribution rules
that result in a “desirable” outcomeregardless of the set of resources and players’
action sets.
Goal:
Given any and ,design
such that all games in are “desirable”.
𝓖 (𝑵 ,𝕎 , 𝒇𝕎 )Class of all games
with set of players ,set of local welfare functions ,
corresponding local distribution rules
Goal:
𝑮=(𝑵 ,𝑹 , {𝓐𝒊 }𝒊∈𝑵 , {𝑾 𝒓 }𝒓∈𝑹 , 𝒇𝕎𝑮)
“desirable” properties for a gameStatic:
• Stability• Equilibrium concepts
• Efficiency• Price of anarchy• Price of stability
• Budget-balance
• Tractability• Poly-time
computable
• Locality• Separable
• …
Dynamic:• Existence of
distributed learning rules that converge to one of the stable outcomes
• Existence of distributed learning rules that converge to the most efficient stable outcome
• Good convergence rates
• Simple, intuitive learning dynamics perform well, e.g., “best-response”
• …
“desirable” properties for a gameStatic:
• Stability• Equilibrium concepts
• Efficiency• Price of anarchy• Price of stability
• Budget-balance
• Tractability• Poly-time
computable
• Locality• Separable
• …
Dynamic:• Existence of
distributed learning rules that converge to one of the stable outcomes
• Existence of distributed learning rules that converge to the most efficient stable outcome
• Good convergence rates
• Simple, intuitive learning dynamics perform well, e.g., “best-response”
• …
“desirable” properties for a gameStatic:
• Stability• Equilibrium concepts
“desirable” properties for a gameStatic:
• Stability• pure Nash
equilibrium
• dominant strategy equilibrium
• mixed Nash equilibrium
• correlated equilibrium
• coarse-correlated equilibrium
• …
∀ 𝒊 𝒂𝒊𝑵𝑬∈𝐚𝐫𝐠𝐦𝐚𝐱
𝒂 𝒊∈𝓐𝒊
𝓤𝒊 (𝒂𝒊 ,𝒂−𝒊𝑵𝑬 )
An allocation/outcome that satisfies:
“desirable” properties for a gameStatic:
• Stability• pure Nash
equilibrium
• Efficiency
∀ 𝒊 𝒂𝒊𝑵𝑬∈𝐚𝐫𝐠𝐦𝐚𝐱
𝒂 𝒊∈𝓐𝒊
𝓤𝒊 (𝒂𝒊 ,𝒂−𝒊𝑵𝑬 )
An allocation/outcome that satisfies:
Price of Anarchy (PoA)Ratio of the optimum
welfare to the welfare of the worst Nash
equilibrium.
Price of Stability (PoS)Ratio of the optimum
welfare to the welfare of the best Nash equilibrium.
Goal: Given any and , design such that all games in are “desirable”.
“desirable” properties for a gameStatic:
• Stability• pure Nash
equilibrium
• Efficiency
∀ 𝒊 𝒂𝒊𝑵𝑬∈𝐚𝐫𝐠𝐦𝐚𝐱
𝒂 𝒊∈𝓐𝒊
𝓤𝒊 (𝒂𝒊 ,𝒂−𝒊𝑵𝑬 )
An allocation/outcome that satisfies:
Price of Anarchy (PoA)Ratio of the optimum
welfare to the welfare of the worst Nash
equilibrium.
Price of Stability (PoS)Ratio of the optimum
welfare to the welfare of the best Nash equilibrium.
Goal: Given any and , design such that all games in possess a pure Nash equilibrium.
Existing distribution rulesThe Shapley value [ Shapley 1953 ]
A player’s share of the welfare should depend on their
“average” marginal contribution
Intuition:• Imagine the players in arriving one at a time to the
resource, according to some order.• When player arrives, depending on when he
arrived, he sees some subset of players already present.
• Player can be thought of as contributing .• The Shapley value is his expected marginal
contribution over all possible orders, assuming each order is equally likely.
• Example: when all players are ‘identical’, .
𝒇 𝑺𝑽𝑾 (𝒊 ,𝑺 )= ∑
𝑻⊆𝑺¿𝒊 }¿ ¿¿ ¿¿
Properties of the Shapley valueStatic:
• Stability• pure Nash
equilibrium
• Efficiency• Price of anarchy• Price of stability
• Budget-balance
• Tractability• Poly-time
computable
• Locality• Separable
• …
Dynamic:• Existence of
distributed learning rules that converge to one of the stable outcomes
• Existence of distributed learning rules that converge to the most efficient stable outcome
• Good convergence rates
• Simple, intuitive learning dynamics perform well, e.g., “best-response”
• …
Static:• Stability
• pure Nash equilibrium
• Efficiency• Price of anarchy• Price of stability
• Budget-balance
• Tractability• Poly-time
computable
• Locality• Separable
• …
Dynamic:• Existence of
distributed learning rules that converge to one of the stable outcomes
• Existence of distributed learning rules that converge to the most efficient stable outcome
• Good convergence rates
• Simple, intuitive learning dynamics perform well, e.g., “best-response”
• …
Properties of the Shapley value
Static:• Stability
• pure Nash equilibrium
• Efficiency• Price of anarchy• Price of stability
• Budget-balance
• Tractability• Poly-time
computable
• Locality• Separable
• …
Dynamic:• Existence of
distributed learning rules that converge to one of the stable outcomes
• Existence of distributed learning rules that converge to the most efficient stable outcome
• Good convergence rates
• Simple, intuitive learning dynamics perform well, e.g., “best-response”
• …
Properties of the Shapley value
“desirable” properties for a gameStatic:
• Stability• pure Nash
equilibrium
Dynamic:• Existence of
distributed learning rules that converge to one of the stable outcomes
• Existence of distributed learning rules that converge to the most efficient stable outcome
• Good convergence rates
• Simple, intuitive learning dynamics perform well, e.g., “best-response”
Potential gameThere exists a
player-independent
function , called the potential
function, which encodes utility
differences due to unilateral
deviations by any player.
∀ 𝒊 ∀𝒂−𝒊 ∀𝒂𝒊 ,𝒂𝒊′ 𝓤𝒊 (𝒂𝒊 ,𝒂−𝒊 )−𝓤𝒊 (𝒂𝒊
′ ,𝒂−𝒊 )=𝚽 (𝒂𝒊 ,𝒂−𝒊 )−𝚽 (𝒂𝒊′ ,𝒂− 𝒊 )
• Potential game• guarantees stability
(pure Nash equilibrium), good dynamic properties
• Efficiency• Price of anarchy• Price of stability
• Budget-balance
• Tractability• Poly-time computable
• Locality• Separable
• …
Properties of the Shapley value
• Potential game• guarantees stability
(pure Nash equilibrium), good dynamic properties
• Efficiency• Price of anarchy• Price of stability
• Budget-balance
• Tractability• Poly-time computable
• Locality• Separable
• …
Properties of the Shapley value
+ Optimal in many specific settings, e.g., network coding, network formation. [ Marden and Effros 2009 ] [ Roughgarden 2009]
‒ Lower bound of 2 for submodular welfare functions. [ Marden and Wierman 2013 ]
• Potential game• guarantees stability
(pure Nash equilibrium), good dynamic properties
• Efficiency• Price of anarchy• Price of stability
• Budget-balance
• Tractability• Poly-time computable
• Locality• Separable
• …
Properties of the Shapley value
• Potential game• guarantees stability
(pure Nash equilibrium), good dynamic properties
• Efficiency• Price of anarchy• Price of stability
• Budget-balance
• Tractability• Poly-time computable
• Locality• Separable
• …
Properties of the Shapley value
‒ Requires computing exponentially many marginal contributions
‒ Intractable in general [ Matsui and Matsui 2000 ]
Existing distribution rulesExtensions of the Shapley value
• Weighted Shapley value : :• Parameterized by a vector of strictly positive player
weights • Expected marginal contribution according to a
probability distribution (that depends on ) with full support on all orders.
• Generalized weighted Shapley value : :• Parameterized by a weight system , where is a vector
of strictly positive player weights, and is an ordered partition of the set of players.
• Expected marginal contribution according to a probability distribution (that depends on ) with support only on those orders that respect : players in arrive before players in when .
• They lead to weighted/generalized weighted potential games. Their properties are similar to those of the Shapley value.
Existing distribution rulesThe marginal contribution
[ Wolpert and Tumer 1999 ]
A player’s share of the welfare is theirmarginal contribution to the welfare
• Similar extensions: weighted marginal contribution ( ) and generalized weighted marginal contribution ( ) can be defined.
𝒇 𝑴𝑪𝑾 (𝒊 ,𝑺 )=𝑾 (𝑺)−𝑾 (𝑺 ¿{𝒊¿})
• Potential game• guarantees stability
(pure Nash equilibrium), good dynamic properties
• Efficiency• Price of anarchy• Price of stability
• Budget-balance
• Tractability• Poly-time computable
• Locality• Separable
• …
Properties of the marginal contribution
• Potential game• guarantees stability
(pure Nash equilibrium), good dynamic properties
• Efficiency• Price of anarchy• Price of stability
• Budget-balance
• Tractability• Poly-time computable
• Locality• Separable
• …
Properties of the marginal contribution
+ Potential function is the same as the welfare function!
• Potential game• guarantees stability
(pure Nash equilibrium), good dynamic properties
• Efficiency• Price of anarchy• Price of stability = 1
• Budget-balance
• Tractability• Poly-time computable
• Locality• Separable
• …
Properties of the marginal contribution
+ Potential function is the same as the welfare function!
‒ Price of anarchy can be arbitrarily bad. [ Marden and Wierman 2013 ]
• Potential game• guarantees stability
(pure Nash equilibrium), good dynamic properties
• Efficiency• Price of anarchy• Price of stability = 1
• Budget-balance
• Tractability• Poly-time computable
• Locality• Separable
• …
Properties of the marginal contribution
‒ Not budget-balanced in general.
• Potential game• guarantees stability
(pure Nash equilibrium), good dynamic properties
• Efficiency• Price of anarchy• Price of stability = 1
• Budget-balance
• Tractability• Poly-time computable
• Locality• Separable
• …
Properties of the marginal contribution
+ Requires computing just one marginal contribution!
Research question:
If so: how do they perform?If not: we can optimize over for efficiency!
Are there distribution rules besides the Shapley value and marginal contribution families that always guarantee a pure Nash equilibrium for all games in ?
Cost sharing games
Research question:
Our (surprising) answer:
NO, for any given local welfare functions .
Cost sharing games
Are there distribution rules besides the Shapley value and marginal contribution families that always guarantee a pure Nash equilibrium for all games in ?
This is a much bigger “NO” than last year’s!
The inspiration for our workTheorem (CRV,MW):There exists a for which any budget-balanced distribution rules that guarantee a pure Nash equilibrium for all games must be equivalent to generalized weighted Shapley values on .[Chen et al. 2010, Marden and Wierman 2013]
Our resultTheorem:For any , any distribution rules that guarantee a pure Nash equilibrium for all games must be equivalent to generalized weighted Shapley values on some “ground” welfare functions , corresponding to the actual welfares distributed.
Cost sharing games
𝑮=(𝑵 ,𝑹 , {𝓐𝒊 }𝒊∈𝑵 , {𝑾 𝒓 }𝒓∈𝑹 , 𝒇𝕎𝑮)
If budget-balance is desired, then .
The inspiration for our workTheorem (CRV,MW):There exists a local welfare function for which all games in possess a pure Nash equilibrium for a budget-balanced if and only if there exists a weight system for which .[Chen et al. 2010, Marden and Wierman 2013]
Our resultTheorem:For any , all games in possess a pure Nash equilibrium if and only if there exists a weight system , and a mapping that maps each local welfare function to a corresponding “ground” welfare function such that . Moreover,
Cost sharing games
first
actual welfaredistributed
The inspiration for our workTheorem (CRV,MW):There exists a local welfare function for which all games in possess a pure Nash equilibrium for a budget-balanced if and only if there exists a weight system for which .[Chen et al. 2010, Marden and Wierman 2013]
Our second resultTheorem:For any , any distribution rules that guarantee a pure Nash equilibrium for all games must be equivalent to generalized weighted marginal contributions on some “ground” welfare functions .
Cost sharing games
PropositionTheorem:For any two welfare functions and , and any weight system ,
Any welfare function can be represented as a linear combination of inclusion functions as follows:
where, for each , the inclusion function is defined as:
denotes the set of contributing coalitions of . denotes the magnitude of their contributions in .
Shapley value = marginal contribution
PropositionTheorem:For any two welfare functions and , and any weight system ,
Our second resultTheorem:For any , all games in possess a pure Nash equilibrium if and only if there exists a weight system , and a mapping that maps each local welfare function to a corresponding “ground” welfare function such that . Moreover,
,where denotes the bijection that maps to .
Cost sharing games
call this map as
What’s different from last time?Then Now
The space of desirable
distribution rules:Weighted Shapley
values on , .
• Single ‘base’ welfare function scaled at each resource
• Scalability:
• Any submodular • Technical
assumption to exclude zero weights
• Budget-balance
[ NetGCooP 2011 ]
What’s different from last time?Then Now
The space of desirable
distribution rules:Weighted Shapley
values on , .
• Single ‘base’ welfare function scaled at each resource
• Scalability:
• Any submodular • Technical
assumption to exclude zero weights
• Budget-balanceThe space of
desirable distribution rules:Weighted Shapley
values on , .
The space of desirable
distribution rules:Weighted Shapley
values on , .
[ NetGCooP 2011 ]
What’s different from last time?Then Now
The space of desirable
distribution rules:Weighted Shapley
values on , .
• Single ‘base’ welfare function scaled at each resource
• Scalability:
• Any submodular • Technical
assumption to exclude zero weights
• Budget-balance
• Arbitrary welfare functions from at each resource
• Weak Scalability:
• Any
The space of desirable
distribution rules:Weighted Shapley
values on , .
The space of desirable
distribution rules:Weighted Shapley
values on , .
[ NetGCooP 2011 ]
What’s different from last time?Then Now
The space of desirable
distribution rules:Weighted Shapley
values on , .
The space of desirable distribution rules:
Generalized weighted Shapley values on , .
• Single ‘base’ welfare function scaled at each resource
• Scalability:
• Any submodular • Technical
assumption to exclude zero weights
• Budget-balance
• Arbitrary welfare functions from at each resource
• Weak Scalability:
• Any • No technical
assumption
[ NetGCooP 2011 ]
What’s different from last time?Then Now
The space of desirable
distribution rules:Weighted Shapley
values on , .
The space of desirable distribution rules:
Generalized weighted Shapley values on , .
• Single ‘base’ welfare function scaled at each resource
• Scalability:
• Any submodular • Technical
assumption to exclude zero weights
• Budget-balance
• Arbitrary welfare functions from at each resource
• Weak Scalability:
• Any • No technical
assumption• Not necessarily
budget-balanced[ NetGCooP 2011 ] [ EC 2013 ]
What’s different from last time?Then Now
The space of desirable
distribution rules:Weighted Shapley
values on , .
The space of desirable distribution rules:
Generalized weighted Shapley values on ,
+ equivalent characterization
• Single ‘base’ welfare function scaled at each resource
• Scalability:
• Any submodular • Technical
assumption to exclude zero weights
• Budget-balance
• Arbitrary welfare functions from at each resource
• Weak Scalability:
• Any • No technical
assumption• Not necessarily
budget-balanced[ EC 2013 ][ NetGCooP 2011 ]
Theorem (marginal contributions):For any , all games in possess a pure Nash equilibrium if and only if there exists a weight system , and a mapping that maps each local welfare function to a corresponding “ground” welfare function such that . Moreover,
,where denotes a special bijection that maps to .
Our characterizationsTheorem (Shapley values):For any , all games in possess a pure Nash equilibrium if and only if there exists a weight system , and a mapping that maps each local welfare function to a corresponding “ground” welfare function such that . Moreover,
Static:
• Stability• pure Nash
equilibrium
• Efficiency• Price of anarchy• Price of stability
• Budget-balance
• Tractability• Poly-time computable
Consequences of our characterizationsDynamic:
• Existence of distributed learning rules that converge to one of the stable outcomes
• Existence of distributed learning rules that converge to the most efficient stable outcome
• Good convergence rates
• Simple, intuitive learning dynamics perform well, e.g., “best-response”
• Potential game• guarantees stability
(pure Nash equilibrium), good dynamic properties
• Efficiency• Price of anarchy• Price of stability
• Budget-balance
• Tractability• Poly-time computable
Consequences of our characterizations
• Potential games are necessary to ensure pure Nash equilibria in our model of cost sharing games!
• Both and , where , result in the same (weighted) potential game, with the same potential function .
• This provides a (previously unknown) closed form expression for the potential function for the weighted Shapley value.
(Weighted)
(Weighted)• Potential game
• guarantees stability (pure Nash equilibrium), good dynamic properties
• Efficiency• Price of anarchy• Price of stability
• Budget-balance
• Tractability• Poly-time computable
Consequences of our characterizations
• New structure on distribution rules for optimizing for PoA/PoS.
• Optimize over the weights and ground welfare functions .
• If budget-balance is required• Set = . Just optimize over .
• If budget-balance is not required• Further reduction in the
space of parameters is possible.
• Theorem: Weights don’t matter!• Just optimize over .
• Future work: Can we obtain a precise characterization of the budget-balance vs. efficiency tradeoff?
(Weighted)• Potential game
• guarantees stability (pure Nash equilibrium), good dynamic properties
• Efficiency• Price of anarchy• Price of stability
• Budget-balance
• Tractability• Poly-time computable
Consequences of our characterizations• A design through Shapley
values provides control over budget-balance, since is the actual welfare distributed.• But, Shapley values are
hard to compute.• A design through marginal
contributions is tractable, however, is not the actual welfare distributed.• To control budget-balance,
we need to start with desired to be distributed, and then use the exponential time transformation to get .
• Can be thought of as a “preprocessing” step.
Recall: Basis representation of as a linear combination of inclusion functions:
where
“contributing coalition”
“magnitude of
contribution”
Proof sketch of main resultRestrict to the case (single welfare function)
𝑾= ∑𝑻∈𝓣 𝒒𝑻𝑾
𝑻𝑾𝑻 (𝑺 )={𝟏 , 𝑻 ⊆𝑺
𝟎 , 𝐨𝐭𝐡𝐞𝐫𝐰𝐢𝐬𝐞
Reduction to characterizing budget-balanced distribution rules:• Notice that does not directly affect strategic
behavior (it only does so through ).• and are the same for any two • Pick so that is a budget-balanced rule for .
Proof technique: Establish a series of necessary conditions on
Proof sketch when Proof technique: Establish a series of necessary conditions on
is completely specified by
𝒇 𝑻≔ 𝒇 𝑮𝑾𝑺𝑽𝑻 [𝝎𝑻 ]
where and
What is requiredof
is not formed in
is formed in
Don’t allocate welfare to any
player
Allocate welfare only to players in ,
independent of others
𝝀𝒊𝑻={𝒇 𝑻 (𝒊 ,𝑻 ) , 𝒊∈𝑺𝟏
𝑻
arbitrary , 𝒊∈𝑺𝟐𝑻
Proof sketch (arbitrary )Proof technique: Establish a series of necessary conditions on
What is requiredof
no coalition from is formed in
Don’t allocate welfare to any
playerAllocate welfare only to players in
these formed coalitions,
independent of others
a coalition from is formed in
𝒇𝑾≔ ∑𝑻∈𝓣 𝒒𝑻 𝒇
𝑻
𝒇𝑾≔ ∑𝑻∈𝓣 𝒒𝑻 𝒇 𝑮𝑾𝑺𝑽
𝑻 [𝝎𝑻 ]
𝒇𝑾≔ ∑𝑻∈𝓣 𝒒𝑻 𝒇 𝑮𝑾𝑺𝑽
𝑻 [𝝎∗ ]
𝒇𝑾≔ 𝒇 𝑮𝑾𝑺𝑽𝑾 [𝝎∗]
Universal weight system ( equivalent to
all the )
Key proof techniquesBasis framework:• Crucial to the modular structure of the proof.• Separates the following two steps:
– Focus on basis distribution rules• Independent of welfare function
– Focus on consistency of these distribution rules• Depends on the welfare function• Separate into cases (different sign combinations of the basis
coefficients), and provide a counterexample template for each case.
– Allows easily extending the proof to the case of multiple welfare functions.
• Exposes fundamental equivalence of Shapley value and marginal contribution rules.
Inclusion-exclusion framework:• Crucial to show that distribution rules must be
representable in the same basis framework.• Helps build inductive counterexamples by singling out
any desired inclusion function (unanimity game) from an arbitrary welfare function.
Limitations of our characterizations• Our goal was to characterize “universal” cost
sharing designs – that guarantee pure Nash equilibrium in all games with arbitrary resources and action sets.
• However, if the class of games of interest have more structure on the action sets, our characterizations claim nothing!
• There may be distribution rules other than generalized weighted Shapley values that guarantee pure Nash equilibria.
• E.g., “Single-selection” coverage games. [ Marden and Wierman 2008 ]
• Future work: What structure on action sets causes the restriction to Shapley value designs?
Extensions• Recall that players’ utility functions are given by:
• In addition, suppose players also have (possibly heterogeneous) private valuations over the set of outcomes, so that:
• Suppose . Then, our model and results extend to this situation if the distribution rules cannot depend on these values.
• But, more efficient designs could be possible if the distribution rules can depend on these private values.• Additional desirable property: “incentive
compatibility”
• (Challenging!) Future work: Can we design efficient, incentive compatible cost sharing mechanisms for such noncooperative settings?
Characterizing distribution rules for cost sharing games
Thesis defenseMay 28,
2013byRaga Gopalakrishnan
Computing and Mathematical Sciences, Caltech