Decomposition of Multi-Player Gamesdengji-zhao.net/publications/AI/2009/AI09_Slides.pdf ·...

Motivation Preliminaries Subgame Detection Solving Decomposable Games Experimental Results and Conclusion

Decomposition of Multi-Player Games

Dengji Zhao1 Stephan Schiffel2 Michael Thielscher2

1Intelligent Systems LaboratoryUniversity of Western Sydney, Australia

2Department of Computer ScienceDresden University of Technology, Germany

AI’09

Outline

1 MotivationThe Problem We StudiedGeneral Idea We Used

2 PreliminariesGame Description Language (GDL)

3 Subgame DetectionBasic Idea

4 Solving Decomposable GamesMotivationSubgame SearchGlobal Game Search

5 Experimental Results and ConclusionExperimental ResultsConclusion

Outline

1 MotivationThe Problem We StudiedGeneral Idea We Used

2 Preliminaries

3 Subgame Detection

4 Solving Decomposable Games

5 Experimental Results and Conclusion

The Problem We Studied

Deep Blue beats World Champion (1997)

Checker

Can Deep Blue play checkers?

General Game Playing

A General Game Player is a system thatable to accept a formal description of arbitrary gamesable to use such descriptions to play the games effectively

General Game Playing

A General Game Player is a system thatable to accept a formal description of arbitrary gamesable to use such descriptions to play the games effectively

Constraints:Time constraints vs. Very large games

Observation:Games contain independent parts (subgames)

Games Contain Subgames

Double-TicTacToe:

Games Contain Subgames

Improve game search by using their subgames?

Double-TicTacToe:

General Idea We Used

Decomposition

Widely recognized in AI PlanningAdapted to General Game Playing

1 How to decompose?2 How to improve search with decomposition?

Previous WorkSingle Player Games:M. Günther, S. Schiffel, and M. Thielscher: Factoringgeneral games. GIGA, 2009

General Idea We Used

Decomposition

Widely recognized in AI PlanningAdapted to General Game Playing

1 How to decompose?2 How to improve search with decomposition?

Previous WorkSingle Player Games:M. Günther, S. Schiffel, and M. Thielscher: Factoringgeneral games. GIGA, 2009

Outline

1 Motivation

2 PreliminariesGame Description Language (GDL)

3 Subgame Detection

Game Description Language (GDL)

What is GDL

Variant of Datalog (Prolog)Purely axiomatic (no algebra and arithmetics)Expressive power

1 n-player (n ≥ 1)2 deterministic3 perfect information

How to describe games in DGL

Player: role(xplayer).State: set of terms (fluents), {cell(1,1,b),...}

initial state: init(cell(1,1,b)).

Action: legal(Player,Action)⇐ true(cell(1,1,b)),...Transition: next(F)⇐ does(...),true(...),...Termination: terminal⇐ line(x).Goal/Payoff: goal(xplayer, 100)⇐ line(x).

Keywords: role, init, legal, does, next, terminal, goal, and true.

Game Nim in GDL

( role player1 )( role player2 )

( i n i t ( heap a 1 ) )( i n i t ( heap b 2 ) )( i n i t ( c o n t r o l p layer1 ) )

(<= ( legal ?p ( reduce ?x ?n ) )( true ( c o n t r o l ?p ) )( true ( heap ?x ?m) )( sma l le r ?n ?m) )

(<= ( next ( heap ?x ?n ) )( does ?p ( reduce ?x ?n ) ) )

. . .(<= terminal

( true ( heap a 0 ) )( true ( heap b 0 ) ) )

(<= ( goal ?p 0)( true ( c o n t r o l ?p ) ) )

Outline

1 Motivation

2 Preliminaries

3 Subgame DetectionBasic Idea

Basic Idea

Definitions

Definition(Game). A game is a tuple G = (F , A, I, R) where

F is a set of fluents,A is a set of actions,I is the initial state, a set of ground instances of F ,R is a set of roles.

Basic Idea

Definitions

Definition(Subgame). A game G = (F , A, I, R) is a subgame ofG′ = (F ′, A′, I′, R′) iff F ⊆ F ′, A ⊆ A′, I ⊆ I′, R ⊆ R′, and F , A, Iand R are not empty.

Definition(Subgame Independence). Two subgamesGs = (Fs, As, Is, Rs) and Gs′ = (Fs′, As′, Is′, Rs′) of game Gare independent each other iff Fs ∩ Fs′ = � and As ∩ As′ = �.

Basic Idea

Dependency Relations between Fluents and Actions

Potential Precondition (e.g.)

(<= ( legal ?p ( reduce ?x ?n ) )( true ( heap ?x ?m) ) . . . )

(heap ?x ?m) is aprecondition of (reduce ?x?n)

Potential Positive Effect (e.g.)

(reduce ?x ?n) is a positiveeffect of (heap ?x ?m)

Potential Negative Effect

F is a negative effect of M if F is true now, and F might be falseafter execution of M

Basic Idea

Finding Independent Subgames

Subgames

connected components of fluents and actions

Basic Idea

Problemsubgames share fluent and action names

obvious subgames (heap a and heap b) are still connected

Basic Idea

Solutionargument instantiation

fluent: argument’s value does NOT change in the wholegamemove: refers to an instantiated argument of a fluent

Outline

1 Motivation

2 Preliminaries

3 Subgame Detection

4 Solving Decomposable GamesMotivationSubgame SearchGlobal Game Search

Motivation

Decomposition Search

Main Idea1 search subgames separately (Subgame Search)2 build global plans with subgame plans (Global Game

Search)

Problemgoal and terminal conditions are NOT defined forsubgames

Motivation

Decomposition Search

Main Idea1 search subgames separately (Subgame Search)2 build global plans with subgame plans (Global Game

Search)

Problemgoal and terminal conditions are NOT defined forsubgames

Subgame Search

Local Concept (goal and terminal rule decomposition)

Local Concept detection example (double-tictactoe):(<= (goal xplayer 100) (line1 x) (line2 x))

Local Concepts: line1(x), line2(x)

Subgame Search

Turn-Move Sequence

Turn-Move Sequencea sequence of pairs (Player,Action) (a path in subgame tree) +a set of evaluations of local concepts in the subgame

4 turn-move sequences for xplayer :[(x , mark(3, 1)) ◦ (o, mark(1, 1)), Eval1][(x , mark(3, 1)) ◦ (x , mark(1, 1)), Eval2][(x , mark(1, 1)) ◦ (x , mark(3, 1)), Eval3][(x , mark(1, 1)) ◦ (o, mark(3, 1)), Eval4]

Subgame Search

The Target of Subgame Search

Sequence Simplificationremove evaluation dominated sequences (paths)

a seq is evaluation dominated if there is another similarseq with better evaluationsa set of seqs S start with a move of player p is removed

if all s ∈ S are dominated by other seqs start with othermoves of p

[(x , mark(3, 1)) ◦ (o, mark(1, 1)), Eval1][(x , mark(3, 1)) ◦ (x , mark(1, 1)), Eval2]The following two are dominated by theabove:[(x , mark(1, 1)) ◦ (x , mark(3, 1)), Eval3][(x , mark(1, 1)) ◦ (o, mark(3, 1)), Eval4]

Global Game Search

Simple Example

Using normal search methods, for each state,choose legal moves from turn-move sequences returnedfrom subgame search

Outline

1 Motivation

2 Preliminaries

3 Subgame Detection

5 Experimental Results and ConclusionExperimental ResultsConclusion

Experimental Results

Testing Results Comparison

time cost(second): for finding the first optimal strategyDS: Decomposition Search; NS: Normal Search;SGS: Subgame Search; GGS: Global Game Search

Conclusion

Done and ToDo

What we have done:1 subgame detection2 decomposition search (incl. a special version for impartial

games)

Conclusion

Done and ToDo

What we have done:1 subgame detection2 decomposition search (incl. a special version for impartial

games)What can be improved:

1 more efficient subgame detection method2 apply pruning techniques in decomposition search3 use subgame plans more efficiently in global game search

Conclusion

Thank you for your attention!

Appendix

For Further Reading

For Further Reading I

John H. ConwayOn Numbers and Games.Academic Press, 1976.

Elwyn R. Berlekamp, John H. Conway, Richard K. GuyWinning Ways 2nd Edition.2001.

Martin MüllerDecomposition search: A combinatorial games approach togame tree search, with applications to solving Goendgames1999.

Appendix

For Further Reading

For Further Reading II

M. Günther, S. Schiffel and M. ThielscherDecomposition of Single Player Games2007.

Eric SchkufzaDecomposition of Games for Efficient Reasoning2008.

Appendix

For Further Reading

Time Complexity Comparison I

Impartial and Partial GamesAssume that a game G has n subgames, G1, G2, ..., Gn withV1, V2, ..., Vn states respectively,

normal search: O(V1 ∗ V2 ∗ ... ∗ Vn)

decomposition search: O(V1 + V2 + ... + Vn)

ExampleFor double-tictactoe, the number of states is about 18!(includingrevisited states), while the state for each subgame is about∏9

n=1(2n) which is∏9

n=1(2n − 1) times smaller than 18!

Appendix

For Further Reading

Time Complexity Comparison II

Parallel GamesAssume that a parallel game G has n subgames, G1, G2, ..., Gnwith V1, V2, ..., Vn states respectively,

normal search: O(V1 ∗ V2 ∗ ... ∗ Vn)

Serial GamesAssume that a serial game G has n subgames, for subgame ithere are Vi states and Ti terminal states

normal search:O(V1 + T1 ∗V2 + T1 ∗ T2 ∗V3 + ... + T1 ∗ T2 ∗ ... ∗ Tn−1 ∗Vn)

Decomposition of Multi-Player Gamesdengji-zhao.net/publications/AI/2009/AI09_Slides.pdf ·...

Documents

Calibration of self-decomposable Lévy models · 2018-11-18 · Calibration of self-decomposable L´evy models 3 2. The model 2.1. Self-decomposable L´evy processes A real valued

Translation-Consistent Subgroup Decomposable Inequality Indicesepu/acegd2015/papers/BhargavMaharaj.pdf · Translation-Consistent Subgroup Decomposable Inequality Indices Bhargav Maharaj12

Preliminaries - Archive

Empirical Analysis of ideal recombination on random decomposable problems

Subgame Perfection Revisited Sequential Equilibriumsubgame perfect if it specifies Nash equilibrium strategies in every subgame. 3 . Subgame Perfection Revisited Game 1: • Two NE:

Nearly Complete Graphs Decomposable into Large …matousek/cla/inducedmatch8.pdfNearly Complete Graphs Decomposable into Large Induced Matchings and their Applications Noga Alon Ankur

Thesis Preliminaries

DECOMPOSABLE ORDERED GROUPS - math.unl.edumbrittenham2/classwk/990s18/public/orderings/barriga...arXiv:1402.6520v1 [math.LO] 26 Feb 2014 DECOMPOSABLE ORDERED GROUPS ELIANA BARRIGA,

Lecture 6 - Subgame Perfection

3.2.2. Dynamic Games of complete information: Backward Induction and Subgame perfection

A Decomposable Algorithm for Contour Surface Display Generation

Uncertainty Aversion and Backward Inductionfm · subgame perfection and show that the backward induction outcome breaks down in the subgame-perfect equilibrium of the centipede game,

Backward Induction and Subgame Perfection. The ...Backward Induction and Subgame Perfection. The justiﬁcation of a “folk algorithm.” By Marek M. Kaminski# Abstract: I introduce

0 Preliminaries

Subgame Perfect Equilibrium - UCLA Econ · Subgame Perfect Equilibrium One-Shot Deviation Principle Continuity at In nity In general, one-shot deviation principle does not hold for

Subgame Perfect Equilibria in In nite Stage Games: an

Credibility and Subgame Perfect Equilibriuma subgame perfect equilibrium ]In a a subgame perfect equilibrium, best responses are played in every subgames 20 Credible Threats and Promises]The

Lecture 14: Subgame Perfect Nash Equilibrium · Subgame perfect Nash equilibrium The solution concept of subgame perfect Nash equilibrium is a reﬁnement of the solution concept

TODO: Decomposable and Responsive Power Models for … · 2013. 8. 24. · Decomposable and Responsive Power Models for Multicore Processors using Performance Counters Ramon Bertran∗†,

Game Theory 2: Extensive-Form Games and Subgame Perfectionhome.uchicago.edu/bdm/pepp/gt2_handout.pdf · 2016-07-28 · Game Theory 2: Extensive-Form Games and Subgame Perfection 1/26