View
214
Download
0
Embed Size (px)
Citation preview
1
CP-AI-OR-02
Gomes & Shmoys
The Promise of LP to Boost CSP Techniques for Combinatorial Problems
The Promise of LP to Boost CSP Techniques for Combinatorial Problems
Carla P. [email protected]
David [email protected]
Department of Computer Science
School of Operations Research and Industrial Engineering
Cornell University
CP-AI-OR 2002
2
CP-AI-OR-02
Gomes & Shmoys
MotivationMotivation
Increasing interest in combining Constraint Satisfaction Problem (CSP) formulations and Linear Programming (LP) based techniques for solving hard
computational problems.
Successful results for solving problems that are a mixture of linear constraints – where LP excels – and combinatorial constraints – where CSP excels.
However, surprisingly difficult to successfully integrate LP and CSP based techniques in a
purely combinatorial setting.
Example: Satisfiability
3
CP-AI-OR-02
Gomes & Shmoys
Power of RandomizationPower of Randomization
Randomization is magic ---
we have some intuitions why it works.
5
CP-AI-OR-02
Gomes & Shmoys
Outline of TalkOutline of Talk
• A purely combinatorial problem domain• Problem formulations
• CSP formulation• LP formulations
– Assignment formulation– Packing Formulation
• Randomization• Heavy-tailed behavior in combinatorial search• Approximation Algorithms for QCP
• A Hybrid Complete CSP/LP Randomized Rounding Backtrack Search Approach
• Empirical Results• Conclusions
6
CP-AI-OR-02
Gomes & Shmoys
A purely combinatorial problem domain
7
CP-AI-OR-02
Gomes & Shmoys
Quasigroups or Latin Squares:An Abstraction for Real World Applications
Quasigroups or Latin Squares:An Abstraction for Real World Applications
Gomes and Selman 97
Quasigroup or Latin Square
(Order 4)
A Quasigroup or Latin Square is an n-by-n matrix such that each row and
column is a permutation of the same n colors
68% holes
The Quasigroup or Latin Square Completion Problem (QCP):
8
CP-AI-OR-02
Gomes & Shmoys
ComplexityComplexity
Better characterization
beyond worst case?
Critically constrained
area
42% 50%20%
Complexity of Latin Square Completion
EASY AREA EASY AREA
35% 42% 50%
Time: 150 1820 165
QCP is NP-Complete
9
CP-AI-OR-02
Gomes & Shmoys
Problem Formulations
10
CP-AI-OR-02
Gomes & Shmoys
QCP as a CSPQCP as a CSP
• Variables -
• Constraints -
}...,,2,1{, njix
....,,2,1,;,, njijicellofcolorjix
....,,2,1);,,...,2,
,1,
( ninixix
ixalldiff
....,,2,1);,,...,,2
,,1
( njjnxjx
jxalldiff
)2(nO
)(nO
row
column
kijPLStsjikjix ..,,
11
CP-AI-OR-02
Gomes & Shmoys
Pure CSP approaches solve QCP instances up
to order 33 relatively well.
Higher orders (e.g.,critically constrained area)
are beyond the reach of CSP solvers.
12
CP-AI-OR-02
Gomes & Shmoys
LP Formulations
13
CP-AI-OR-02
Gomes & Shmoys
Assignment FormulationAssignment Formulation
Cubic representation of QCP
Columns
Rows
Colors
14
CP-AI-OR-02
Gomes & Shmoys
QCPAssignment Formulation
QCPAssignment Formulation
}1,0{ijkx
....,,2,1,,1,
nkjii ijkx
kj
....,,2,1,,1,, nkjik ijkxji
....,,2,1,,1,
nkjij ijkx
ki
Row/color line
Column/color line
Row/column line
kijPLStskjikji
x ..,,
1,,
nijkxnn
j ki 1 11
max
....,,2,1,,;, nkjikcolorhasjicellijkx
Max number of colored cells
15
CP-AI-OR-02
Gomes & Shmoys
Packing formulationPacking formulation
Max number of colored cells in the selected patterns
s.t. one pattern per family
a cell is covered at most by one pattern
Families of patterns
(partial patterns are not shown)
17
CP-AI-OR-02
Gomes & Shmoys
QCPPacking Formulation
QCPPacking Formulation
MkMky
,},1,0{
,
1,
kMM Mk
yk one pattern per color
at most one pattern covering each cell
kMM MkyMn
k ,||1
max
1),:( ,1
,
Mji
kMM Mk
yn
kji
Max number of colored cells
18
CP-AI-OR-02
Gomes & Shmoys
Any feasible solution to the packing LP relaxation is
also a solution to the assignment LP relaxation
The value of the assignment relaxation is at least the bound implied by the packing formulation => the packing formulation provides a tighter upper bound than the assignment formulation
Limitation – size of formulation is exponential in n. (one may apply column generation techniques)
k
MM Mky
ijkx
,
19
CP-AI-OR-02
Gomes & Shmoys
Randomization
20
CP-AI-OR-02
Gomes & Shmoys
BackgroundBackground
Stochastic strategies have been very successful in the area of local search.
Simulated annealingGenetic algorithmsTabu SearchWalksat and variants.
Limitation: inherent incomplete nature of local search methods.
21
CP-AI-OR-02
Gomes & Shmoys
Randomized variable and/or value selection – lots of different ways.
Example: randomly breaking ties in variable and/or value selection.
Compare with standard lexicographic tie-breaking.
Note: No problem maintaining the completeness of the algorithm!
Randomized backtrack searchRandomized backtrack search
22
CP-AI-OR-02
Gomes & Shmoys
Sa
mp
le m
ea
nErratic Behavior of Mean
Number runs
Empirical Evidence of Heavy-TailsEmpirical Evidence of Heavy-Tails
(*) no solution found - reached cutoff: 2000Time: (*)3011 (*)7
Easy instance – 15 % preassigned cells
Gomes et al. 97
500
2000
3500
Median = 1!
23
CP-AI-OR-02
Gomes & Shmoys
Decay of DistributionsDecay of Distributions
Standard
Exponential Decay
e.g. Normal:
Heavy-Tailed
Power Law Decay
e.g. Pareto-Levy:
0,]Pr[ 2
CsomeforxCexX
Pr[ ] ,X x Cx x 0
Power Law Decay
Standard Distribution(finite mean & variance)
Exponential Decay
Infinite variance, infinite mean
24
CP-AI-OR-02
Gomes & Shmoys
Exploiting Heavy-Tailed BehaviorExploiting Heavy-Tailed Behavior
Heavy Tailed behavior has been observed in several domains: QCP, Graph Coloring, Planning, Scheduling, Circuit synthesis, Decoding, etc.
Consequence for algorithm design:
Use restarts or parallel / interleaved runs to
exploit the extreme variance performance.
Restarts eliminate heavy-tailed behavior
70%unsolved
1-F
(x)
Un
solv
ed f
ract
ion
Number backtracks (log)
250 (62 restarts)
0.001%unsolved
25
CP-AI-OR-02
Gomes & Shmoys
Randomized backtrack search – • active research area -> very effective when combined with
no-good learning!• solved open problems• different variants of randomization/restarts, e.g., biased
probability function for variable/value selection, “jumping” to different points in the search tree
State-of-the-art Sat Solvers incorporate randomized restarts:Chaff RelsatGrasp Goldberg’s SolverQuest SatZ, SATO, …
used to verify 1/7 of a Alpha chip (Pentium IV)
26
CP-AI-OR-02
Gomes & Shmoys
Randomized Rounding
27
CP-AI-OR-02
Gomes & Shmoys
Randomized RoundingRandomized Rounding
Randomized Rounding
Solve a relaxation of combinatorial problem;
Use randomization to go from the relaxed version to the original problem;
28
CP-AI-OR-02
Gomes & Shmoys
Randomized Rounding of a 0-1 Integer Programming
Randomized Rounding of a 0-1 Integer Programming
Solve the LP relaxation;
Interpret the resulting fractional solution as providing the probability distribution over which to set the variables to 1.
Note: The resulting solution is not guaranteed to be feasible. Nevertheless, good intuition of why randomized rounding is a powerful tool.
29
CP-AI-OR-02
Gomes & Shmoys
LP Based Approximations
30
CP-AI-OR-02
Gomes & Shmoys
Approximation AlgorithmApproximation Algorithm
Assumption: Maximization problem
the value of the objective function delivered by algorithm A for input instance I.
the optimal value of the objective function for input instance I.
The performance ratio of an algorithm A is the infimum (supremum, for min) over all I of the ratio
A is an - approximation algorithm if it has performance ratio at least (at most, for min)
)(IAV
)(*
)(IV
IAV
)(*IV
31
CP-AI-OR-02
Gomes & Shmoys
Approximation AlgorithmApproximation Algorithm
For randomized algorithms we replace by
in the definition of performance ratio.
(expectation is taken over the random choices performed by the algorithm).
Note: the only randomness in the performance guarantee stems from the randomization of the algorithm itself, and not due to any probabilistic assumptions on the instance.
In general, the term approximation algorithm will denote a polynomial-time algorithm.
)(IAV
)]([ IAVE
33
CP-AI-OR-02
Gomes & Shmoys
Approximations Based on Assignment Formulation
Approximations Based on Assignment Formulation
Kumar et. al 99
Algorithm1 - at each iteration, the algorithm
solves the LP relaxation and sets to 1 the variable
closest to 1. This is an 1/3 approximation algorithm.
Algorithm 2 – at each iteration, the algorithm selects a compatible matching for a color, for which the LP relaxation places the greatest total weight.
This is an 1/2 approximation algorithm.
Experimental evaluation -> problems up to order 9.
35
CP-AI-OR-02
Gomes & Shmoys
ApproximationBased on Packing Formulation
ApproximationBased on Packing Formulation
Randomization scheme:for each color K choose a pattern with probability (so that some matching is selected for each color)
As a result we have a pattern per color.
Problem: some patterns may overlap, even though in expectation, the constraints imply that the number of matchings in which a cell is involved is 1.
*,Mk
y
36
CP-AI-OR-02
Gomes & Shmoys
Packing formulationPacking formulation
0.8
0.2
1
1 1
Max number of colored cells in the selected patterns
s.t. one pattern per family
a cell is covered at most by one pattern
37
CP-AI-OR-02
Gomes & Shmoys
(1-1/e)- ApproximationBased on Packing Formulation
(1-1/e)- ApproximationBased on Packing Formulation
Let’s assume that the PLS is completable
Z*=h
What is the expected number of cells uncolored by our randomized procedure due to overlapping conflicts?
From we can compute
So, the desired probability corresponds to the probability of a cell not be colored with any color, i.e.:
*,Mk
y
kMM
Mky
ijkx
*
,*
)*1()*2
1)(*1
1()1
*1( ijnxijx
ijx
n
k ijkx
38
CP-AI-OR-02
Gomes & Shmoys
(1-1/e)- ApproximationBased on Packing Formulation
(1-1/e)- ApproximationBased on Packing Formulation
This expression is maximized when all the are equal therefore:
So the expected number of uncolored cells is at most at least holes are expected to be filled by this technique.
*ijkx
)*1()*2
1)(*1
1()1
*1( ijnxijx
ijx
n
k ijkx
enn
n
k ijkx 1)11()
1*1(
eh he)11(
39
CP-AI-OR-02
Gomes & Shmoys
Putting all the pieces together
40
CP-AI-OR-02
Gomes & Shmoys
CSP ModelLP Model + LP Randomized RoundingHeavy-tailsWe want to maintain completeness
How do we put all the pieces together?
A HYBRID COMPLETE CSP/LP RANDOMIZED ROUNDING BACKTRACK SEARCH
41
CP-AI-OR-02
Gomes & Shmoys
HYBRID CSP/LP RANDOMIZED ROUNDING BACKTRACK SEARCH
HYBRID CSP/LP RANDOMIZED ROUNDING BACKTRACK SEARCH
Central features of algorithm:
• Complete Backtrack search algorithm
• It maintains two formulations
• CSP model
• Relaxed LP model
• LP Randomized rounding for setting values at the top of the tree
• CSP + LP inference
42
CP-AI-OR-02
Gomes & Shmoys
Variable setting
controlled by LP Randomized
RoundingCSP & LP Inference
Search & Inferencecontrolled by CSP
%LP
Interleave-LP
HYBRID CSP/LP RANDOMIZED ROUNDING BACKTRACK SEARCH
HYBRID CSP/LP RANDOMIZED ROUNDING BACKTRACK SEARCH
•Populate CSP Model• Perform propagation
•Populate LP solver•Solve LP
Adaptive CUTOFF
43
CP-AI-OR-02
Gomes & Shmoys
1. Initialize CSP model and perform propagation of constraints (Ilog Solver);
2. Solve LP model (Ilog Cplex Barrier)LP provides good heuristic guidance and pruning information for the search. However solving the LP is relatively expensive.
3. Two parameters control the LP effort%LP – this parameter controls the percentage of variables set based on the LP rounding (%LP=0 pure CSP strategy)Interleave-LP – sets the frequency in which we re-solve the LP.
4. Randomized rounding scheme: rank variables according to the LP value. Select the highest ranked variable and set its value to 1 with probability p given by its LP value. With probability (1-p), randomly select a color form the colors allowed in the CSP model.
5. Perform propagation CSP propagation after each variable setting. (A total of Interleave-LP variables is assigned this way without resolving the LP)
6. Use a cutoff value to restart the sercah (keep increasing it to maintain completeness)
HYBRID CSP/LP RANDOMIZED ROUNDING BACKTRACK SEARCH
HYBRID CSP/LP RANDOMIZED ROUNDING BACKTRACK SEARCH
44
CP-AI-OR-02
Gomes & Shmoys
Empirical Results
45
CP-AI-OR-02
Gomes & Shmoys
Time PerformanceTime Performance
46
CP-AI-OR-02
Gomes & Shmoys
Performance in BacktracksPerformance in Backtracks
47
CP-AI-OR-02
Gomes & Shmoys
PerformancePerformance
With the hybrid strategy we also solve instances of order 40 in critically constrained area – out of reach for pure CSP;
We even solved a few balanced instances of order 50 in the critically constrained order!
• more systematic experimentation is required to better understand limitations and strengths of approach.
48
CP-AI-OR-02
Gomes & Shmoys
Conclusions
49
CP-AI-OR-02
Gomes & Shmoys
ConclusionsConclusions
Approximations based on LP randomized rounding (variable/value setting) + CSP propagation --- very powerful.
Combating heavy-tails of backtrack search through randomization --- very effective.
Consequence: New ways of designing algorithms - aim for
strategies which have highly asymmetric distributions that can be exploited using restarts, portfolios of algorithms, and interleaved/parallel runs.
General approach holds promise for a range of combinatorial problems
Final TAKE HOME MESSAGE
Randomization does not incomplete search !!!
50
CP-AI-OR-02
Gomes & Shmoys
www.cs.cornell.edu/gomeswww.orie.cornell.edu/~shmoys
Check also:
www.cis.cornell.edu/iisi
www.cs.cornell.edu/gomeswww.orie.cornell.edu/~shmoys
Check also:
www.cis.cornell.edu/iisi
Demos, papers, etc.
51
CP-AI-OR-02
Gomes & Shmoys
Eighth International Conference on the
Principles and Practice of
Constraint Programming
September 7-13
Cornell, Ithaca NY
CP 2002