Upload
others
View
0
Download
0
Embed Size (px)
Citation preview
University of Waterloo
Faculty of Engineering
Global Optimization in Slope Analysis
by Simulated Annealing
Rocscience, Inc.
Toronto, Ontario
Prepared by
Su, Xiao
ID 20251755
2A Chemical EngineeringSeptember 1st , 2008
4105 Credit Pointe Dr, Mississauga, Ontario L5M 3K7
September 1st, 2008 Dr. Thomas Duever Chair of Chemical Engineering University of Waterloo Waterloo, Ontario N2L 3G1 Dear Dr Duever,
I am submitting this document, entitled “Global Optimization in Slope Analysis by Simulated Annealing” as my 2A Work Report for Rocscience, Inc. My project at the company investigated the efficiency of simulated annealing methods for finding the minima (critical failure surfaces and accompanying factors of safety) of slope stability equations. The report proposes Adaptive Simulated Annealing (ASA) as a powerful global optimization method for finding the critical failure surface.
During my 2A work term, I worked under the supervision of Dr. Brent Corkum, software development manager at Rocscience. I implemented the simulated annealing algorithm as a new optimization method in SLIDE 5.0, a slope stability analysis program developed by Rocscience. I researched the efficiency of different two SA techniques, namely the very fast simulated annealing (VFSA) and the adaptive simulated annealing (ASA). The multi-dimensional simplex method was also implemented for comparative purposes. In addition to learning how to program, I had the opportunity to study the theoretical and practical aspects of non-convex optimization, especially the performance of stochastic methods when used on multi-modal functions. I also learned soil mechanics and slope stability principles.
This report was written entirely by me and has not received any previous academic credit at Rocscience, University of Waterloo or any other institution. I would like to thank Dr Brent Corkum for his excellent supervision and Dr Reginald Hammah for invaluable references and discussion. I also thank Dr Jim Hazzard, Damir Valiulin and Jeremy Smith for their programming tips and continued help during the program development. Finally, I would like to thank Dr John Curran, Dr Thamer Yacoub, Igor Pashutinski and Ms Lesley-Ann Foulds for helpful discussions. Sincerely, Xiao Su ID 20251755
Contents
Table of Contents ii
Table of Figures iv
List of Tables iv
Summary vi
1 Introduction 1
1.1 Slope Stability Concepts . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.2 The Method of Slices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
1.3 The Need for Global Optimization . . . . . . . . . . . . . . . . . . . . . . . . 4
1.4 Simulated Annealing (SA) . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
1.5 Improvements on Simulated Annealing . . . . . . . . . . . . . . . . . . . . . 6
2 Methods 8
2.1 Very Fast Simulated Annealing (VFSA) . . . . . . . . . . . . . . . . . . . . . 9
2.2 Re-annealing and Adaptive Simulated Annealing (ASA) . . . . . . . . . . . . 10
2.3 Transformation of Variables - ASAc and ASAt . . . . . . . . . . . . . . . . . 11
2.4 Simplex . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
2.5 Auto-Refine Search (ARS) . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
3 Results 18
3.1 Verification cases for SLIDE . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
3.2 Customer Cases for SLIDE . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
4 Discussion and Explanatory Examples 23
4.1 Method Comparison — Verification Cases . . . . . . . . . . . . . . . . . . . 23
4.2 Method Comparison — Customer Cases . . . . . . . . . . . . . . . . . . . . 25
4.3 Simplex, ARS and other Optimization . . . . . . . . . . . . . . . . . . . . . 30
4.4 Annealing — Practical Considerations . . . . . . . . . . . . . . . . . . . . . 31
4.5 Annealing — General Remarks . . . . . . . . . . . . . . . . . . . . . . . . . 32
ii
5 Conclusions 39
6 Recommendations 40
List of Symbols 41
References 43
iii
List of Figures
1 Flowchart for VFSA . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
2 Flowchart for ASA . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
3 Flowchart for Simplex . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
4 Grid-search performed for Verification Case #87 . . . . . . . . . . . . . . . . 24
5 Auto-Refine Search for Verification Case #87 . . . . . . . . . . . . . . . . . 24
6 Adaptive Simulated Annealing for Verification Case #87 . . . . . . . . . . . 25
7 Grid-Search for Customer Case #4 . . . . . . . . . . . . . . . . . . . . . . . 26
8 Auto-Refine Search for Customer Case #8 . . . . . . . . . . . . . . . . . . . 27
9 Adaptive Simulated Annealing for Customer Case #8 . . . . . . . . . . . . . 28
10 Grid-Search for Customer Case #15 . . . . . . . . . . . . . . . . . . . . . . . 28
11 Auto-Refine Search for Customer Case #15 . . . . . . . . . . . . . . . . . . 29
12 Adaptive Simulated Annealing for Customer Case #15 . . . . . . . . . . . . 29
13 ASAt for Customer Case #15 . . . . . . . . . . . . . . . . . . . . . . . . . . 30
14 Generation range based on Cauchy distribution for T = 10 . . . . . . . . . . 34
15 Generation range based on Cauchy distribution for T = 0.2 . . . . . . . . . . 34
16 Generation range based on Cauchy distribution for T = 0.01 . . . . . . . . . 35
17 Convergence of ASAc for Customer Case #4 . . . . . . . . . . . . . . . . . . 36
18 (a) A function landscape in which SA methods are inefficient; (b) a fractal
landscape in which SA methods have high performance (Sorkin, 1991) . . . . 37
19 Grid Search for Customer Case #17 . . . . . . . . . . . . . . . . . . . . . . . 37
20 Convergence of ASAc for Customer Case #17 . . . . . . . . . . . . . . . . . 38
List of Tables
1 Required variables for safety factor analysis . . . . . . . . . . . . . . . . . . 3
2 Safety factors found for customer cases. Labeled in red are the safety factors
not within 0.01 of the global minimum . . . . . . . . . . . . . . . . . . . . . 19
iv
3 Safety Factors found for Customer Cases. Global minimum found are labeled
in blue, whereas minima within 0.05 of global minimum are considered satis-
factory and are labeled in red. . . . . . . . . . . . . . . . . . . . . . . . . . . 21
4 Running time for customer cases. The running time for example 41 was not
considered due to the large computational effort required for all methods to
convert Hoek-Brown material properties. . . . . . . . . . . . . . . . . . . . . 22
5 Material properties for Verification Case #87 . . . . . . . . . . . . . . . . . . 23
6 Support Properties for Verification Case #87 . . . . . . . . . . . . . . . . . . 23
v
Summary
This work investigates the efficiency of simulated annealing (SA) when applied to slope
stability analysis. The aim is to find the most critical surface along which failure of a soil
or rock mass is likely to occur. This is achieved by minimizing a cost function known as
the factor of safety F(v), for a position parameter v. Traditionally this surface is found
by a variety of methods ranging from trial and error (which relies heavily on the engineer’s
experience) to automated search routines (mostly techniques of local search).
Simulated annealing (SA) is a method of global optimization, and was first developed
for solving combinatorial systems such as the traveling salesman problem and chip placement.
The original SA has recently been applied to slope stability with varying degrees of success.
The current work investigates the performance of state-of-art SA methods such as the
very fast simulated annealing (VFSA) and adaptive simulated annealing (ASA) for circular
slip surface search. The methods were implemented in C++ and incorporated into the
SLIDE 5.032 software developed by Rocscience, Inc. These methods rely on a probabilistic
random walk, coupled with a scheme for decreasing the annealing schedules exponentially.
ASA was originally implemented with centre-point radius as control variables (ASAc). It
was then found that the use of three points on the slip surface greatly increases the efficiency
of the search - and this new version of the method is referred to as ASAt.
The multi-dimensional downhill simplex was also implemented for comparison, and all
methods are compared with the Auto-Refine Search in SLIDE. The performance of both
VFSA and ASA were comparable with ARS. However, when centre-radius parameters are
used the search is exceedingly slow. On the other hand, ASAt is able to locate the global
minimum for all verification cases and customer cases tested, within tolerable speed.
It is concluded that adaptive simulated annealing offers a robust alternative to existing
optimization methods. Major advantages of ASA include reliability and ease of implemen-
tation. The transformation of variables proved successful, and should be implemented for
all circular search algorithms. It is recommended for simulated annealing to be extended to
general non-circular surfaces, and for the local minima to be recorded for current circular
search.
vi
1 Introduction
The analysis of the stability of slopes is essential to modern society, as it examines the safety
of major human constructions such as dams, embankments and structures built along the
sides of slope. At the core of slope analysis lies the challenge of finding the geological surface
with the greatest instability or representing the most probable region in which the mass of
soil or rock will slide. The potential instability is estimated by the factor of safety F , and
critical failure is the surface with the lowest F .
The problem then becomes finding the surface with position vector v(x1, x2, x3, . . . , xn)
which minimizes the safety factor of safety function F . For examples, the parameters x1
to xn can be the centre and radius of a circular surface or the vertices of a non-circular
surface. The challenge lies in that the cost function is often highly multi-modal, and for
some slopes the function domain is strongly discontinuous. Most optimization methods,
especially gradient-based, tend to get trapped in local minima. Simulated annealing (SA),
on the other hand, is a global optimization method that allows up-hill as well as down-hill
moves, being capable of escaping the local minima.
In the current section, some basic slope stability concepts are briefly reviewed. The
original simulated annealing is then presented, and its major drawbacks are discussed, as
well as factors that led to the development of faster annealing algorithms.
1.1 Slope Stability Concepts
The factor of safety F is defined as the ratio of the total shear strength (S) available along
a slip surface to the shear stress (Sm) that is actually mobilized along the surface due to
actions of external loads:
F =S
Sm. (1)
Many methods have been developed to compute factor of safety, based on moment
and/or force equilibrium. Most of them belong to the well established class of limit equilib-
rium methods (LEM). Even though the LEMs do not consider stress-strain relationships, it
provides an accurate estimate of the factor of safety without the need for extensive knowledge
of initial conditions (Cheng et al., 2007a).
1
The LEMs generally result in a problem formulation that is a statically indeterminate.
In order to obtain a solution, different assumptions can be made on the distribution of inter-
nal forces. The quality of the solution is closely tied with the reality of these approximations.
Other techniques exist for computing factors of safety such as strength reduction methods
(SRM). These methods do not require assumptions on inter-slice shear force distributions
and employ finite element techniques (Abramson et al., 2002). However, the SRM have dis-
advantages which include long solution time and the need for detailed knowledge of boundary
conditions, the latter being unknown in many cases.
The flexibility and speed of LEMs lies in the method of slices, in which the earth mass
within the trial surface is divided into vertical slices and at each slice, force and moment
equilibrium resultants must be zero for static equilibrium (Abramson et al., 2002). The factor
of safety at each slice is taken as the factor of safety F of the slope. It results in a non-linear
equation with F as a root, and must be found through iteration. Often, the procedures
for solving the factor of safety equation do not converge (the solution “blows up”), due to
inappropriate force assumptions or negative friction values (Duncan and Wright, 2005).
1.2 The Method of Slices
The method of slices is used by most computer programs, as it can readily accommodate
complex slope geometries, variable soil conditions and the influence of external boundary
loads (Abramson et al., 2002). The method divides the slope geometry into n vertical slices,
creating 6n− 2 unknowns (Abramson et al., 2002), as summarized in Table 1.
A common assumption is that the normal force on the base of the slice acts at the
midpoint thus reducing the number of unknowns to 5n− 2. From moment, force and Mohr-
Coulomb relationships, only n − 2 assumptions are left to make the problem determinate
(Table 1). How these last degrees of freedom are handled differentiate the available methods
of analysis. The most popular methods are reviewed below (Abramson et al., 2002):
• Ordinary Method of Slices (OMS) — This method neglects all interslice forces, there-
fore relying on moment equilibrium alone. It is one of the simplest analysis procedures.
• Bishop’s Simplified Method — assumes that all interslice shear forces are zero, reducing
2
Table 1: Required variables for safety factor analysis
Equations Condition
n Moment Equilibrium for each Slice
2n Force equilibrium in two directions for each slice
n Mohr-Coulomb relationship between shear strength and normal effective stress
4n Total Number of equations
Unknowns Variable
1 Factor of safety
n Normal force at base of each slice
n Location of normal force
n Shear force at base of each slice
n− 1 Interslice force
n− 1 Inclination of interslice force
n− 1 Location of interslice force (line of thrust)
6n− 2 Total number of unknowns
the number of unknowns by n− 1. This leaves 4n− 1 degrees of freedom, leaving the
solution overdetermined as horizontal force equilibrium will not be satisfied for one
slice.
• Janbu’s Simplified Method — Janbu also assumes zero interslice shear forces. Simi-
lar to Bishop’s method, the solution is overdetermined and does not satisfy moment
equilibrium completely. Janbu presents a correction factor f0 to account for this inad-
equacy.
• Spencer’s Method — Spencer proposes a method that rigorously satisfies static equi-
librium by assuming that resultant interslice forces to have a constant, but unknown
inclination. It requires an iterative method for both the factor of safety and the in-
terslice force inclination between the slices, matching the required 4n equations. It is
considered a complete procedure (i.e. force and moment equilibrium are satisfied for
all slices).
3
1.3 The Need for Global Optimization
For all LEMs presented above, the critical failure surface is generally found by trial and error.
Many search methods were developed (Greco, 1996, Malkawi et al., 2001), but in most cases a
good initial guess is needed, and the quality of results depends on the engineers’ experience.
The factor of safety function is often highly multimodal, and it is certainly non-smooth,
being regarded as an N − p complete problem (Cheng, 2007). This multi-modality generally
results from multi-layering of the soil or mineral lenses.
Therefore, finding the critical slip surface pertains to the field of global optimization.
Well-known deterministic methods such as the gradient or Newton’s method inherently per-
form local optimization (Polyak, 1987). Further, Newton’s method does not even distinguish
minima (or maxima) from saddle points (Polyak, 1987).
A class of techniques that has been relatively successful in global search is that of the
stochastic methods. These methods are particularly useful for slope stability, as they are
generally robust and relatively insensitive to problem type.
A promising start-up is simulated annealing (SA), a powerful stochastic method for
global optimization.
1.4 Simulated Annealing (SA)
Simulated annealing (SA) was developed in the early 80s (Kirkpatrick et al., 1983) for com-
binatorial optimization. It differs from other search methods as it allows downhill as well
as uphill moves, based on the underlying similarity between the physico-chemical process of
annealing and the minimization of a function (Kirkpatrick, 1984). In annealing, a material
(i.e. metal) is heated until its molecules acquire sufficient mobility (a melted state). Then, by
decreasing the temperature slowly, the molecules undergo various configuration changes, al-
ways seeking for a lower energy state. If the decrease is sufficiently slow, a perfect crystalline
solid will form, and the system will be at its minimum energy state. If the temperature is
decreased fast, as in quenching, the molecules will collapse into an amorphous solid and have
poor physical properties (its energy state will be at a “local minimum”).
In statistical mechanics, the probability of the atom existing at an energy state can be
4
modeled by a Boltzmann distribution P (E) = exp (−E/T ). It is clear that the higher the
temperature the greater the chance of an atom of existing at a higher energy state. This
concept is implemented as a transition probability for the optimization through the famous
Metropolis criterion (Metropolis et al., 1953).
Similarly for SA, let E1 be the energy of the system at configuration 1. To model the
probability of the system changing to state 2 (at an Energy E2), the Metropolis criterion
consists in the acceptance/rejection probability P (dE) = exp (−dE/T ). Simulated annealing
uses this criterion as the acceptance probability for a random move - let the function F (x)
be defined. For the move to be accepted from x1 to x2, a random number r with uniform
distribution between (0, 1) is generated and compared with P (dF ) = exp (−dF/T ). If r <
P (dF ), then the move is accepted. If higher, the move is rejected. It can be seen that if
dF < 0 (F2 < F1), the move will always be accepted as the move is clearly downhill.
It can be seen that at high temperatures, more uphill moves will be accepted. It is the
goal of simulated annealing to start the temperature T high enough so the search method
can sample every valley of the parameter space. A “melting criterion” was initially proposed
as a 0.8 ratio between accepted and rejected moves (Kirkpatrick, 1984). Then, by reducing
the temperature the search becomes more selective and less up-hill moves are accepted, until
at T = 0 only down-hill moves are accepted.
The algorithm was soon extended to continuous functions (Corana et al., 1987). The
position vector becomes then analogous to the molecular arrangement of a metal, and the cost
function to the energy of the arrangement. The goal becomes to find the perfect arrangement
of atoms which will minimize the overall energy of a system, by performing random-walks
along the function domain.
In a more rigorous formulation, simulated annealing depends on three functions
(Van Laarhoven and Aarts, 1987, Yao, 1995):
1. A generation function GXY (Tk), which generates a state Y based on a state X at the
annealing temperature Tk.
2. An acceptance function AXY (Tk), which is the transition probability from one state to
another. This is determined by the Metropolis criterion.
5
3. An annealing schedule S(Tk), which determines the temperature reduction rule.
During the past two decades, intensive research has been invested in improving the
generation and annealing schedule, such that global convergence is guaranteed at a higher
speed.
1.5 Improvements on Simulated Annealing
The method soon became as popular as the well-known genetic algorithms (GA), being used
in various fields for image and signal processing, biology, geophysical inversion (Pei et al.,
2007) and finance (Van Laarhoven and Aarts, 1987). Similarities between the two methods
were soon drawn, due to the “natural” basis of the algorithms (Davis, 1987).
The original annealing has many issues, and is now considered in fact a simulated
quenching (SQ) algorithm as its convergence is not guaranteed for every case (Ingber, 1993).
These are mainly referred to the algorithms which use an exponentially decreasing schedule
without an adequate generation functon, such as:
T = rTk, (2)
which fails to comply with the necessary convergence conditions. This is not to say that
quenching methods are inefficient, in fact, for many specific problems in which the researchers
know the behavior of the system, simulated quenching has performances that are far superior
and should therefore be used.
Adaptive step-size for the random-walk have been proven to increase performance
(Corana et al., 1987). By increasing the step-size if the search is going well (and vice-
versa if badly), the ratio between accepted and rejected moves is kept close to 1:1. Even
though sufficiency conditions have not been proved for this algorithm, it was shown to be
efficient in finding global minima of very hard test functions, such as Rosenbrock valleys of
up to 1020 local minima.
The concept of using an adaptive step-size has been essential to the development of
better annealing algorithms. The Classical Simulated Annealing (CSA) (Geman and Geman,
1984) was the first annealing algorithm with a rigorous mathematical proof for its global
6
convergence. It was proven to converge if a Gaussian generation is used for GXY (TK),
coupled with an S(Tk) that decreases no faster than To/(log k). However, a logarithmic
decreasing schedule is considered to be too slow, and for many problems the number of
iterations required is considered as “overkill” (Ingber, 1993).
An exponentially faster variant known as fast simulated annealing (FSA) was proposed
(Szu and Hartley, 1987) which uses a Cauchy distribution as GXY (TK). It was heuristically
proven that if coupled with an annealing schedule T = T0/k, the method will converge
rapidly, due to its capabilities of performing longer jumps in the parameter space. Corana
et al. (1987), using a similar proof, proved the convergence of a schedule exponentially faster
than the fast annealing, namely T = T0/ exp (k + 1). The algorithm of VFSA by itself, even
though faster than classical SA, is still slow for many problems (Yao, 1995).
To address this, Ingber (1989) proposed a different generation temperature for each
dimension, and the re-annealing of temperature every couple of iterations. The combination
of the VFSA with a re-annealing is often referred to as adaptive simulated annealing (Chen
and Luk, 1999, Ingber, 1993). It has been shown successful in fields such as signal processing
(Chen and Luk, 1999) as well as wavelet estimation in geophysics (Velis and Ulrych, 1996).
The original simulated annealing, as proposed by Kirkpatrick (1984), was applied by
Cheng (2003) with a fair amount of success. However, in a recent comparison (Cheng et al.,
2007b), simulated annealing was found to require a large amount of iterations. Further,
parameter tuning was also found to be an issue. Due to the number of different slope
geometries, the optimization method is required to be relatively insensitive to change in
function characteristics.
The present study was designed primarily to assess the performance of state-of-art SA
methods when applied to circular search, and determine the most adequate parameters. The
methods of VFSA, ASA, Auto-refine search and simplex are compared based on speed and
reliability. It was noted that a transformation of variables from centre-point/radius to three
points on the slope greatly improves the efficiency of the search, and reasons are given based
on the theoretical framework of annealing.
7
2 Methods
The very fast simulated annealing (VFSA) and the adaptive simulated annealing (ASA)
were implemented in the SLIDE 5.032 software developed by Rocscience, Inc. SLIDE is a
2D slope stability program designed for rock or soil slopes and contains a variety of modeling
capabilities, including material shear strength, external loading, groundwater and support.
The software already employs several automated search methods, the most efficient one
being the Auto-Refine Search (ARS). The auto-refine search is a typical tailored algorithm,
specifically designed for circular surfaces, and is used as the main comparative method.
A standard down-hill search was also implemented for comparison purposes - the multi-
dimension downhill simplex method (Simplex). It is considered a strictly local search method
(Corana et al., 1987, Press et al., 2002), although it is fairly robust and often finds the global
minimum (Corana et al., 1987).
Both the auto-refine search and the simplex are deterministic methods, whereas sim-
ulated annealing is considered a stochastic method, therefore requiring the use of random-
number generators. For all purposes, pseudo-random number generation uses the CRandom-
Number class developed by Rocscience. The CRandomNumber class allows a variety of capabil-
ities and carries a variety of statistical distributions. Finally, CRandomNumber allows the use
of seeds for pseudo-random number generation, therefore making the optimization results
reproducible.
Finally, a crucial issue is the validity of the failure surfaces generated by the optimiza-
tion methods. In many cases, especially for circular search, invalid surfaces are frequently
generated due to a number of reasons (no slope intersection, convergence problems, etc).
Currently, the following error-handling was universally adopted: if an invalid surface is
generated, the optimal surface so far is retrieved and a new random-walk is performed using
this surface. If this procedure fails for a certain number of trials, the iteration breaks and
reports an error flag. This general scheme is used with few modifications for all the methods
described below.
Spencer’s method is used for all cases as it considers both moment and force equilib-
rium, and is considered the simplest of the complete equilibrium procedures (Duncan and
8
Wright, 2005). All methods were implemented with the assumption of a circular failure
surface. They were initially implemented with x, y (centre co-ordinates) and the radius r
being the control parameters.
2.1 Very Fast Simulated Annealing (VFSA)
VFSA was implemented in the Aslide.cpp file under the function void Do_Single
_Centers_VFSA(). The algorithm is derived from a standard Boltzmann simulated anneal-
ing with a more sophisticated generation function and an exponential annealing schedule. A
program flow chart is shown in Figure 1.
In the very fast simulated annealing, the generation function Y = G(X, Tk) is based on
the product of a D-dimensional Cauchy distribution. The generation is based on a random
walk for each dimension i, ruled by the following functional relationship:
Yi = Xi + ri(Li) (3)
In the former equation, L represents the characteristic length of the problem. In the original
formulation by (Ingber, 1989), Li = Ui − Bi in which Ui represents the upper bound of
the ith-dimension and Bi represents the lower bound. In the current algorithm L=diagonal
of slope geometry. If Yi is an invalid surface, it is simply discarded. The random variable
ri ∈ [0, 1] follows a Cauchy distribution, and is mapped from a uniform random variable
ui ∈ [0, 1] using the following function:
ri = sgn(ui − 0.5)Ti
[(1 +
1
Ti
)|2ui−1|− 1
](4)
The generation of the Cauchy variable and the random number is performed by the void
GetStepVFSA(). The walk is performed by the function void PerformWalkVFSA(). The
Cauchy-based generator presented above allows for different sensitivities at different temper-
atures. The simplified version of the VFSA presented here does not perform a re-annealing,
therefore Ti = T for 1 ≤ i ≤ n. This generation function allows for the sensitivity of each
dimension to the parameters; at high Ti the distribution is almost uniform, allowing for
long-range sampling. As Ti is decreased, the distribution becomes a Cauchy, and the further
9
the temperature is decreased the distribution becomes more and more centered around 0
(the lorentzian wings “fold”). For the very fast annealing a higher temperature has a similar
effect to a larger value of L. The behavior of the generation function is further investigated
in section 4.
The annealing schedule S(Tk) is performed by the function void Reduce_Temperature
_VFSA():
Ti = Tin exp (−cik1/n) (5)
The control parameter ci (in this case ci = c for all dimensions) is a user-defined
control parameter. The higher c is set, the faster the schedule is decreased. However,
without implementing the re-annealing, c should preferably be set lower (by lower we mean
between 1 and 3) to guarantee convergence.
Two stopping criterion found in literature were implemented: comparing the fopt[k]
with fopt[k − j] (Cheng et al., 2007b), where j is an integer between 0 and nepsilon. If the
difference fopt[k]− fopt[k− j] < ftol for all j, the program is stopped. In other words, if there
has not been any visible improvement for the global optimum in the previous n� consecutive
runs, the algorithm is to be stopped. An alternative that is to stop when the temperature
becomes sufficiently close to 0 — currently if the temperature T reaches 10−4 × Tin .
2.2 Re-annealing and Adaptive Simulated Annealing (ASA)
The convergence of VFSA is still slow for many applications (Ingber, 1993, Velis and Ulrych,
1996). The reason is that it assumes the same sensitivity for each dimension of the parameter,
the generation temperature used in equation 4 is identical to the acceptance function. In fact,
the parameter space can have completely different sensitivities for each dimension, therefore
it is suggested the use of different generation temperatures for each dimension Ti,gen and a
periodic re-scaling of these temperatures. By re-scaling, the temperature of the insensitive
dimensions is increased relative to the sensitive dimensions every Naccept iterations, and the
sampling range is “stretched out”. This method is often referred to as very fast simulated
re-annealing (VFSR) or adaptive simulated annealing (ASA) (Chen and Luk, 1999).
10
The sensitivity si of each parameter is given by the partial derivative of the function
with relation to the i-th dimension of the position vector:
si =∂F
∂Xi(6)
The temperature Ti,gen is linearly re-scaled proportional to the ratio of the maximum
sensitivity smax of the i-th dimensions:
Ti,gen =smaxsi
Ti,gen (7)
A flow-chart of the algorithm is presented below. At the initialization step, the accep-
tance temperature Taccept = Taccept(0) is first set to the initial value of the cost function, thus
scaling the temperature to the behavior of the cost function and the quality of the first guess.
Further, this adapts the Metropolis criterion to the problem in question, as the ratio dE/T
will be in the same order of magnitude. At every re-scaling, the initial temperature Taccept(0)
will be reset to the last accepted cost function and the acceptance temperature since the
re-scaling iteration will be reset to the best value of the function so far (fopt). The annealing
schedule is also re-scaled for the generation, by an inverse relation from equation 5.
ki = −1
clog
(Ti,gen(ki)
Ti,gen(0)
)n(8)
The acceptance counter is also re-scaled:
ka = −1
clog
(Taccept(ki)
Taccept(0)
)n(9)
This continually adapts S(Tk) to the behavior of the function. After every Ngen itera-
tions, the algorithm is annealed according to the VFSA rule (equation 5), and the generation
function is updated according to the following rule. Adaptive simulated annealing was im-
plemented in a class CASA by inheritance, and can be found under (“XiaoASA.h”). The
base class CVFSA contains the functions implemented in the VFSA.
2.3 Transformation of Variables - ASAc and ASAt
The VFSA and ASA were originally implemented with the control parameter v(X1, X2, X3),
in which X1 = x coordinate of the centre, X2 = y coordinate and X3 = r the radius of the
11
circle. However, due to the theoretical framework of ASA, it was noticed that the original
formulation required improvements. The function domain is unbounded if centre and radius
are used, which goes against the assumptions set by Ingber (1989) for a valid Ui and Li.
Therefore, a transformation of variables is suggested. Instead of the traditional centre
point and radius schemes (Duncan and Wright, 2005), it is well known that three non-colinear
points define a circle. Our control parameters become then two points on the slope surface,
and a third point determined by the tangent vector to the circle. Now, for v(X1, X2, X3),
X1 = pt1, X2 = pt2 and X2 = θ, where pt1 and pt2 are the points on the slope and θ is the
tangent angle that determines the third point. The third point forms a point tangent to the
circle, and by analytical geometry the centre and radius of the circle can be easily found.
With this transformation, the domain of the function becomes bounded. The upper
slope line is parameterized by t ranging from 0 to 1. A one-to-one mapping exists between
each point pt on the slope line and t (t↔pt). The angle θ is also bounded by the line
connecting pt1 and pt2, as well as the vertical line parallel to the y-axis. This Adaptive
Simulated Annealing with Three points (ASAt) has proven efficiency over the ASA with
centre and radius (refer to section 3). To differentiate between the methods, the original
Adaptive Simulated Annealing with Centre-radius parameters is referred to as ASAc. We
refer to the general method simply as ASA.
2.4 Simplex
The Simplex method was implemented into the Aslide.cpp file under the function void
Do_Single_Centers_Simplex(). The Simplex method was originally developed by (Nelder
and Mead, 1965) for estimating Hessian matrices. The version used in the current work was
based on the numerical recipes for C++ (Press et al., 2002), and a detailed description of
the algorithm can be found therein. A simplified flow chart is presented in Figure 3 (y[i]
represents the function value at point i).
The Simplex is known as a multi-dimensional downhill method (Press et al., 2002).
Its allows a geometric interpretation: an n-dimensional Simplex has n + 1 vertices. In two-
dimensions the simplex is a triangle and in three a tetrahedron. Each vertex is a point in the
function, and the algorithm sorts the vertices into high points and low points according to the
12
function value. The Simplex then moves by making reflections and contractions around its
vertices, tumbling downhill in an n-dimensional topography until it finds the local minimum.
When the simplex reaches a narrow valley, the method can contract itself and ooze through
the “eye of needle”, contracting itself in all directions (Press et al., 2002).
Advantages of the Simplex method include easy implementation, robustness, speed,
and the fact that it requires only function evaluations. The main features of the current
simplex method are:
1. Initialization: The Simplex method requires ndim+1 initial circles. The vertices of the
Simplex are stored in the matrix p[i][j] with 1 ≤ i ≤ 4 and 1 ≤ j
2.5 Auto-Refine Search (ARS)
The Auto-Refine Search is the main circular optimization method in SLIDE. It can be seen
as an iterative improvement method. An outline of the method is given below (Rocscience,
2006):
1. The slope is divided into a fixed number of divisions according to the user defined
Divisions Along Slope.
2. Circles are then generated between EACH PAIR OF DIVISIONS, according to the
value of the Circles Per Division.
3. The safety factors are then calculated for these circles, and the average safety factor
associated with EACH DIVISION along the slope, is recorded. This constitutes ONE
ITERATION of the Auto Refine Search method.
4. The value of Divisions To Use In Next Iteration, is then used, to determine how many
divisions along the slope, will be used to define a new, narrowed search area, in the
next iteration. Only the divisions with the lowest (average) safety factors are used, the
divisions with high safety factors are discarded from the analysis. For example, if 10
Divisions Along Slope are set with 50% divisions to use in the next iteration, then after
each iteration 5 divisions will be used. The divisions of the slope which are retained,
are then used to form a new slope poly-line. Using this new, narrowed slope surface,
steps 1 to 4 are then repeated, for the Number of Iterations specified.
14
Figure 1: Flowchart for VFSA
15
Figure 2: Flowchart for ASA
16
Figure 3: Flowchart for Simplex
17
3 Results
The optimization methods (ARS, Simplex, VFSA, ASAc and ASAt) were tested on twenty
SLIDE verification problems as well as twenty-three SLIDE customer cases. The verification
are actual engineering cases (Rocscience, 2006), which include weak layers, bolts, dams and
embankments. The cases related to circular search were selected, and the numbering from
the original files was preserved.
For the customer cases, the running time was recorded as a measure of speed. Indeed,
number of iterations is important for mathematical analysis as it is less sensitive to problem
characteristics (Sorkin, 1991). However, for practical purposes, running time (in seconds) is
paramount, as it includes all the operations performed by the engine including error-checks
and initialization of the algorithm. In some cases (as we will discuss later) a fair amount of
computational effort was wasted due to the generation of invalid surfaces.
Unless otherwise specified, the default tolerance of δ = 0.005 was used for the factor of
safety convergence.
3.1 Verification cases for SLIDE
The parameters of the optimization methods are presented below, and were kept constant
for all the verification cases:
VFSA: L = (1/8)Diagonal, Nt = 50, c = 2 and Tin = 2, with a stopping criterion of
Tstop = Tin10−5 or N� = 50.
ASAc: L =Diagonal, N� = 10, Naccept = 20, Ngen = 200, c = 2 and ftol = 0.0001.
ASAt: N� = 30, Naccept = 50, Ngen = 200 and c = 2, and ftol = 0.0001.
For Simplex, the original parameters from the Numerical Recipes was used (Press et al.,
2002). For ARS the default settings in SLIDE were used (4500 generation surfaces, Divisions
along the slope=10, Circles per division=10 and Divisions to use in the next iteration=50%).
The factor of safeties labeled in red are within 0.01 of the lowest factor of safety found.
ASA demonstrated a better capacity of finding the global minima in all verification cases,
followed by a similar performance between VFSA and the Auto-refine search. The success
rate of each method was also computed. Examples 87 and 88 are such cases, and they are
18
Table 2: Safety factors found for customer cases. Labeled in red are the safety factors not
within 0.01 of the global minimum
Verification # Auto-Refine VFSA ASAc Simplex ASAt[I]
Search
1 0.985 0.985 0.985 0.985 0.985
2 1.590 1.586 1.587 1.586 1.588
3 1.374 1.373 1.373 1.373 1.375
4 0.990 0.990 0.991 0.990 0.992
5 1.959 1.948 1.948 1.948 1.948
6 1.948 1.948 1.948 1.948 1.948
10 1.494 1.494 1.494 1.494 1.495
11 1.063 1.063 1.063 1.452 1.064
12 1.071 1.071 1.071 1.070 1.074
13 1.334 1.334 1.330 1.330 1.338
14 1.405 1.404 1.404 1.403 1.406
15 0.424 0.422 0.423 0.423 0.423
16 1.117 1.117 1.117 1.117 1.118
17 1.342 1.342 1.342 1.342 1.343
87 1.067 1.072 1.044 1.134 1.035
88 1.042 1.091 1.055 1.087 1.049
% found 94% 88% 100% 88% 100%
19
discussed in the next section. It must be noted that for some cases even though the function
values are close, the surface location was found to be completely different.
3.2 Customer Cases for SLIDE
The customer cases for SLIDE were considerably more challenging than the verification
examples, due to the multitude of slope geometries and their strong multi-layering. The
Simplex method was modified: after every down-hill simplex run, the best vertex is obtained.
Three new vertices are added to the best vertex and the algorithm is re-run. Results were
obtained both for this modified simplex as well as the standard.
The VFSA was also modified to investigate the sensitivity of the algorithm to the
characteristic length L. The VFSA algorithm was tested both with a characteristic length
of an eighth of the slope diagonal, as well as the full diagonal. ASAc was tested with the
same parameters used for the verification case.
On the other hand, the parameters for ASAt were found to be too conservative and
therefore drastically reduced speed for some special cases. Therefore, a new set of parameters
was chosen:
ASAt[I]: N� = 30, Naccept = 50, Ngen = 300, c = 2, ftol = 0.001 and δ = 0.0001.
ASAt[II]: N� = 20, Naccept = 30, Ngen = 300 and c = 2, ftol = 0.001 and δ = 0.0001.
The parameter ftol was set lower than δ = 0.0001 to prevent possible precision problems.
In Table 3, the results labeled in blue are the lowest factor of safeties found among
the methods, and the results labeled in red are within 0.05 of the global minimum. This
criterion is purely comparative, and was determined based on a range in which all surfaces
were approximately in the same location. The percentage of minima successfully found is
also summarized in Table 3.
Table 4 presents the running time in seconds for each of the verification cases. It can
be noticed that the speed of a method is highly problem dependent. In general the simplex
was found to be faster for most of the examples, followed by the auto-refine search and lastly
by the annealing methods.
20
Table 3: Safety Factors found for Customer Cases. Global minimum found are labeled in
blue, whereas minima within 0.05 of global minimum are considered satisfactory and are
labeled in red.
C. E. Simplex Simplex VFSA Auto- ASAc ASAt[I] ASAt[II]
# (3 iter) (1 iter) (FULL) Refine
1 0.921 0.921 0.763 0.769 0.763 0.769 0.780
2 1.214 1.287 1.121 1.121 1.123 1.127 1.127
3 1.271 1.450 1.23 1.278 1.275 1.280 1.281
4 1.242 1.389 1.133 0.206 0.238 0.193 0.214
5 1.412 1.444 1.408 1.436 1.409 1.415 1.415
6 1.321 1.321 1.253 1.323 1.263 1.251 1.263
7 1.525 2.167 1.211 1.175 1.175 1.184 1.184
8 1.945 2.290 1.811 2.231 1.804 1.818 1.807
9 2.992 3.485 1.237 1.221 1.239 1.234 1.233
10 1.563 1.408 1.406 1.406 1.406 1.406 1.406
11 2.292 2.292 2.282 2.283 2.286 2.287 2.287
13 2.050 2.578 1.263 1.028 0.937 1.075 0.973
14 1.506 1.510 1.079 1.130 1.114 1.146 1.114
15 0.450 0.215 0.831 0.163 0.844 0.102 0.102
16 1.441 1.306 1.262 1.260 1.284 1.311 1.287
17 1.208 Error 1.1777 1.216 1.176 1.225 1.188
19 0.704 0.651 0.651 0.658 0.651 0.651 0.651
26 2.601 3.596 2.595 2.596 2.608 2.600 2.602
34 1.280 1.682 1.176 1.156 1.152 0.813 0.812
40 1.707 1.322 0.241 0.299 0.244 0.219 0.219
41 1.008 1.069 0.663 0.664 0.662 0.613 0.597
42 0.188 0.189 0.185 0.188 0.185 0.194 0.194
Success % 32% 32% 77% 73% 86% 91% 100%
21
Table 4: Running time for customer cases. The running time for example 41 was not
considered due to the large computational effort required for all methods to convert Hoek-
Brown material properties.
C. E. Simplex Simplex VFSA Auto- ASAc ASAt[I] ASAt[II]
# (3 iter) (1 iter) (FULL) Refine
1 8 3 164 27 1092 260 134
2 96 91 167 17 273 243 92
3 185 32 178 4 459 87 79
4 108 57 367 7 178 103 174
5 3 0 58 0 117 17 12
6 39 5 2129 5 595 125 203
7 40 4 126 5 345 111 103
8 39 12 85 0 134 27 53
9 144 9 418 4 258 62 82
10 16 2 40 2 80 8 8
11 42 19 102 2 124 49 51
13 76 1 120 4 186 53 46
14 3 2 131 5 164 41 102
15 72 31 737 4 1763 6 7
16 14 13 105 6 138 17 24
17 32 17 48 2 156 38 34
19 19 1 21 3 93 15 23
26 39 13 70 1 107 22 21
34 130 22 184 8 851 124 129
40 32 15 183 3 243 28 23
42 8 3 164 27 1092 260 134
Time 55 17 290 14 388 84 84
22
4 Discussion and Explanatory Examples
4.1 Method Comparison — Verification Cases
The results in Table 2 demonstrate that for verification cases all the methods are successful.
For most cases the results were within 0.01 precision of each other. The ASA methods
were particularly efficient, finding the global minima within an accuracy range of 0.01 in
100% of the cases. Among the different cases, #87 and #88 presented the greatest variance.
Verification #87 is a problem taken from the Baseline case studied in the “Geo-synthetic
Reinforced Multi-tiered Walls”, a paper by Leshchinsky, D. and Han, J. (Rocscience, 2006).
The geometry consists of a three-tiered wall with geo-synthetic support. Both cases are based
on the same slope geometry and differ only in the material and support properties. Table 5
presents the material properties for case #87 and Table 6 presents its support properties.
Table 5: Material properties for Verification Case #87
Zone c (kPa) Φ (0) γ (kN/m3)
Reinforced and retained fill 0 34 18
Foundation soil 10 34 18
Blocks 2.5 34 18
Table 6: Support Properties for Verification Case #87
Length (m) Tensile Strength (kN/m) Pullout Strength
6.3 10.0 80%
Figure 4 displays a refined grid-search for case #87. The main challenge for search
methods are the numerous regions with invalid surfaces. There were 853 valid surfaces
against 3987 invalid surfaces, corresponding to a failure of generation rate of 82%. The most
common cause was failure of the factor of safety function to converge within required number
of iterations, occurring in 52% of the cases.
The failure surface found by ARS (Figure 5) comprises the two lowest tiers, and creates
a tension crack due to the reverse curvature of the circle. The safety factor found was 1.067.
23
Figure 4: Grid-search performed for Verification Case #87
Figure 5: Auto-Refine Search for Verification Case #87
24
The global minimum found by ASAc (Figure 6), on the other hand, encompasses all
tiers (1.055). This is a specially interesting case, as the function values are very close even
though the surfaces are different. It must be noticed that in a slope analysis this difference
may well be critical to the safety of the project.
Figure 6: Adaptive Simulated Annealing for Verification Case #87
4.2 Method Comparison — Customer Cases
The results in Table 3 show that ASAt[II] was the most successful in terms of percentage of
minima found within 0.05 of the global minimum, finding the global minimum in 100% of
the cases. The remaining annealing methods (ASAc, VFSA) were quite accurate in terms
of accuracy when compared to ARS. The simplex method proved fairly ineffective, having
significant failures in many cases (such as customer case #8). The restarts did little to
improve the search. This is unsurprising as the simplex method is in essence a downhill local
search by concept. Nevertheless the occasional global effectiveness was displayed, including
the correct minimum for customer case #15.
For most cases, the annealing methods and ARS had very similar results. Cases of
particular interest are #4, #8 and #15. It can be noticed that the VFSA and both Simplex
failed to find the region of correct answer for #4, and for #15 only the Simplex and ARS
found answers close to the global minima (Table 3). For case #8, only the annealing methods
25
Figure 7: Grid-Search for Customer Case #4
found the region with the correct factor of safety.
The grid-search (Figure 7) displays the function landscape for case #4. It can be
noticed that this case is particularly difficult, being strongly multi-modal.
ARS was able to find the minimum very fast, probably because the divisions were
refined enough to create a circle in the weak layer. ASAc was also successful in finding the
region of global minima, albeit at a lower speed. ASAt, on the other hand, tracked down the
global minimum relatively fast (Table 3 - 84s). This case shows both the general parameter-
rescaling advantages of ASA, as well as the influence on speed from the transformation of
variables.
The grid-search representations of the energy landscape might be deceiving. The col-
ored grid is a two dimensional representation of a three-dimensional function. The grid
search displays the best factor of safety from each centre, therefore the graph only shows
the best two-dimensional “slice” of the overall problem (the centre with the best radius). It
should be expected that there are at least n-times as many local minima in the complete
problem.
In several cases, the annealing methods located lower global minimum than the ARS,
including significantly different locations. For customer case #8 ARS (Figure 8) found a
26
Figure 8: Auto-Refine Search for Customer Case #8
safety factor of 2.231, whereas both ASAc (FOS=1.804 - Figure 9), VFSA (FOS=1.811) as
well as ASAt (FOS=1.807) found a more more critical failure surface.
The minimum found by ARS (2.231) comprises a large slip surface starting at the top
of the slope all the way to the base, whereas the safety factor of 1.804 found by ASAc is
restricted to approximately the upper half of the geometry (Figure 9). Again, this difference
may have crucial impact in the design of a slope.
Case #15 presents also a challenge, however the difficulty lies primarily in the discon-
tinuity of the function as opposed to its multimodality. The critical factor of safety lies in a
very narrow range close to the base of the slope, and between the initial guess and the slope
there are many invalid surfaces (Figure 8). There were 23470 invalid surfaces as opposed to
5868 valid surfaces (80% failure generation rate). Errors were due to a variety of reasons,
especially the high probability of multiple slope intersection. The critical failure surface, as
noted by the grid-search and the ARS, tends to an almost vertical tension-crack.
The failure of ASA and VFSA to converge can be explained by the large discontinuity
regions (Figure 13). The methods probably failed to generate valid surfaces around the
global minimum, therefore they were “pushed” towards the left of the slope. Centre-radius is
particularly ill-suited for this problem, as significant computation effort was wasted in error-
checks (1763 seconds - Table 6) and the search method did not span the function domain
27
Figure 9: Adaptive Simulated Annealing for Customer Case #8
Figure 10: Grid-Search for Customer Case #15
28
Figure 11: Auto-Refine Search for Customer Case #15
Figure 12: Adaptive Simulated Annealing for Customer Case #15
29
Figure 13: ASAt for Customer Case #15
properly. The transformation of variables, on the other hand, was highly successful for this
case, quickly finding the global minimum (7s - Table 6).
Finally, it was noticed that multi-layering and an apparently hard geometry do not
imply difficult optimization problems. User example # 34 looks hard but is easy, whereas
user example #15 looks easy but is extremely multi-modal. A significant amount of variance
was found in geometries with support (i.e. verification cases #87 & #88, customer case #8).
Duncan and Wright (2005) point out that reinforcements and external loads might have an
effect on the numerical stability of the solution for the complete equilibrium procedures (such
as Spencer’s method).
4.3 Simplex, ARS and other Optimization
Even though the Simplex is known to adapt to local landscapes (Nelder and Mead, 1965),
it was robust in a global search, finding 95% of the verification cases and 35% of the cus-
tomer cases. The current initialization works well for simple slopes, as noted in section 3.
The simplex was also able to find the global minimum in customer case #15 (Table 3), a
particularly complex slope.
It was noticed that the efficiency of the simplex depended significantly on how the
vertex matrix is initialized. For example, if the circles were all in the same neighborhood
30
(or maybe concentric), the search would be strongly localized. A desirable initialization will
create vertices that roughly span the function domain. As the local landscape tends towards
the global landscape, the simplex acquires global efficiency.
The ARS seems to have an efficient initialization algorithm - it involves the parameter-
ization of the slope surface and point-wise generation of circles. This concept may be used
successfully in the case of the simplex and improve the quality of the search.
It must be kept in mind that the simplex is at its core a local search (Corana et al.,
1987), and 100% global convergence should not be expected. However, the method is still
very useful: for simple slopes the simplex has shown both speed and reliability.
4.4 Annealing — Practical Considerations
In general, ASA with centre-radius were proven to be robust, failing only in a few exceptional
cases for which large amounts of failure surfaces were found. VFSA has a slightly lower but
still satisfactory performance. A further advantage of ASA is that for most problems its
parameters did not need tuning, a very important characteristic for slope problems (Cheng
et al., 2007b). ASA only requires the user to specify a control parameter c, which does not
affect the convergence of the problem.
The main drawback for the centre-radius scheme tested is speed. They are both at
least 20 times slower than ARS. This difference in speed may be unsurprising, as ARS is a
deterministic method, and simulated annealing a stochastic method. Further, there is also a
conceptual imperfection with the use of centre and radius as parameters. Due to its random
sampling nature, the domain for SA methods must be bounded. However, this is not the
case for centre and radius. It is easy to see that even for very far centers valid surfaces
can still be generated, and theoretically the parameter space cannot be completely covered.
Therefore, global convergence is not guaranteed.
The transformation to three points corrects both problems. As two points must always
be constrained to the slope-line, very few invalid surfaces are generated. The number of
degrees of freedom will be maintained, the two parametric points on the slope line and a
third angular vector to represent a third point. All these three parameters are bounded, and
the generation function can now sample the complete range of the slope if enough points are
31
generated.
Finally, it has been noted that all methods run into speed issues when Hoek-Brown
material properties are used. Most slopes tested use Mohr-Coulomb material properties,
with the exception of customer #41. The running time for the case ranged from 40 min to
6h, therefore its speed result was not included in Table 2 as it is considered an outlier.
4.5 Annealing — General Remarks
Standard simulated annealing (SA) has been successfully used on slope stability, (Cheng
et al., 2007a,b) with accuracy comparable to other general algorithms for global optimization.
Its effectiveness lies in its ability to sample large regions of the domain without falling into
a local minimum.
However, this intelligence is purely making up-hill moves as well as down-hill. Two
major drawbacks are the large number of iterations and the necessity for tuning the para-
meters for each solution case. VFSA offers a faster solution than standard annealing, with a
generation function that is highly superior. The quality of the solution from VFSA is similar
to the deterministic optimization in SLIDE, ARS.
However, VFSA is still fairly simple. Unlike genetic algorithms (Davis, 1987), the
algorithm itself does not “learn” much from the function behavior as the search proceeds.
ASA addresses both speed and effectiveness issues (Ingber, 1993). By allowing the search
method to adjust to the sensitivity of the different dimensions of the parameter space, ASA
optimizes its search range by allowing an exponentially decreasing schedule. It must be
noted that the speed of ASA does not rely on the fact that it stops “fast”, as much as the
fact that it can reliably find the local minimum within tolerable time.
The optimization performed by ASA can be understood intuitively. If the search is
going well, decrease the temperature rapidly and if the search is going badly, decrease the
temperature slowly. For ASA, high temperatures signify a larger search range and a less
selective acceptance function. This is very similar to the “dynamic stepping” (Corana et al.,
1987), in which the ratio between accepted and rejected points is used to control the step
size.
This “adaptivenes” addresses the asymptotic convergence noted by many researchers
32
(Van LaarHoven, 1988). As the search approximates the valley with the global minimum,
standard annealing methods will descend slower as most points generated will be uphill. A
stochastic method with a constant probability distribution will become slower and slower as
the method converges to the minima simply because it has a low probability of generating
points around its current state. As annealing converges in distribution, it is essential for
its search range to shrink “intelligently” as it approaches the answer so as to accelerate
convergence. This is more efficient than setting the characteristic length of the search to
be reduced to one fourth or even one tenth of the width of a weak layer, as suggested by
(Duncan and Wright, 2005). Such an approach would slow the search prohibitively, whereas
the probabilistic approach taken by ASA displays robustness as well as a tolerable speed.
The sensitivity approach currently used is more robust, as the derivatives directly
incorporate function information, whereas “acceptance rate” is more indirect. An interesting
question may arise: if the partial derivative of the function is calculated at every Naccept
points, why do we not walk downhill? It must be remembered that the partial derivative is
computed at the point with the lowest factor of safety found so far. At the Naccept iteration
the search method may be far from the current global minimum position, and the partial
derivative of the function at that point becomes now useless. We can pursue the derivational
method and compute the partial derivative at each dimension, however we would be some
type of hybrid “gradient-combinatorial” method that most probably violates the underlying
convergence conditions of annealing, although a proof can not be given at this point.
Annealing has a hierarchical underlying concept: it searches the major regions of local
minima first then refines its search over the small valleys. This is especially obvious with the
temperature-dependent Cauchy distribution as presented by Ingber. At high temperatures
the search ranges over the domain (Figure 14) whereas as the temperature is decreased the
search narrows down on the local valleys. The graphs were generated using 1000 points
distributed according to equation 4. Figure 14 displays an almost uniform distribution at
T = 10, the distribution allows the search method to run through the entire domain.
33
Figure 14: Generation range based on Cauchy distribution for T = 10
Figure 15: Generation range based on Cauchy distribution for T = 0.2
34
Figure 16: Generation range based on Cauchy distribution for T = 0.01
It can be noticed that the distribution approximates a Cauchy distribution as the T
decreases (Figures 14, 15,16), and with a decreasing “standard deviation”. The graphs can
be interpreted as the “range” of the search method, 0 being the current point and (-1, 1)
representing the lower and upper bound respectively. At high temperatures the move-sets
have a long-range, therefore sampling in a “coarse grained” fashion, similar to the FSA
(Szu and Hartley, 1987). As the temperature is reduced, the Cauchy machine refines the
search around the current state (Figure 16). It has to be noted that FSA assumes the same
temperature for generation and for all the dimensions, whereas ASA assumes a different
temperature for each dimension and a different temperature for the acceptance function. As
noted above, this suggests a certain conceptual similarity to the dynamic walk presented by
Corana et al. (1987).
The efficiency of such a method for multi-modal was validated by many of the slope
cases. For analysis purposes, the last accepted factor of safety was plotted for customer case
#4 for 4775 accepted points (Figure 17). The hierarchical convergence can be easily noticed
- at high temperatures the method samples over the space, and at low temperatures the
method refines around the local minimum.
Sorkin (1991) has argued that the effectiveness of annealing depends on how well the
move-set and the energy function are matched. The author suggests annealing is effective
35
Figure 17: Convergence of ASAc for Customer Case #4
for functions possessing a certain degree of “fractalness”, such as customer case #4 discussed
above. Fractal functions possess a hierarchical self-similarity, in other words, big valleys are
self-similar to the small valleys. It is easy to see from Figure 18 why VFSA and ASA would
be extremely efficient and gradient-dependent methods would fail.
On the other hand, Sorkin (1991) argues that for other function landscapes annealing
can be very inefficient (Figure 18). Customer # 17 might be such a case. The grid search
(Figure 19) displays two large plateaus and clear down-hill behavior from the function. ASAc
was 15 times slower than ARS. The problems that annealing faced with case #17 are general
to slopes with large regions with homogeneous material properties, for which the factor of
safeties vary very little at the local minima. For such cases, a few runs from a down-hill
method such as the simplex can easily find the answer (Table 3). Annealing methods are
then “overkill” (see convergence plot Figure 20).
Despite its efficiency, the convergence of ASA relies on the same sufficiency proofs as
CSA (Geman and Geman, 1984), FSA (Szu and Hartley, 1987) and VFSA (Ingber, 1989).
It relies on the fact that any points can be generated infinitely often in time on the func-
tion domain. It was noted by Yao (1995) that generating does not mean visiting, and the
acceptance temperature can prevent the search from visiting a region, even though points
there have been generated. Any annealing method fails if the valley with the minima is not
36
Figure 18: (a) A function landscape in which SA methods are inefficient; (b) a fractal
landscape in which SA methods have high performance (Sorkin, 1991)
Figure 19: Grid Search for Customer Case #17
37
Figure 20: Convergence of ASAc for Customer Case #17
visited (as in customer case #15). Another characteristic is that even ASA does not escape
from the “curse of dimensionality” - the exponent 1/n strongly affects convergence speed at
higher dimensions (Ingber, 1989).
Simulated annealing has shown promising results by finding the global minima in al-
most all cases tested in the current work, some which involve extremely hard minimization
problems. The accuracy of ASA is superior to the deterministic circular search methods,
and it has shown the highest reliability out of the methods tested. However, its speed is
in general lower than the ARS or the Simplex, and in many cases this does not result in a
significant improvement in accuracy.
Simulated annealing is surprisingly successful even for its simplicity, being known as
nature’s optimization. It must be remembered however, that nature is much more powerful
than any current human technology, and the optimization of crystalline structures can be
carried out by millions of atoms in very fast re-arrangements, whereas human computers can
only compute at a limited speed.
Ackley (Davis, 1987) stated in his review on bit vector function optimization that
“There is simply no best strategy for solving all possible problems”. It can be argued however,
that there can be a strategy that probably offers the best solution for most of the problems,
and simulated annealing seems to be a promising answer.
38
5 Conclusions
Based on the analysis of the global optimization methods, the following conclusions are
made:
1. Adaptive simulated annealing is a very powerful algorithm for the global optimization
of slope stability problems. With the proper control variables, ASA was successful and
relatively fast for 100% of the verification and customer cases.
2. The efficiency of circular optimization has been proven to be greater when three points
on the slope are used as control parameters (ARS and ASAt), as opposed to centre
point and radius (ASAc). The use of two points on the slope reduces invalid surface
generation, and the parameterization of the slope line allows complete sampling of the
parameter space.
3. For simple geometries (such as the verification cases) all methods work well. In general
the simplex is the fastest method, however it fails to optimize more complex slopes,
especially with multi-layering.
4. The convergence behavior of ASA was shown to be more efficient for landscapes with
fractal characteristics (highly multi-modal functions with a self-similarity).
39
6 Recommendations
Based on the analysis of the global optimization methods, the following recommendations
are made:
1. Record major local minima:
The current optimization methods only output the global minimum, in other words,
the lowest safety factor found. It is recommended that major local minima should be
also recorded Duncan and Wright (2005). In many cases they indicate regions of failure
and discrete regions where instability might occur.
2. Improve seeding for Simplex method:
The simplex proved significant robustness in the global optimization cases tested, es-
pecially for the verification cases. Further work on method is strongly recommended,
especially the use of better seeding techniques. It is recommended that its seeding to
be based on the initialization of the ARS.
3. Implementation for Parallel Processing:
The current optimization methods were implemented for running in a single processor.
However, parallel processing is recommended as for some slope geometries the number
of iterations required is very large. A second reason is that the methods run very slowly
when Hoek-Brown material properties are used, due to the computation effort used to
convert stress data.
4. Extension of Simulated Annealing for Non-Circular Surfaces:
The simulated annealing methods were proven to be very efficient for circular search
methods. Their implementation for general non-circular surfaces is strongly recom-
mended. Adaptive simulated annealing is expected to favorably compete with current
multi-dimension methods used for non-circular search.
40
List of Symbols
AXY (Tk) acceptance function
ARS Auto-Refine Search
ASA Adaptive Simulated Annealing
ASAt Adaptive Simulated Annealing with Three Points
ASAc Adaptive Simulated Annealing with centre-point/radius
c control parameter for VFSA and ASA or cohesion of soil
d standard deviation of ensemble-averaged simulation
E energy function analogous to objective function
F objective function to be minimized
ftol tolerance for stopping criterion
FSA fast simulated annealing
GXY (Tk) state-generation function
i counter for dimension
j counter for stopping criterion
k counter for each internal temperature iteration (VFSA)
ki counter for each generation temperature at i-th dimension (VFSA)
ka counter for temperature decrease(ASA)
n number of dimensions or number of simplex vertices
n(� iterations required for stopping criterion i
r random number following Cauchy distribution
si sensitivity of the function at the i-th dimension
S(Tk) annealing schedule
T general annealing temperature
Ti annealing temperature for VFSA at each dimension i
Ti,gen generation temperature for ASA at each dimension i
Ti,accept acceptance temperature for ASA at each dimension i
Taccept(0) initial acceptance temperature for ASA (equal for all dimensions)
V FSA very fast simulated annealing
41
v position vector to be evaluated by objective function
X initial position vector
Y Final position vector
Greek Letters
δ tolerance for factor of safety equation
� tolerance level
φ Angle of Friction Envelope
θ angle of tangent vector to the circular surface
42
References
Abramson, L. W., Lee, T. S., Sharma, S., and Boyce, G. M. (2002). Slope Stability and
Stabilization Methods. John Wiley & Sons, New York.
Chen, S. and Luk, B. L. (1999). Adaptive simulated annealing for optimization in signal
processing applications. Signal Processing, 79(1):117–128.
Cheng, Y. M. (2003). Location of critical failure surface and some further studies on slope
stability analysis. Computers and Geotechnics, 30(3):255–267.
Cheng, Y. M. (2007). Global optimization analysis of slope stability by simulated annealing
with dynamic bounds and dirac function. Engineering Optimization, 39(1):17–32.
Cheng, Y. M., Lansivaara, T., and Wei, W. B. (2007a). Two-dimensional slope stability
analysis by limit equilibrium and strength reduction methods. Computers and Geotechnics,
34(3):137–150.
Cheng, Y. M., Li, L., and Chi, S. C. (2007b). Performance studies on six heuristic global
optimization methods in the location of critical slip surface. Computers and Geotechnics,
34(6):462–484.
Corana, A., Marchesi, M., Martini, C., and Ridella, S. (1987). Minimizing multimodal
functions of continuous-variables with the simulated annealing algorithm. Acm Transactions
on Mathematical Software, 13(3):262–280.
Davis, L., editor (1987). Genetic Algorithms and Simulated Annealing. Research Notes in
Artificial Intelligence. Pitman Publishing, London, Great Britain.
Duncan, J. and Wright, S. (2005). Soil Strength and Slope Stability. John Wiley & Sons,
Inc., New Jersey.
Geman, S. and Geman, D. (1984). Stochastic relaxation, gibbs distributions, and the
bayesian restoration of images. Ieee Transactions on Pattern Analysis and Machine In-
telligence, 6(6):721–741.
Greco, V. R. (1996). Efficient monte carlo technique for locating critical slip surface. Journal
of Geotechnical Engineering-Asce, 122(7):517–525.
Ingber, L. (1989). Very fast simulated re-annealing. Mathematical and Computer Modelling,
12(8):967–973.
43
Ingber, L. (1993). Simulated annealing - practice versus theory. Mathematical and Computer
Modelling, 18(11):29–57.
Kirkpatrick, S. (1984). Optimization by simulated annealing — quantitative studies. Journal
of Statistical Physics, 34(5-6):975–986.
Kirkpatrick, S., Gelatt, C. D., and Vecchi, M. P. (1983). Optimization by simulated anneal-
ing. Science, 220(4598):671–680.
Malkawi, A. I. H., Hassan, W. F., and Sarma, S. K. (2001). Global search method for
locating general slip surface using monte carlo techniques. Journal of Geotechnical and
Geoenvironmental Engineering, 127(8):688–698.
Metropolis, N., Rosenbluth, A. W., Rosenbluth, M. N., Teller, A. H., and Teller, E. (1953).
Equation of state calculations by fast computing machines. Journal of Chemical Physics,
21(6):1087–1092.
Nelder, J. A. and Mead, R. (1965). A simplex-method for function minimization. Computer
Journal, 7(4):308–313.
Pei, D. H., Louie, J. N., and Pullammanappallil, S. K. (2007). Application of simulated
annealing inversion on high-frequency fundamental-mode rayleigh wave dispersion curves.
Geophysics, 72(5):R77–R85.
Polyak, B. T. (1987). Introduction to Optimization. Translation Series in Mathematics and
Engineering. Optimization Software, Inc., New York.
Press, W. H., Teukolsky, S., Vetterling, W., and Flannery, B. (2002). Numerical Recipes
C++: The Art of Scientific Computing. Press Syndicate of the University of Cambridge,
Cambridge, UK, 2nd edition.
Rocscience (2006). Slide manual. Technical report, Rocscience, Inc.
Sorkin, G. B. (1991). Efficient simulated annealing on fractal energy landscapes. Algorith-
mica, 6(3):367–418.
Szu, H. and Hartley, R. (1987). Fast simulated annealing. Physics Letters A, 122(3-4):157–
162.
Van LaarHoven, P. J. M. (1988). Theoretical and computational aspects of simulated anneal-
ing. CWI Tract. CWI: Centrum voor Wiskunde en Informatica, Amsterdam, Netherlands.
Van Laarhoven, P. J. M. and Aarts, E. H. L. (1987). Simulated Annealing: Theory and
44
Applications. Mathematics and Its Applications. D. Reidel Publishing Company, Dordretch,
Holland.
Velis, D. R. and Ulrych, T. J. (1996). Simulated annealing wavelet estimation via fourth-
order cumulant matching. Geophysics, 61(6):1939–1948.
Yao, X. (1995). A new simulated annealing algorithm. International Journal of Computer
Mathematics, 56(3-4):161–168.
45