Comparison of Latin Hypercube and Quasi Monte Carlo Sampling Techniques

Sergei Kucherenko (1), Daniel Albrecht (2), Andrea Saltelli (2)

(1) Imperial College London, UK. Email: s.kucherenko@imperial.ac.uk
(2) The European Commission, Joint Research Centre, Ispra (VA), Italy

Outline

Monte Carlo integration methods

Latin Hypercube sampling design

Quasi Monte Carlo methods. Sobol' sequences and their properties

Comparison of sample distributions generated by different techniques

Do QMC methods lose their efficiency in higher dimensions?

Global Sensitivity Analysis and effective dimensions

Comparison results

Monte Carlo integration methods

$$ I[f] = \int_{H^n} f(x)\, dx, $$

seen as an expectation: $I[f] = E[f(x)]$.

Monte Carlo: $I_N[f] = \frac{1}{N} \sum_{i=1}^{N} f(z_i)$, where $\{z_i\}$ is a sequence of random points in $H^n$.

Error: $\varepsilon_N = |I[f] - I_N[f]|$,
$$ \varepsilon_N = \left( E(\varepsilon_N^2) \right)^{1/2} = \frac{\sigma(f)}{N^{1/2}}. $$

Convergence does not depend on dimensionality, but it is slow.

Improve MC convergence by decreasing $\sigma(f)$. Use variance reduction techniques: antithetic variables; control variates; stratified sampling -> LHS sampling.
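To make the $1/N^{1/2}$ behaviour concrete, here is a minimal Monte Carlo sketch (standard library only). The integrand $f(x) = \sum_i x_i$ over $[0,1]^5$, with exact integral 2.5, is an illustrative choice, not one of the test functions used later in the slides:

```python
import random

def f(x):
    # illustrative additive integrand; its exact integral over [0,1]^5 is 2.5
    return sum(x)

def mc_integrate(f, dim, n, rng):
    # crude Monte Carlo: average f over n uniform random points in [0,1]^dim
    return sum(f([rng.random() for _ in range(dim)]) for _ in range(n)) / n

rng = random.Random(42)
for n in (100, 1_000, 10_000):
    est = mc_integrate(f, 5, n, rng)
    print(n, round(abs(est - 2.5), 4))  # error shrinks roughly like 1/sqrt(n)
```

The error decreases by roughly a factor of 3 per tenfold increase in N, as the $\sigma(f)/N^{1/2}$ estimate predicts.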

Latin Hypercube sampling

Latin Hypercube sampling is a type of stratified sampling.

To sample N points in d dimensions:

Divide each dimension into N equal intervals => N^d subcubes.

Take one point in each of N chosen subcubes so that, when projected onto lower dimensions, the points do not overlap.

Latin Hypercube sampling

$\{\pi_k\}$, $k = 1, \dots, n$ - independent random permutations of $\{1, \dots, N\}$, each uniformly distributed over all $N!$ possible permutations.

LHS coordinates:
$$ x_i^k = \frac{\pi_k(i) - 1 + U_i^k}{N}, \quad i = 1, \dots, N, \quad k = 1, \dots, n, \quad U_i^k \sim U(0,1). $$

LHS is built by superimposing well stratified one-dimensional samples. It cannot be expected to provide good uniformity properties in an n-dimensional unit hypercube.
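The coordinate formula above transcribes directly into Python (standard library only; the sample count and dimension below are arbitrary illustrative values):

```python
import random

def latin_hypercube(n_points, n_dims, rng):
    """LHS sample: x_i^k = (pi_k(i) - 1 + U_i^k) / N, with pi_k an independent
    uniform random permutation of {1, ..., N} for each dimension k."""
    sample = [[0.0] * n_dims for _ in range(n_points)]
    for k in range(n_dims):
        perm = list(range(1, n_points + 1))
        rng.shuffle(perm)                      # pi_k
        for i in range(n_points):
            u = rng.random()                   # U_i^k ~ U(0,1)
            sample[i][k] = (perm[i] - 1 + u) / n_points
    return sample

pts = latin_hypercube(8, 2, random.Random(0))
# each 1-D projection has exactly one point per interval [j/N, (j+1)/N)
for k in range(2):
    cells = sorted(int(p[k] * 8) for p in pts)
    print(cells)  # [0, 1, 2, 3, 4, 5, 6, 7]
```

By construction every one-dimensional stratum receives exactly one point, which is the defining property of the design.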

Deficiencies of LHS sampling

1) Space is badly explored (a)

2) Possible correlation between variables (b)

3) Points cannot be sampled sequentially

=> Not suited for integration

Discrepancy. Quasi Monte Carlo.

Discrepancy is a measure of deviation from uniformity.

Definitions: $Q(y) \in H^n$, $Q(y) = [0, y_1) \times [0, y_2) \times \dots \times [0, y_n)$; $m(Q)$ - volume of $Q$; $N_Q$ - number of points in $Q$.

$$ D_N^* = \sup_{Q(y) \in H^n} \left| \frac{N_Q}{N} - m(Q) \right| $$

Random sequences: $D_N^* \le c \left( \ln \ln N / N \right)^{1/2} \sim 1/N^{1/2}$.

Low discrepancy sequences (LDS): $D_N^* \le c(n) \frac{(\ln N)^n}{N}$.

Convergence: $\varepsilon_N^{QMC} = |I[f] - I_N[f]| \le V(f)\, D_N^*$.

Asymptotically $\varepsilon_{QMC} \sim O(1/N)$, much higher than $\varepsilon_{MC} \sim O(1/N^{1/2})$, although formally $\varepsilon_{QMC} = O\!\left( \frac{(\ln N)^n}{N} \right)$.
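The supremum in $D_N^*$ is expensive to compute directly; the discrepancy plots later in these slides use the $L_2$ variant, which has a closed form (Warnock's formula). A pure-Python sketch, with a 1-D analytic sanity check:

```python
from math import prod, sqrt
import random

def l2_star_discrepancy(points):
    """Warnock's closed form for the L2 star discrepancy of points in [0,1)^d:
       D^2 = 3^{-d} - (2/N) sum_i prod_k (1 - x_ik^2)/2
             + (1/N^2) sum_{i,j} prod_k (1 - max(x_ik, x_jk))"""
    n, d = len(points), len(points[0])
    t1 = 3.0 ** (-d)
    t2 = 2.0 / n * sum(prod((1 - x * x) / 2 for x in p) for p in points)
    t3 = sum(prod(1 - max(a, b) for a, b in zip(p, q))
             for p in points for q in points) / n ** 2
    return sqrt(t1 - t2 + t3)

# sanity check: a single 1-D point at x has D^2 = x^2 - x + 1/3,
# so x = 0.5 gives D = sqrt(1/12)
print(abs(l2_star_discrepancy([[0.5]]) - sqrt(1 / 12)) < 1e-12)  # True

# discrepancy of a random 2-D sample (decreases only slowly with N)
rng = random.Random(0)
print(l2_star_discrepancy([[rng.random(), rng.random()] for _ in range(256)]))
```

The double sum makes this $O(N^2 d)$, which is fine for the sample sizes used here but would need a faster algorithm for very large N.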

QMC. Sobol' sequences

Convergence: $\varepsilon_N = O\!\left( \frac{(\ln N)^n}{N} \right)$ for all LDS.

For Sobol' LDS: $\varepsilon_N = O\!\left( \frac{(\ln N)^{n-1}}{N} \right)$, if $N = 2^k$, $k$ - integer.

Sobol' LDS:
1. Best uniformity of distribution as N goes to infinity.
2. Good distribution for fairly small initial sets.
3. A very fast computational algorithm.

"Preponderance of the experimental evidence amassed to date points to Sobol' sequences as the most effective quasi-Monte Carlo method for application in financial engineering."

Paul Glasserman, Monte Carlo Methods in Financial Engineering, Springer, 2003

Sobol' LDS. Property A and Property A'

A low-discrepancy sequence is said to satisfy Property A if for any binary segment (not an arbitrary subset) of the n-dimensional sequence of length 2^n there is exactly one point in each of the 2^n hyper-octants that result from subdividing the unit hypercube along each of its length extensions into half.

A low-discrepancy sequence is said to satisfy Property A' if for any binary segment (not an arbitrary subset) of the n-dimensional sequence of length 4^n there is exactly one point in each of the 4^n hyper-octants that result from subdividing the unit hypercube along each of its length extensions into four equal parts.

Property A    Property A'

Distributions of 4 points in two dimensions

Property A

MC -> No
LHS -> No
Sobol' -> Yes

Distributions of 16 points in two dimensions

Property A'

MC -> No
LHS -> No
Sobol' -> Yes
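These two checks can be reproduced with a minimal two-dimensional Sobol' generator. The direction integers below (m_j = 1, 3, 5, 15, 17, 51 for the second dimension; the first dimension is the van der Corput sequence in base 2) are the classic Sobol' values, but this is a bare sketch, not the SobolSeq generator used for the slides' experiments:

```python
BITS = 30  # fixed-point precision

# direction integers v_j = m_j / 2^(j+1) in 30-bit fixed point
V1 = [1 << (BITS - 1 - j) for j in range(6)]                      # dim 1
V2 = [m << (BITS - 1 - j) for j, m in enumerate([1, 3, 5, 15, 17, 51])]  # dim 2

def sobol_2d(i):
    """i-th point of the (unscrambled) 2-D Sobol' sequence, i >= 0:
    XOR of the direction numbers selected by the binary digits of i."""
    x = y = 0
    j = 0
    while i >> j:
        if (i >> j) & 1:
            x ^= V1[j]
            y ^= V2[j]
        j += 1
    return x / 2 ** BITS, y / 2 ** BITS

pts = [sobol_2d(i) for i in range(64)]

# Property A: every aligned binary segment of 2^n = 4 points has exactly
# one point per quadrant
for k in range(0, 64, 4):
    assert len({(int(2 * x), int(2 * y)) for x, y in pts[k:k + 4]}) == 4

# Property A': every aligned binary segment of 4^n = 16 points has exactly
# one point per cell of the 4 x 4 subdivision
for k in range(0, 64, 16):
    assert len({(int(4 * x), int(4 * y)) for x, y in pts[k:k + 16]}) == 16

print("Properties A and A' hold for the first 64 points")
```

The same check applied to pseudo-random or LHS points fails, which is exactly the "MC -> No, LHS -> No, Sobol' -> Yes" pattern in the figures.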

Comparison of Discrepancy I. Low Dimensions

Use standard MC and LHS generators.

Sobol' sequence generator: SobolSeq (www.broda.co.uk). Sobol' sequences satisfy Properties A and A'.

[Figures: Log(D_N_L2) vs Log2(N), N = 128 ... 32768, for MC, LHS and QMC-Sobol]

Result: QMC in low dimensions shows much smaller discrepancy than MC and LHS.

Comparison of Discrepancy II. High Dimensions

[Figure: Log(D_N_L2) vs Log2(N), N = 128 ... 32768, for MC, LHS and QMC-Sobol]

All sampling methods in high dimensions have comparable discrepancy.

Do QMC methods lose their efficiency in higher dimensions?

$$ \varepsilon_{QMC} = O\!\left( \frac{(\ln N)^n}{N} \right) $$

Asymptotically $\varepsilon_{QMC} \sim O(1/N)$, but $\frac{(\ln N)^n}{N}$ increases with $N$ until $N^* \approx \exp(n)$.

For $n = 50$, $N^* \approx 5 \cdot 10^{21}$ - not achievable for practical applications.

Is QMC better than MC and LHS in higher dimensions ($n \ge 20$)?

ANOVA decomposition and Sensitivity Indices

Consider a model $Y = f(x)$, where $x = (x_1, x_2, \dots, x_k)$ is a vector of input variables, $0 \le x_i \le 1$, and $f(x)$ is integrable.

ANOVA decomposition:
$$ Y = f(x) = f_0 + \sum_i f_i(x_i) + \sum_i \sum_{j>i} f_{ij}(x_i, x_j) + \dots + f_{1,2,\dots,k}(x_1, x_2, \dots, x_k), $$
$$ \int_0^1 f_{i_1 \dots i_s}(x_{i_1}, \dots, x_{i_s})\, dx_{i_k} = 0, \quad \forall k, \ 1 \le k \le s. $$

Variance decomposition:
$$ \sigma^2 = \sum_i \sigma_i^2 + \sum_{i<j} \sigma_{ij}^2 + \dots + \sigma_{1,2,\dots,k}^2 $$

Sobol' SI:
$$ 1 = \sum_i S_i + \sum_{i<j} S_{ij} + \sum_{i<j<l} S_{ijl} + \dots + S_{1,2,\dots,k} $$

Sobol' Sensitivity Indices (SI)

Definition: $S_{i_1 \dots i_s} = \sigma_{i_1 \dots i_s}^2 / \sigma^2$.

Partial variances: $\sigma_{i_1 \dots i_s}^2 = \int_0^1 f_{i_1 \dots i_s}^2(x_{i_1}, \dots, x_{i_s})\, dx_{i_1} \dots dx_{i_s}$.

Total variance: $\sigma^2 = \int_0^1 \left( f(x) - f_0 \right)^2 dx$.

Sensitivity indices for subsets of variables: $x = (y, z)$,
$$ \sigma_y^2 = \sum_{s=1}^{m} \sum_{(i_1 < \dots < i_s) \in K} \sigma_{i_1 \dots i_s}^2. $$

Total variance for a subset: $\left( \sigma_y^{tot} \right)^2 = \sigma^2 - \sigma_z^2$.

Corresponding global sensitivity indices:
$$ S_y = \sigma_y^2 / \sigma^2, \qquad S_y^{tot} = \left( \sigma_y^{tot} \right)^2 / \sigma^2. $$
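First-order indices $S_i$ can be estimated by Monte Carlo with the standard "pick-and-freeze" correlation trick (correlate $f(x)$ with $f$ evaluated at a second point sharing only coordinate $i$). The estimator below and the test function $f = x_1 + 2x_2$ (for which $S_1 = 1/5$, $S_2 = 4/5$ analytically) are illustrative choices, not the slides' estimator:

```python
import random

def sobol_first_order(f, dim, i, n, rng):
    """Crude pick-and-freeze MC estimate of S_i = sigma_i^2 / sigma^2:
    Cov(f(x), f(x')) / Var(f), where x' shares only coordinate i with x."""
    fx, fxp = [], []
    for _ in range(n):
        x = [rng.random() for _ in range(dim)]
        xp = [rng.random() for _ in range(dim)]
        xp[i] = x[i]                       # freeze coordinate i
        fx.append(f(x))
        fxp.append(f(xp))
    m, mp = sum(fx) / n, sum(fxp) / n
    var = sum(v * v for v in fx) / n - m * m          # total variance
    cov = sum(a * b for a, b in zip(fx, fxp)) / n - m * mp  # sigma_i^2
    return cov / var

f = lambda x: x[0] + 2 * x[1]   # Var = 5/12; S_1 = 1/5, S_2 = 4/5
rng = random.Random(1)
print(sobol_first_order(f, 2, 0, 100_000, rng))  # ~0.2
print(sobol_first_order(f, 2, 1, 100_000, rng))  # ~0.8
```

Higher-order and total indices follow the same pattern with different frozen subsets.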

Effective dimensions

Let $|u|$ be the cardinality of a set of variables $x_u$.

The effective dimension of $f(x)$ in the superposition sense is the smallest integer $d_S$ such that
$$ \sum_{0 < |u| \le d_S} S_u \ge 1 - \varepsilon, \quad \varepsilon \ll 1. $$
It means that $f(x)$ is almost a sum of $d_S$-dimensional functions.

The function $f(x)$ has effective dimension $d_T$ in the truncation sense if
$$ \sum_{u \subseteq \{1, 2, \dots, d_T\}} S_u \ge 1 - \varepsilon, \quad \varepsilon \ll 1. $$

Important property: $d_S \le d_T$.

Example: $f(x) = \sum_{i=1}^{n} x_i \;\rightarrow\; d_S = 1, \; d_T = n.$
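The example can be made concrete. For $f(x) = \sum_i x_i$ with independent $U(0,1)$ inputs only the first-order ANOVA terms survive, so $S_{\{i\}} = 1/n$ and all interaction indices vanish; the two definitions then evaluate as follows (a small sketch, with n and epsilon chosen for illustration):

```python
# Sobol' indices of f(x) = sum_i x_i, n independent U(0,1) inputs:
# only first-order terms survive, each with S_{i} = 1/n
n, eps = 10, 0.01
S = {frozenset([i]): 1.0 / n for i in range(n)}

def d_superposition(S, n, eps):
    # smallest d such that indices of order <= d sum to at least 1 - eps
    for d in range(1, n + 1):
        if sum(v for u, v in S.items() if len(u) <= d) >= 1 - eps:
            return d

def d_truncation(S, n, eps):
    # smallest d such that indices over subsets of {x_1,...,x_d} sum to >= 1 - eps
    for d in range(1, n + 1):
        if sum(v for u, v in S.items() if u <= frozenset(range(d))) >= 1 - eps:
            return d

print(d_superposition(S, n, eps), d_truncation(S, n, eps))  # 1 10
```

The gap between d_S = 1 and d_T = n is exactly the property $d_S \le d_T$ stated above.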

Classification of functions

Type A. Variables are not equally important:
$$ S_y^{tot} \gg S_z^{tot}, \quad n_y \ll n_z \;\leftrightarrow\; d_T \ll n. $$

Types B, C. Variables are equally important:
$$ S_i \approx S_j \;\leftrightarrow\; d_T \approx n. $$

Type B. Dominant low order indices:
$$ \sum_{i=1}^{n} S_i \approx 1 \;\leftrightarrow\; d_S \ll n. $$

Type C. Dominant higher order indices:
$$ \sum_{i=1}^{n} S_i \ll 1 \;\leftrightarrow\; d_S \approx n. $$

When is LHS more effective than MC?

ANOVA: $f(x) = f_0 + \sum_i f_i(x_i) + r(x)$, where $r(x)$ - higher order interaction terms.

LHS (Stein, 1987):
$$ E\!\left[ (\varepsilon_N^{LHS})^2 \right] = \frac{1}{N} \int_{H^n} r^2(x)\, dx + o\!\left( \frac{1}{N} \right). $$

MC:
$$ E\!\left[ (\varepsilon_N^{MC})^2 \right] = \frac{1}{N} \sum_i \int_0^1 f_i^2(x_i)\, dx_i + \frac{1}{N} \int_{H^n} r^2(x)\, dx. $$

If $\int_{H^n} r^2(x)\, dx$ is small (Type B functions, small $d_S$):
$$ E\!\left[ (\varepsilon_N^{LHS})^2 \right] < E\!\left[ (\varepsilon_N^{MC})^2 \right]. $$
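Stein's result is easy to see empirically. For a purely additive function $r(x) = 0$, so the $1/N$ term in the LHS error vanishes entirely; a seeded experiment (sizes are illustrative choices) comparing the spread of repeated estimates:

```python
import random
import statistics

def latin_hypercube(n, d, rng):
    # standard LHS: one point per stratum in every 1-D projection
    cols = []
    for _ in range(d):
        perm = list(range(n))
        rng.shuffle(perm)
        cols.append([(p + rng.random()) / n for p in perm])
    return list(zip(*cols))

f = lambda x: sum(x)          # purely additive => r(x) = 0, an extreme Type B case
dim, n, runs = 5, 64, 100
rng = random.Random(0)

mc_est = [sum(f([rng.random() for _ in range(dim)]) for _ in range(n)) / n
          for _ in range(runs)]
lhs_est = [sum(f(p) for p in latin_hypercube(n, dim, rng)) / n
           for _ in range(runs)]

print("MC  std:", statistics.stdev(mc_est))
print("LHS std:", statistics.stdev(lhs_est))  # far smaller than the MC spread
```

As soon as interaction terms dominate (Type C), the two spreads become comparable, in line with the classification table below.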

Classification of functions

| Function type | Description | Relationship between S_i and S_i^tot | d_T | d_S | QMC more efficient than MC | LHS more efficient than MC |
|---|---|---|---|---|---|---|
| A | A few dominant variables | S_y^tot / n_y >> S_z^tot / n_z | << n | << n | Yes | No |
| B | No unimportant subsets; only low-order interaction terms are present | S_i ≈ S_j, ∀ i, j; S_i / S_i^tot ≈ 1, ∀ i | ≈ n | << n | Yes | Yes |
| C | No unimportant subsets; high-order interaction terms are present | S_i ≈ S_j, ∀ i, j; S_i / S_i^tot << 1, ∀ i | ≈ n | ≈ n | No | No |

How to monitor convergence of MC, LHS and QMC calculations?

The root mean square error is defined as
$$ \varepsilon = \left( \frac{1}{K} \sum_{k=1}^{K} \left( I - I_N^k \right)^2 \right)^{1/2}, $$
where K is the number of independent runs.

MC and LHS: all runs should be statistically independent (use a different seed point).

QMC: for each run a different part of the Sobol' LDS was used (start from a different index number).

The root mean square error is approximated by the formula $\varepsilon \approx c N^{-\alpha}$, $0 < \alpha < 1$.

MC: $\alpha = 0.5$. QMC: $\alpha \le 1$. LHS: ?

Integration error vs. N. Type A

(a) $f(x) = \sum_{j=1}^{n} (-1)^j \prod_{i=1}^{j} x_i$, n = 360; (b) $f(x) = \prod_{i=1}^{n} \frac{|4x_i - 2| + a_i}{1 + a_i}$, n = 100.

$$ \varepsilon = \left( \frac{1}{K} \sum_{k=1}^{K} \left( I - I_N^k \right)^2 \right)^{1/2}, \qquad \varepsilon \sim N^{-\alpha}, \; 0 < \alpha < 1. $$

Type A: $S_y^{tot} \gg S_z^{tot}$, $n_y \ll n_z \leftrightarrow d_T \ll n$.

[Figures: log2(error) vs log2(N), N = 2^8 ... 2^20. Fitted exponents alpha: (a) MC 0.50, QMC-SOB 0.94, LHS 0.52; (b) MC 0.49, QMC-SOB 0.65, LHS 0.50]

Integration error. Type A

$$ \varepsilon = \left( \frac{1}{K} \sum_{k=1}^{K} \left( I - I_N^k \right)^2 \right)^{1/2}, \qquad \varepsilon \sim N^{-\alpha}, \; 0 < \alpha < 1. $$

Type A: $S_y^{tot} \gg S_z^{tot}$, $n_y \ll n_z \leftrightarrow d_T \ll n$.

| Index | Function | Dim n | Slope MC | Slope QMC | Slope LHS |
|---|---|---|---|---|---|
| 1A | $\sum_{j=1}^{n} (-1)^j \prod_{i=1}^{j} x_i$ | 360 | 0.50 | 0.94 | 0.52 |
| 2A | $\prod_{i=1}^{n} \frac{|4x_i - 2| + a_i}{1 + a_i}$, a_1 = a_2 = 0, a_3 = ... = a_100 = 6.52 | 100 | 0.49 | 0.65 | 0.50 |

Integration error vs. N. Type B

Dominant low order indices: $\sum_{i=1}^{n} S_i \approx 1 \leftrightarrow d_S \ll n$.

(a) $f(x) = \prod_{i=1}^{n} \frac{n - x_i}{n - 0.5}$, n = 360. Fitted exponents alpha: MC 0.52, QMC-SOB 0.96, LHS 0.69.

(b) $f(x) = \prod_{i=1}^{n} \left( 1 + \frac{1}{n} \right) x_i^{1/n}$, n = 360. Fitted exponents alpha: MC 0.50, QMC-SOB 0.87, LHS 0.62.

[Figures: log2(error) vs log2(N), N = 2^8 ... 2^20]

Integration error. Type B functions

Dominant low order indices: $\sum_{i=1}^{n} S_i \approx 1 \leftrightarrow d_S \ll n$.

| Index | Function | Dim n | Slope MC | Slope QMC | Slope LHS |
|---|---|---|---|---|---|
| 1B | $\prod_{i=1}^{n} \frac{n - x_i}{n - 0.5}$ | 30 | 0.52 | 0.96 | 0.69 |
| 2B | $\prod_{i=1}^{n} \left( 1 + \frac{1}{n} \right) x_i^{1/n}$ | 30 | 0.50 | 0.87 | 0.62 |
| 3B | $\prod_{i=1}^{n} \frac{|4x_i - 2| + a_i}{1 + a_i}$, a_i = 6.52 | 30 | 0.51 | 0.85 | 0.55 |

The integration error vs. N. Type C

Dominant higher order indices: $\sum_{i=1}^{n} S_i \ll 1 \leftrightarrow d_S \approx n$.

(a) $f(x) = \prod_{i=1}^{n} \frac{|4x_i - 2| + a_i}{1 + a_i}$, $a_i = 0 \;\rightarrow\; f(x) = \prod_{i=1}^{n} |4x_i - 2|$, n = 10. Fitted exponents alpha: MC 0.47, QMC-SOB 0.64, LHS 0.50.

(b) $f(x) = \prod_{i=1}^{n} \frac{x_i}{1/2}$, n = 10. Fitted exponents alpha: MC 0.49, QMC-SOB 0.68, LHS 0.51.

[Figures: log2(error) vs log2(N), N = 2^8 ... 2^20]

Integration error for Type C functions

Dominant higher order indices: $\sum_{i=1}^{n} S_i \ll 1 \leftrightarrow d_S \approx n$.

| Index | Function | Dim n | Slope MC | Slope QMC | Slope LHS |
|---|---|---|---|---|---|
| 1C | $\prod_{i=1}^{n} |4x_i - 2|$ | 10 | 0.47 | 0.64 | 0.50 |
| 2C | $\prod_{i=1}^{n} \frac{x_i}{1/2}$ | 10 | 0.49 | 0.68 | 0.51 |

The integration error vs. N. Function 1A

$$ f(x) = \sum_{j=1}^{n} (-1)^j \prod_{i=1}^{j} x_i, \quad n = 360. $$

[Figure: running MC, QMC and LHS estimates of the integral vs log2(N), against the theoretical value (approximately -1/3)]

QMC: convergence is monotonic.
MC and LHS: convergence curves are oscillating.
QMC is 30 times faster than MC and LHS.

LHS: it is not possible to incrementally add a new point while keeping the old LHS design.

Evaluation of quantiles I. Low quantile

$$ f(x) = \sum_{i=1}^{n} x_i^2, \quad x_i \sim N(0,1). $$

Distribution dimension n = 5; the $x_i$ are independent standard normal variates.

[Figure: Log(mean error of the low quantile) vs Log2(N) for MC, QMC and LHS, with exponential fits]

Low quantile (percentile of the cumulative distribution function) = 0.05.

A superior convergence of the QMC method.

Evaluation of quantiles II. High quantile

$$ f(x) = \sum_{i=1}^{n} x_i^2, \quad x_i \sim N(0,1). $$

Distribution dimension n = 5; the $x_i$ are independent standard normal variates.

[Figure: Log(mean error of the high quantile) vs Log2(N) for MC, QMC and LHS, with exponential fits]

High quantile (percentile of the cumulative distribution function) = 0.95.

QMC converges faster than MC and LHS.
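A plain-MC version of the same quantile experiment fits in a few lines (standard library only; the QMC and LHS variants compared in the figures are omitted here). Since $f(x) = \sum_{i=1}^{5} x_i^2$ with standard normal inputs is chi-squared with 5 degrees of freedom, the estimates can be checked against the standard table values 1.1455 (p = 0.05) and 11.0705 (p = 0.95):

```python
import random

def quantile_mc(n_dims, n_samples, p, rng):
    # MC estimate of the p-quantile of f(x) = sum of squares of n_dims
    # independent N(0,1) variates (chi-squared with n_dims degrees of freedom)
    vals = sorted(
        sum(rng.gauss(0.0, 1.0) ** 2 for _ in range(n_dims))
        for _ in range(n_samples)
    )
    return vals[int(p * n_samples)]

rng = random.Random(0)
low = quantile_mc(5, 100_000, 0.05, rng)
high = quantile_mc(5, 100_000, 0.95, rng)
print(low, high)  # near the chi-squared(5) quantiles 1.1455 and 11.0705
```

Replacing the pseudo-random normals with inverse-CDF-transformed Sobol' points is what gives the faster QMC convergence shown in both figures.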

Summary

Sobol' sequences possess additional uniformity properties which MC and LHS techniques do not have (Properties A and A').

Comparison of L2 discrepancies shows that the QMC method has the lowest discrepancy in low dimensions (up to 20).

QMC method outperforms MC and LHS for Type A and B functions (problems with low effective dimensions).

LHS method outperforms MC only for Type B functions.

QMC remains the most efficient method among the three techniques for non-uniform distributions.
