Simulation and Random Number Generation
Summary
- Discrete Time vs. Discrete Event Simulation
- Random number generation
  - Generating a random sequence
  - Generating random variates from a Uniform distribution
  - Testing the quality of the random number generator
- Some probability results
- Evaluating integrals using Monte-Carlo simulation
- Generating random numbers from various distributions
  - Generating discrete random variates from a given pmf
  - Generating continuous random variates from a given distribution
Discrete Time vs. Discrete Event Simulation
Solve the finite difference equation

x_{k+1} = min{ 100, x_k + 0.1 x_k x_{k-1} log(u_k) }
Given the initial conditions x0, x-1 and the input function uk, k=0,1,… we can simulate the output!
This is a time driven simulation.
For the systems we are interested in, this method has two main problems.
[Figure: simulated trajectory x(k) versus time step k]
Issues with Time-Driven Simulation
System Input

u_1(t) = 1 if a parcel arrives at t, 0 otherwise
u_2(t) = 1 if a truck arrives at t, 0 otherwise
System Dynamics

x(t + Δt) = x(t) + 1  if u_1(t) = 1, u_2(t) = 0
            x(t) - 1  if u_1(t) = 0, u_2(t) = 1
            x(t)      otherwise
[Figure: sample paths of u_1(t), u_2(t), and the state x(t) over time, sampled every Δt]
- Inefficient: at most time steps nothing happens.
- Accuracy: the order of events within each interval Δt is lost.
Time Driven vs. Event Driven Simulation Models
Time Driven Dynamics
In this case, time is divided into intervals of length Δt, and the state is updated at every step:

x(t + Δt) = x(t) + 1  if u_1(t) = 1, u_2(t) = 0
            x(t) - 1  if u_1(t) = 0, u_2(t) = 1
            x(t)      otherwise

Event Driven Dynamics
The state is updated only at the occurrence of a discrete event e:

f(x, e) = x + 1  if e = a (arrival)
          x - 1  if e = d (departure)
Pseudo-Random Number Generation
It is difficult to construct a truly random number generator. Random number generators consist of a sequence of
deterministically generated numbers that “look” random. A deterministic sequence can never be truly random. This property, though seemingly problematic, is actually desirable
because it is possible to reproduce the results of an experiment!
Multiplicative Congruential Method
x_{k+1} = a x_k mod m

The constants a and m should satisfy the following criteria:
- For any initial seed x_0, the resulting sequence should look random.
- For any initial seed, the period (the number of variates that can be generated before the sequence repeats) should be large.
- The values can be computed efficiently on a digital computer.
Pseudo-Random Number Generation
To fit the above requirements, m should be chosen to be a large prime number. Good choices for the parameters are:
- m = 2^31 - 1 and a = 7^5 for 32-bit word machines
- m = 2^35 - 1 and a = 5^5 for 36-bit word machines
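For illustration only (not part of the original notes), a minimal Python sketch of a multiplicative congruential generator using the 32-bit parameters above:

# Minimal multiplicative congruential generator sketch.
# Parameters m = 2**31 - 1 and a = 7**5, as suggested above for 32-bit machines.
def mcg(seed, n, a=7**5, m=2**31 - 1):
    """Return n pseudo-random variates u_k = x_k / m in [0, 1)."""
    x = seed
    us = []
    for _ in range(n):
        x = (a * x) % m          # x_{k+1} = a * x_k mod m
        us.append(x / m)         # map to the interval [0, 1)
    return us

print(mcg(seed=12345, n=5))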
Mixed Congruential Method
x_{k+1} = (a x_k + c) mod m

A more general form:

x_k = ( sum_{j=1}^{n} a_j x_{k-j} ) mod m

where n is a positive constant.

u_k = x_k / m maps the random variates into the interval [0,1).
Quality of the Random Number Generator (RNG)
The random variates u_k = x_k / m should be uniformly distributed in the interval [0,1):
- Divide the interval [0,1) into k subintervals and count the number of variates that fall within each subinterval.
- Run the chi-square test.

The sequence of random variates should be uncorrelated:
- Define the auto-correlation coefficient with lag k > 0,

  C_k = (1/(n-k)) sum_{i=1}^{n-k} (u_i - 1/2)(u_{i+k} - 1/2)

  where u_i is the i-th random variate in the interval [0,1), and verify that E[C_k] approaches 0 for all k = 1, 2, ...
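A minimal Python sketch of the lag-k autocorrelation check (added for illustration; the generator used here is Python's built-in random, not the congruential one above):

import random

def autocorr(us, k):
    """Lag-k auto-correlation coefficient C_k for variates us in [0, 1)."""
    n = len(us)
    return sum((us[i] - 0.5) * (us[i + k] - 0.5) for i in range(n - k)) / (n - k)

us = [random.random() for _ in range(10000)]
print([round(autocorr(us, k), 4) for k in range(1, 6)])  # all values should be close to 0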
Some Probability Review/Results
Probability mass functions

[Figure: pmf of a discrete random variable X taking values 1, 2, 3 with Pr{X=1} = p_1, Pr{X=2} = p_2, Pr{X=3} = p_3]

sum_i Pr{X = x_i} = 1

Cumulative mass functions

F_X(x) = Pr{X <= x} = sum_{x_i <= x} Pr{X = x_i}

[Figure: cumulative mass function stepping through the values p_1, p_1 + p_2, p_1 + p_2 + p_3 = 1]
Some Probability Results/Review
Probability density functions

For a continuous random variable, Pr{X = x_1} = 0, while

Pr{x_1 <= X <= x_1 + Δx} = integral from x_1 to x_1 + Δx of f_X(x) dx

[Figure: density f_X(x) with the area under the curve around x_1 shaded]

Cumulative distribution

F_X(x) = Pr{X <= x} = integral from -infinity to x of f_X(y) dy

[Figure: F_X(x) increasing from 0 to 1]
Independence of Random Variables
Joint cumulative probability distribution:

F_XY(x, y) = Pr{X <= x, Y <= y}

Independent Random Variables:

Pr{X in C, Y in D} = Pr{X in C} Pr{Y in D}

For discrete variables:

Pr{X = x, Y = y} = Pr{X = x} Pr{Y = y}

For continuous variables:

f_XY(x, y) = f_X(x) f_Y(y),    F_XY(x, y) = F_X(x) F_Y(y)
Conditional Probability
Conditional probability for two events.
Pr{A | B} = Pr{AB} / Pr{B}

Bayes' Rule:

Pr{AB} = Pr{B | A} Pr{A},   therefore   Pr{A | B} = Pr{B | A} Pr{A} / Pr{B}

Total Probability:

Pr{X = x} = sum_k Pr{X = x | Y = y_k} Pr{Y = y_k}
Expectation and Variance
Expected value of a random variable

E[X] = integral of x f_X(x) dx          (continuous)
E[X] = sum_i x_i Pr{X = x_i}            (discrete)

Expected value of a function of a random variable

E[g(X)] = integral of g(x) f_X(x) dx    (continuous)
E[g(X)] = sum_i g(x_i) Pr{X = x_i}      (discrete)

Variance of a random variable

var(X) = E[ (X - E[X])^2 ]
Covariance of two random variables
The covariance between two random variables is defined as

cov(X, Y) = E[ (X - E[X]) (Y - E[Y]) ]
          = E[ XY - X E[Y] - Y E[X] + E[X] E[Y] ]
          = E[XY] - E[X] E[Y]

The covariance between two independent random variables is 0 (why?):

E[XY] = sum_i sum_j x_i y_j Pr{X = x_i, Y = y_j}
      = ( sum_i x_i Pr{X = x_i} ) ( sum_j y_j Pr{Y = y_j} ) = E[X] E[Y]
Markov Inequality
If the random variable X takes only non-negative values, then for any value a > 0.
Pr{X >= a} <= E[X] / a
Note that for any value 0<a < E[X], the above inequality says nothing!
Chebyshev’s Inequality
If X is a random variable having mean μ and variance σ2, then for any value k > 0.
Pr{ |X - μ| >= kσ } <= 1 / k^2

[Figure: density f_X(x) with mean μ and the tails beyond μ ± kσ shaded]
This may be a very conservative bound! Note that for X~N(0,σ2), for k=2, from Chebyshev’s inequality Pr{.}<0.25, when in
fact the actual value is Pr{.}< 0.05
The Law of Large Numbers
Weak: Let X1, X2, … be a sequence of independent and identically distributed random variables having mean μ. Then, for any ε >0
Pr{ | (X_1 + ... + X_n)/n - μ | > ε } -> 0   as n -> infinity

Strong: with probability 1,

lim_{n -> infinity} (X_1 + ... + X_n)/n = μ
The Central Limit Theorem
Let X_1, X_2, ..., X_n be a sequence of independent random variables with means μ_i and variances σ_i^2. Then, define the random variable

X = X_1 + ... + X_n

Let

μ = E[X] = E[X_1] + ... + E[X_n] = sum_{i=1}^{n} μ_i

σ^2 = var(X) = sum_{i=1}^{n} σ_i^2

Then, the distribution of X approaches the normal distribution as n increases, and if the X_i are continuous, then

f_X(x) ≈ (1 / sqrt(2π σ^2)) exp( -(x - μ)^2 / (2σ^2) )
Chi-Square Test
Let k be the number of subintervals, thus pi=1/k, i=1,…,k.
Let Ni be the number of samples in each subinterval. Note that E[Ni]=Npi where N is the total number of samples
Null Hypothesis H0: the observed random variates are uniformly distributed in (0,1).

Let T be

T = sum_{i=1}^{k} (N_i - N p_i)^2 / (N p_i) = (k/N) sum_{i=1}^{k} (N_i - N/k)^2

Define the p-value = P_{H0}{T > t}, i.e., the probability of observing a value at least as large as the observed t assuming H0 is correct.
For large N, T is approximated by a chi-square distribution with k-1 degrees of freedom, thus we can use this approximation to evaluate the p-value
H0 is accepted if the p-value is greater than 0.05 (or 0.01, depending on the desired significance level).
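A minimal Python sketch of this uniformity test (illustrative only; scipy.stats.chi2 is assumed to be available for the chi-square tail probability):

import random
from scipy.stats import chi2

def chi_square_uniformity(us, k=10):
    """Chi-square test that variates us are uniform on [0, 1), using k subintervals."""
    N = len(us)
    counts = [0] * k
    for u in us:
        counts[min(int(u * k), k - 1)] += 1
    T = sum((n_i - N / k) ** 2 for n_i in counts) / (N / k)
    p_value = 1.0 - chi2.cdf(T, df=k - 1)   # chi-square approximation, valid for large N
    return T, p_value

us = [random.random() for _ in range(10000)]
print(chi_square_uniformity(us))   # accept H0 if the p-value is greater than 0.05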
Monte-Carlo Approach to Evaluating Integrals
Suppose that you want to estimate

θ = integral from 0 to 1 of g(x) dx

however it is rather difficult to evaluate the integral analytically. Suppose also that you don't want to use numerical integration.

Let U be a uniformly distributed random variable in the interval [0,1) and let u_i be random variates from the same distribution. Consider the following estimator:

θ_hat = (1/n) sum_{i=1}^{n} g(u_i)

Also note that, by the Strong Law of Large Numbers,

θ_hat -> E[g(U)]  as n -> infinity,   and   E[g(U)] = integral from 0 to 1 of g(u) du = θ
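A minimal Python sketch of this estimator (illustrative; the integrand used here is just an example, not from the notes):

import math
import random

def mc_integral(g, n=100000):
    """Estimate the integral of g over [0, 1) by averaging g at n uniform variates."""
    return sum(g(random.random()) for _ in range(n)) / n

# Example: the integral of exp(x) over [0, 1] is e - 1 ≈ 1.71828
print(mc_integral(math.exp))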
Monte-Carlo Approach to Evaluating Integrals
Use Monte-Carlo simulation to estimate the following integral.
θ = integral from a to b of g(x) dx

Let y = (x - a)/(b - a); then the integral becomes

θ = integral from 0 to 1 of g(a + y(b - a)) (b - a) dy = integral from 0 to 1 of h(y) dy

What if

θ = integral from 0 to infinity of g(x) dx ?

Use the substitution y = 1/(1 + x).

What if the integral is multidimensional?

θ = integral from 0 to 1 ... integral from 0 to 1 of g(x_1, ..., x_n) dx_1 ... dx_n
Example: Estimate the value of π.
[Figure: unit-radius circle inscribed in the square with corners (-1,-1), (1,-1), (1,1), (-1,1)]

Area of circle / Area of square = π / 4

Let X, Y be independent random variables uniformly distributed in the interval [-1,1]. The probability that a point (X, Y) falls in the circle is given by

Pr{ X^2 + Y^2 <= 1 } = π / 4
SOLUTION
- Generate N pairs of uniformly distributed random variates (u_1, u_2) in the interval [0,1).
- Transform them to become uniform over the interval [-1,1) using (2u_1 - 1, 2u_2 - 1).
- Form the ratio of the number of points that fall in the circle over N; multiplying this ratio by 4 gives an estimate of π.
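A minimal Python sketch of the procedure just described:

import random

def estimate_pi(N=100000):
    """Estimate pi from the fraction of uniform points in [-1,1)^2 that fall in the unit circle."""
    inside = 0
    for _ in range(N):
        x = 2 * random.random() - 1   # uniform on [-1, 1)
        y = 2 * random.random() - 1
        if x * x + y * y <= 1:
            inside += 1
    return 4 * inside / N

print(estimate_pi())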
Discrete Random Variates
Suppose we would like to generate a sequence of discrete random variates according to a probability mass function

Pr{X = x_j} = p_j,   j = 0, 1, ..., N,   with   sum_{j=0}^{N} p_j = 1

[Figure: cumulative distribution of X; a uniform variate u on the vertical axis is mapped to the value X on the horizontal axis]
Inverse Transform Method
X = x_0   if u < p_0
X = x_1   if p_0 <= u < p_0 + p_1
...
X = x_j   if sum_{i=0}^{j-1} p_i <= u < sum_{i=0}^{j} p_i
Discrete Random Variate Algorithm (D-RNG-1)
Algorithm D-RNG-1
  Generate u = U(0,1)
  If u < p_0, set X = x_0, return;
  If u < p_0 + p_1, set X = x_1, return;
  ...
  Set X = x_n, return;
Recall the requirement for efficient implementation, thus the above search algorithm should be made as efficient as possible!
Example: Suppose that X ∈ {0, 1, ..., n} and p_0 = p_1 = ... = p_n = 1/(n+1); then

X = floor( (n+1) u )
Discrete Random Variate Algorithm
Assume p_0 = 0.1, p_1 = 0.2, p_2 = 0.4, p_3 = 0.3. What is an efficient RNG?

D-RNG-1: Version 1
  Generate u = U(0,1)
  If u < 0.1, set X = x_0, return;
  If u < 0.3, set X = x_1, return;
  If u < 0.7, set X = x_2, return;
  Set X = x_3, return;

D-RNG-1: Version 2 (more efficient)
  Generate u = U(0,1)
  If u < 0.4, set X = x_2, return;
  If u < 0.7, set X = x_3, return;
  If u < 0.9, set X = x_1, return;
  Set X = x_0, return;

Version 2 is more efficient because it checks the most probable values first, so on average fewer comparisons are needed.
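A minimal Python sketch of the inverse transform method for an arbitrary pmf (illustrative; sorting by decreasing probability implements the "efficient ordering" idea above):

import random

def discrete_inverse_transform(values, probs):
    """Draw one variate from the pmf {values[i]: probs[i]} by the inverse transform method."""
    # Check the most probable values first to minimize the expected number of comparisons.
    order = sorted(range(len(probs)), key=lambda i: -probs[i])
    u = random.random()
    cum = 0.0
    for i in order:
        cum += probs[i]
        if u < cum:
            return values[i]
    return values[order[-1]]   # guard against floating-point round-off

print(discrete_inverse_transform([0, 1, 2, 3], [0.1, 0.2, 0.4, 0.3]))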
Geometric Random Variables
Let p be the probability of success and q = 1 - p the probability of failure; then X, the time of the first success, has pmf

Pr{X = i} = p q^{i-1},   i = 1, 2, ...

Using the previous discrete random variate algorithm, X = j if

sum_{i=1}^{j-1} Pr{X = i} <= U < sum_{i=1}^{j} Pr{X = i}

Since

sum_{i=1}^{j} Pr{X = i} = 1 - Pr{X > j} = 1 - q^j

this condition becomes

1 - q^{j-1} <= U < 1 - q^j,   i.e.,   q^j < 1 - U <= q^{j-1}

Therefore

X = min{ j : q^j < 1 - U } = min{ j : j log q < log(1 - U) } = min{ j : j > log(1 - U)/log q }

(the inequality flips because log q < 0). As a result:

X = floor( log(1 - U) / log q ) + 1
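A minimal Python sketch of this geometric generator:

import math
import random

def geometric_variate(p):
    """Time of first success with success probability p, via X = floor(log(1-U)/log(q)) + 1."""
    q = 1.0 - p
    u = random.random()
    return math.floor(math.log(1.0 - u) / math.log(q)) + 1

print([geometric_variate(0.3) for _ in range(10)])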
Poisson and Binomial Distributions
Poisson Distribution with rate λ:

Pr{X = i} = e^{-λ} λ^i / i!,   i = 0, 1, ...

Note:  p_{i+1} = ( λ / (i+1) ) p_i

The binomial distribution (n, p) gives the number of successes in n trials given that the probability of success is p:

Pr{X = i} = n! / ( i! (n-i)! ) p^i (1-p)^{n-i},   i = 0, 1, ..., n

Note:  p_{i+1} = ( (n-i) / (i+1) ) ( p / (1-p) ) p_i
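A minimal Python sketch of how the Poisson recursion above can be combined with the inverse transform method (illustrative only):

import math
import random

def poisson_variate(lam):
    """Poisson(lam) variate via inverse transform, using p_{i+1} = lam/(i+1) * p_i."""
    u = random.random()
    i = 0
    p = math.exp(-lam)    # p_0 = Pr{X = 0}
    cum = p
    while u >= cum:
        p *= lam / (i + 1)   # recursion for the next pmf value
        cum += p
        i += 1
    return i

print([poisson_variate(3.0) for _ in range(10)])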
Accept/Reject Method (D-RNG-AR)
Suppose that we would like to generate random variates from a pmf {pj, j≥0} and we have an efficient way to generate variates from a pmf {qj, j≥0}.
Let c be a constant such that

p_j / q_j <= c   for all j such that p_j > 0
In this case, use the following algorithm
D-RNG-AR:
1. Generate a random variate Y from pmf {q_j, j >= 0}.
2. Generate u = U(0,1).
3. If u < p_Y / (c q_Y), set X = Y and return;
4. Else repeat from Step 1.
Accept/Reject Method (D-RNG-AR)
Show that the D-RNG-AR algorithm generates a random variate with the required pmf {pj, j≥0}.
Pr{X = i} = sum_{k=1}^{infinity} Pr{Y = i and stop after k iterations}
          = sum_{k=1}^{infinity} Pr{not stop for iterations 1, ..., k-1} Pr{Y = i and stop at iteration k}

For a single iteration,

Pr{Y = i and Y is accepted} = Pr{Y = i} Pr{Y accepted | Y = i} = q_i ( p_i / (c q_i) ) = p_i / c

Pr{Y not accepted} = sum_j q_j ( 1 - p_j / (c q_j) ) = 1 - 1/c

Therefore

Pr{X = i} = sum_{k=1}^{infinity} (1 - 1/c)^{k-1} (p_i / c) = (p_i / c) * 1 / (1 - (1 - 1/c)) = p_i
D-RNG-AR Example
Determine an algorithm for generating random variates for a random variable that take values 1,2,..,10 with probabilities 0.11, 0.12, 0.09, 0.08, 0.12, 0.10, 0.09, 0.09, 0.10, 0.10 respectively.
Use a discrete uniform proposal q_i = 1/10, i = 1, ..., 10, and take

c = max_i ( p_i / q_i ) = 0.12 / 0.1 = 1.2
D-RNG-1:
  Generate u = U(0,1)
  k = 1;
  while (u > cdf(k))
    k = k + 1;
  x(i) = k;

D-RNG-AR:
  u1 = U(0,1); u2 = U(0,1);
  Y = floor(10*u1 + 1);
  while (u2 > p(Y)/(c*q(Y)))       % q(Y) = 0.1, so the acceptance threshold is p(Y)/0.12
    u1 = U(0,1); u2 = U(0,1);
    Y = floor(10*u1 + 1);
  y(i) = Y;
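A minimal Python sketch of this accept/reject example (illustrative; the proposal is the discrete uniform on {1, ..., 10}):

import random

p = [0.11, 0.12, 0.09, 0.08, 0.12, 0.10, 0.09, 0.09, 0.10, 0.10]   # target pmf on {1,...,10}
q = 0.1                                                             # uniform proposal pmf
c = max(pi / q for pi in p)                                         # c = 1.2

def drng_ar():
    """Accept/reject sampling of X in {1,...,10} with pmf p, using a uniform proposal."""
    while True:
        Y = random.randint(1, 10)        # proposal variate from q
        u = random.random()
        if u < p[Y - 1] / (c * q):       # accept with probability p_Y / (c q_Y)
            return Y

print([drng_ar() for _ in range(10)])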
The Composition Approach
Suppose that we have an efficient way to generate variates from two pmfs {q_j, j >= 0} and {r_j, j >= 0}, and that we would like to generate random variates for a random variable having pmf

Pr{X = j} = p_j = a q_j + (1 - a) r_j,   j >= 0,   for some a ∈ (0,1).

Let X_1 have pmf {q_j, j >= 0} and X_2 have pmf {r_j, j >= 0}, and define

X = X_1 with probability a
    X_2 with probability 1 - a

Algorithm D-RNG-C:
  Generate u = U(0,1)
  If u <= a, generate X_1
  Else generate X_2
Continuous Random Variates: Inverse Transform Method

Suppose we would like to generate a sequence of continuous random variates having distribution function F_X(x).

Algorithm C-RNG-1: Let U be a random variable uniformly distributed in the interval (0,1). For any continuous distribution function, the random variate X is given by

X = F_X^{-1}(U)

[Figure: F_X(x) rising from 0 to 1; a uniform variate u on the vertical axis is mapped back to X = F_X^{-1}(u) on the horizontal axis]
Example: Exponentially Distributed Random Variable
Suppose we would like to generate a sequence of random variates having density function

f_X(x) = λ e^{-λx},   x >= 0

Solution: Find the cumulative distribution

F_X(x) = integral from 0 to x of λ e^{-λy} dy = 1 - e^{-λx}

Set it equal to a uniformly distributed random variate u:

u = F_X(x) = 1 - e^{-λx}   =>   ln(1 - u) = -λx   =>   x = -(1/λ) ln(1 - u)

Equivalently, since 1 - u is also uniformly distributed in (0,1),

x = -(1/λ) ln u
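A minimal Python sketch of this exponential generator:

import math
import random

def exponential_variate(lam):
    """Exponential(lam) variate via the inverse transform x = -ln(u)/lam."""
    u = 1.0 - random.random()    # in (0, 1], avoids log(0)
    return -math.log(u) / lam

print([round(exponential_variate(2.0), 3) for _ in range(5)])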
Convolution Techniques and the Erlang Distribution
Suppose the random variable X is the sum of a number of independent identically distributed random variables:

X = sum_{i=1}^{n} Y_i

An example of such a random variable is the Erlang of order n, which is the sum of n iid exponential random variables with rate λ:

f_X(x) = λ^n x^{n-1} e^{-λx} / (n-1)!

Algorithm C-RNG-Cv:
  Generate Y_1, ..., Y_n from the given distribution
  X = Y_1 + Y_2 + ... + Y_n
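A minimal Python sketch of the convolution approach for the Erlang distribution:

import math
import random

def erlang_variate(n, lam):
    """Erlang(n, lam) variate as the sum of n iid Exponential(lam) variates."""
    return sum(-math.log(1.0 - random.random()) / lam for _ in range(n))

print(erlang_variate(3, 2.0))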
Accept/Reject Method (C-RNG-AR)
Suppose that we would like to generate random variates from a pdf fX(x) and we have an efficient way to generate variates from a pdf gX(x).
Let c be a constant such that

f_X(x) / g_X(x) <= c   for all x
In this case, use the following algorithm
C-RNG-AR:
1. Generate a random variate Y from density g_X(x).
2. Generate u = U(0,1).
3. If u < f_X(Y) / (c g_X(Y)), set X = Y and return;
4. Else repeat from Step 1.
Accept/Reject Method (C-RNG-AR)
The C-RNG-AR algorithm is similar to the D-RNG-AR algorithm except for the comparison step: rather than comparing two probabilities, we compare the values of the density functions.

Theorem
- The random variates generated by the Accept/Reject method have density f_X(x).
- The number of iterations of the algorithm that are needed is a geometric random variable with mean c.

Note: The constant c is important since it determines the expected number of iterations needed before a variate is accepted; therefore it should be selected to be as small as possible.
C-RNG-AR Example
Use the C-RNG-AR method to generate random variates X that are normally distributed with mean 0 and variance 1, N(0,1).
First consider the pdf of the absolute value |X|:

f_{|X|}(x) = (2 / sqrt(2π)) e^{-x^2/2},   x >= 0

We know how to generate exponentially distributed random variates Y with rate λ = 1:

g_Y(x) = e^{-x},   x >= 0

Determine c such that it is equal to the maximum of the ratio

f_{|X|}(x) / g_Y(x) = sqrt(2/π) e^{x - x^2/2}

which is maximized at x = 1, giving

c = sqrt(2e/π)
C-RNG-AR Example
C-RNG-AR for N(0,1):
  u1 = U(0,1); u2 = U(0,1);
  Y = -log(u1);
  while (u2 > exp(-0.5*(Y-1)*(Y-1)))
    u1 = U(0,1); u2 = U(0,1);
    Y = -log(u1);
  u3 = U(0,1);
  If u3 < 0.5, X = Y; Else X = -Y;
Suppose we would like Z ~ N(μ, σ^2); then set

Z := μ + σ X
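A minimal Python sketch of this accept/reject normal generator:

import math
import random

def normal_variate_ar(mu=0.0, sigma=1.0):
    """N(mu, sigma^2) variate via accept/reject with an Exponential(1) proposal for |X|."""
    while True:
        y = -math.log(1.0 - random.random())         # Exponential(1) proposal
        u = random.random()
        if u <= math.exp(-0.5 * (y - 1.0) ** 2):     # accept with prob f(y)/(c g(y))
            x = y if random.random() < 0.5 else -y   # attach a random sign
            return mu + sigma * x

print([round(normal_variate_ar(), 3) for _ in range(5)])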
Generating a Homogeneous Poisson Processes
A homogeneous Poisson process is a sequence of points (events) where the inter-event times are exponentially distributed with rate λ. (The Poisson process will be studied in detail in later classes.)
Let ti denote the ith point of a Poisson process, then the algorithm for generating the first N points of the sequence {ti, i=1,2,…,N} is given by
Algorithm Poisson-λ:
  k = 0, t(k) = 0;
  While k < N
    k = k + 1;
    Generate u = U(0,1)
    t(k) = t(k-1) - log(u)/lambda;
  Return t.
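A minimal Python sketch of this generator:

import math
import random

def poisson_process(lam, N):
    """First N event times of a homogeneous Poisson process with rate lam."""
    t = []
    current = 0.0
    for _ in range(N):
        current += -math.log(1.0 - random.random()) / lam   # exponential inter-event time
        t.append(current)
    return t

print(poisson_process(2.0, 5))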
Generating a Non-Homogeneous Poisson Processes
Suppose that the process is non-homogeneous, i.e., the rate varies with time, with λ(t) ≤ λ for all t < T.
Let ti denote the ith point of a Poisson process, and τ the actual time, then the algorithm for generating the first N points of the sequence {ti, i=1,2,…,N} is given by
Algorithm Thinning Poisson-λ:
  k = 0, t(k) = 0, tau = 0;
  While k < N
    Generate u1 = U(0,1); tau = tau - log(u1)/lambda;
    Generate u2 = U(0,1);
    If (u2 <= lambda(tau)/lambda)
      k = k + 1, t(k) = tau;
  Return t.
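A minimal Python sketch of the thinning algorithm (illustrative; lambda_t can be any rate function bounded above by lam_max, and the one used in the example call is arbitrary):

import math
import random

def nonhomogeneous_poisson_thinning(lambda_t, lam_max, N):
    """First N event times of a non-homogeneous Poisson process via thinning."""
    t, tau, k = [], 0.0, 0
    while k < N:
        tau += -math.log(1.0 - random.random()) / lam_max    # candidate at the bounding rate
        if random.random() <= lambda_t(tau) / lam_max:        # accept with prob lambda(tau)/lam_max
            t.append(tau)
            k += 1
    return t

print(nonhomogeneous_poisson_thinning(lambda s: 1.0 / (s + 1.0), 1.0, 5))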
Generating a Non-Homogeneous Poisson Processes
Again, suppose that the process is non-homogeneous, i.e., the rate varies with time with λ(t) ≤ λ for all t < T, but now we would like to generate the points t_i directly, without thinning.

Assuming that we are at point t_i, the question we need to answer is: what is the cdf of S_i, where S_i is the time until the next event?
F_{S_i}(s) = Pr{S_i <= s | t_i} = 1 - exp( - integral from 0 to s of λ(t_i + y) dy )
Thus, to simulate this process, we start from t0 and generate S1 from FS1 to go to t1=t0+S1. Then, from t1, we generate S2 from FS2 to go to t2=t1+S2 and so on.
Example of Non-Homogeneous Poisson Processes
Suppose that λ(t) = 1/(t + a), t >= 0, for some positive constant a. Generate variates from this non-homogeneous Poisson process.

First, let us determine the cdf of S_i. The integrated rate is

integral from 0 to s of λ(t_i + y) dy = integral from 0 to s of dy / (a + t_i + y) = log( (s + t_i + a) / (t_i + a) )

so that

F_{S_i}(s) = 1 - exp( -log( (s + t_i + a) / (t_i + a) ) ) = 1 - (t_i + a)/(s + t_i + a) = s / (s + t_i + a)

Inverting this yields

F_{S_i}^{-1}(u) = (t_i + a) u / (1 - u)
Example of Non-Homogeneous Poisson Processes
Inverse Transform:

F_{S_i}^{-1}(u) = (t_i + a) u / (1 - u)

Thus we start from t_0 = 0:

t_1 = t_0 + (t_0 + a) u_1 / (1 - u_1) = a u_1 / (1 - u_1)

t_2 = t_1 + (t_1 + a) u_2 / (1 - u_2)

...

t_{i+1} = t_i + (t_i + a) u_{i+1} / (1 - u_{i+1})
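A minimal Python sketch of this direct (inverse transform) construction for λ(t) = 1/(t + a):

import random

def nonhomogeneous_poisson_direct(a, N):
    """First N event times of the non-homogeneous Poisson process with rate 1/(t + a)."""
    t, current = [], 0.0
    for _ in range(N):
        u = random.random()
        current += (current + a) * u / (1.0 - u)   # S_i = F^{-1}(u) = (t_i + a) u / (1 - u)
        t.append(current)
    return t

print(nonhomogeneous_poisson_direct(1.0, 5))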
The Composition Approach
Suppose that we have an efficient way to generate variates from the cdfs G_1(x), ..., G_n(x), and that we would like to generate random variates for a random variable having cdf

F_X(x) = sum_{i=1}^{n} r_i G_i(x),   with   sum_{i=1}^{n} r_i = 1,   r_i >= 0,   i = 1, ..., n

Algorithm C-RNG-C:
  Generate u = U(0,1)
  If u < r_1, get X from G_1(x), return;
  If u < r_1 + r_2, get X from G_2(x), return;
  ...
Polar Method for Generating Normal Random Variates
Let X and Y be independent normally distributed random variables with zero mean and variance 1. Then the joint density function is given by

f_XY(x, y) = (1/(2π)) e^{-(x^2 + y^2)/2} = (1/sqrt(2π)) e^{-x^2/2} * (1/sqrt(2π)) e^{-y^2/2}

Then make the change of variables to polar coordinates (R, θ):

r^2 = x^2 + y^2,   θ = arctan(y/x)

The new joint density function (of d = R^2 and θ) is

f(d, θ) = (1/(2π)) * (1/2) e^{-d/2}

so θ is Uniform in the interval [0, 2π] and d = R^2 is Exponential with rate 1/2.

[Figure: a point (X, Y) in the plane with polar coordinates R and θ; the square with corners (-1,-1), (1,-1), (1,1), (-1,1) used by the rejection step of algorithm C-RNG-N2 below]
Polar Method for Generating Normal Random Variates
Algorithm C-RNG-N1:
  Generate u1 = U(0,1), u2 = U(0,1);
  R = -2*log(u1);
  W = 2*pi*u2;
  X = sqrt(R)*cos(W);
  Y = sqrt(R)*sin(W);
But sine and cosine evaluations are inefficient!

Algorithm C-RNG-N2:
1. Generate u1=U(0,1), u2=U(0,1);
2. Set V1= 2*u1-1, V2= 2*u2-1;
3. S=V1*V1+V2*V2;
4. If S > 1, Go to 1
5. R= sqrt(-2*log(S)/S);
6. X= R*V1;
7. Y= R*V2;
Note: each accepted pair (V1, V2) generates two independent normal random variates X and Y.
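A minimal Python sketch of Algorithm C-RNG-N2 (the polar rejection method):

import math
import random

def polar_normal_pair():
    """Two independent N(0,1) variates via the polar method without sine/cosine calls."""
    while True:
        v1 = 2.0 * random.random() - 1.0
        v2 = 2.0 * random.random() - 1.0
        s = v1 * v1 + v2 * v2
        if 0.0 < s <= 1.0:                         # accept points inside the unit circle
            r = math.sqrt(-2.0 * math.log(s) / s)
            return r * v1, r * v2

print(polar_normal_pair())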
Simulation of Discrete Event Systems
[Diagram: event-driven simulation loop — INITIALIZE; an EVENT CALENDAR holds the scheduled events (e1, t1), (e2, t2), ...; the CLOCK STRUCTURE / RNG supplies event lifetimes; at each step the STATE is updated as x' = f(x, e1), the TIME is updated as t' = t1, infeasible events are deleted from the calendar, and new feasible events are added.]
Verification of a Simulation Program
- Standard debugging techniques.
- Debug "modules" or subroutines.
- Create simple special cases where you know what to expect as an output from each of the modules.
- Often, by choosing the system parameters carefully, the simulation model can be evaluated analytically.
- Create a trace which keeps track of the state variables, the event list, and other variables.