24
FREQUENCY ANALYSIS Basic Problem: To relate the magnitude of extreme events to their frequency of occurrence through the use of probability distributions.

FREQUENCY ANALYSIS Basic Problem: To relate the magnitude of extreme events to their frequency of occurrence through the use of probability distributions

  • View
    218

  • Download
    0

Embed Size (px)

Citation preview

Page 1: FREQUENCY ANALYSIS Basic Problem: To relate the magnitude of extreme events to their frequency of occurrence through the use of probability distributions

FREQUENCY ANALYSIS

• Basic Problem:

To relate the magnitude of extreme events to their frequency of occurrence through the use of probability distributions.

Page 2: FREQUENCY ANALYSIS Basic Problem: To relate the magnitude of extreme events to their frequency of occurrence through the use of probability distributions

FREQUENCY ANALYSIS

• Basic Assumptions:

(a) Data analyzed are to be statistically independent

& identically distributed - selection of data (Time

dependence, time scale, mechanisms).

(b) Change over time due to man-made (eg.

urbanization) or natural processes do not alter the

frequency relation - temporal trend in data

(stationarity).

Page 3: FREQUENCY ANALYSIS Basic Problem: To relate the magnitude of extreme events to their frequency of occurrence through the use of probability distributions

FREQUENCY ANALYSIS

• Practical Problems:(a) Selection of reasonable & simple distribution.

(b) Estimation of parameters in distribution.

(c) Assessment of risk with reasonable accuracy.

Page 4: FREQUENCY ANALYSIS Basic Problem: To relate the magnitude of extreme events to their frequency of occurrence through the use of probability distributions

REVIEW OF BASIC CONCEPTSProbabilistic

Outcome of a hydrologic event (e.g., rainfall amount & duration; flood peak discharge; wave height, etc.) is random and cannot be predicted with

certainty.

Terminologies Population The collection of all possible outcomes relevant to the process of interest. Example: (1) Max. 2-hr rainfall depth: all non-negative real numbers; (2) No. of storm in June: all non-negative integer numbers.

Sample A measured segment (or subset) of the population.

Page 5: FREQUENCY ANALYSIS Basic Problem: To relate the magnitude of extreme events to their frequency of occurrence through the use of probability distributions

REVIEW OF BASIC CONCEPTSTerminologies - Random Variable A variable describable by a probability distribution which

specifies the chance that the variable will assume a particular value.

Convention: Capital letter for random variables (say, X) whereas the lower case letter (say, x) for numerical realization that the random variable X will take.

Example: X = rainfall amount in 2 hours (a random variable) ; x = 100.2 mm/2hr (realization).- Random variables can be - discrete (eg., no. of rainy days in June) or - continuous (eg., max. 2-hr rainfall amount, flood discharge)..

Page 6: FREQUENCY ANALYSIS Basic Problem: To relate the magnitude of extreme events to their frequency of occurrence through the use of probability distributions

REVIEW OF BASIC CONCEPTSTerminologies Frequency & Relative Frequency

o For discrete random variables:o Frequency is the number of occurrences of a specific event. Relative frequency

is resulting from dividing frequency by the total number of events. e.g.

n = no. of years having exactly 50 rainy days; N = total no. of years.

Let n=10 years and N=100 years. Then, the frequency of having exactly 50 rainy days is 10 and the relative frequency of having exactly 50 rainy days in 100 years is n/N = 0.1.

o For continuous random variables:o Frequency needs to be defined for a class interval.o A plot of frequency or relative frequency versus class intervals is called

histogram or probability polygon.o As the number of sample gets infinitely large and class interval length

approaches to zero, the histogram will become a smooth curve, called probability density function.

Page 7: FREQUENCY ANALYSIS Basic Problem: To relate the magnitude of extreme events to their frequency of occurrence through the use of probability distributions

REVIEW OF BASIC CONCEPTSTerminologies

Probability Density Function (PDF) – • For a continuous random variable, the PDF must satisfy

and f(x) ≥ 0 for all values of x.

• For a discrete random variable, the PDF must satisfy

and 1≥ p(x) ≥ 0 for all values of x.

-

( ) 1f x dx

( ) 1all x

p x

Page 8: FREQUENCY ANALYSIS Basic Problem: To relate the magnitude of extreme events to their frequency of occurrence through the use of probability distributions

REVIEW OF BASIC CONCEPTSTerminologies

- Cumulative Distribution Function -

For a continuous random variable,

For a discrete random variable, by

-

Pr( ) ( ) ox

oX x f x dx

Pr( ) = ( )i o

o iall x x

X p xx

Page 9: FREQUENCY ANALYSIS Basic Problem: To relate the magnitude of extreme events to their frequency of occurrence through the use of probability distributions

Statistical Properties of Random Variables

• Population - Synonymous to sample space, which describes the complete assemblage of all the values representative of a particular random process.

• Sample - Any subset of the population.

• Parameters - Quantities that are descriptive of the population in a statistical model. Normally, Greek letters are used to denote statistical parameters.

• Sample statistics (or simply statistics): Quantities calculated

on the basis of sample observations.

Page 10: FREQUENCY ANALYSIS Basic Problem: To relate the magnitude of extreme events to their frequency of occurrence through the use of probability distributions

Statistical Moments of Random Variables

• Descriptors commonly used to show statistical properties of a RV are those indicative

(1) Central tendency; (2) Dispersion; (3) Asymmetry.

• Frequently used descriptors in these three categories are related to statistical moments of a RV.

• Two types of statistical moments are commonly used in hydrosystem engineering applications:

(1) product-moments and (2) L-moments.

Page 11: FREQUENCY ANALYSIS Basic Problem: To relate the magnitude of extreme events to their frequency of occurrence through the use of probability distributions

Product-Moments • rth-order product-moment of X about any reference point X=xo is defined, for continuous case,

as

whereas for discrete case,

where E[] is a statistical expectation operator. • In practice, the first three moments (r=1, 2, 3) are used to describe the central tendency,

variability, and asymmetry. • Two types of product-moments are commonly used:

– Raw moments: µr'=E[Xr] rth-order moment about the origin; and – Central moments: µr=E[(Xµx)r] = rth-order central moment – Relations between two types of product-moments are:

. where Cn,x = binomial coefficient = n!/(x!(n-x)!)• Main disadvantages of the product-moments are:

(1) Estimation from sample observations is sensitive to the presence of outliers; and (2) Accuracy of sample product-moments deteriorates rapidly with increase in the order of the moments.

r r r

o o x o x

- -

E X x = x x f x dx = x x dF x

1

Kr r

o k o x kk=

E X x = x x p x

0

1r

i i 'r,i x r ir

i=

= μμ C 0

r' i

r,ir x r ii=

μ = μC

Page 12: FREQUENCY ANALYSIS Basic Problem: To relate the magnitude of extreme events to their frequency of occurrence through the use of probability distributions

Mean, Mode, Median, and Quantiles• Expectation (1st-order moment) measures central tendency of random variable X

– Mean () = Expectation = 1 = location of the centroid of PDF or PMF.– Two operational properties of the expectation are useful:

• in which k=E[Xk] for k = 1,2, …, K.

• For independent random variables,

• Mode (xmo) - the value of a RV at which its PDF is peaked. The mode, xmo, can be obtained by solving

• Median (xmd) - value that splits the distribution into two equal halves, i.e,

• Quantiles - 100pth quantile of a RV X is a quantity xp that satisfies P(X xp) = Fx(xp) = p • A PDF could be uni-modal, bimodal, or multi-modal. Generally, the mean, median, and mode

of a random variable are different, unless the PDF is symmetric and uni-modal.

1x x x x

- -

E X = μ = x f x dx = x dF x = F x dx

1 1

K K

k k k kk= k=

E a X = a

1 1

K K

k kk= k=

E X =

0 5mdx

x md xF x = f x dx = .

0

mo

x

x x

f x=

x

Page 13: FREQUENCY ANALYSIS Basic Problem: To relate the magnitude of extreme events to their frequency of occurrence through the use of probability distributions

Uni-modal and bi-modal distributions

x

(a) Uni-modal distribution

fx(x)

x

(b) Bi-modal distribution

fx(x)

Page 14: FREQUENCY ANALYSIS Basic Problem: To relate the magnitude of extreme events to their frequency of occurrence through the use of probability distributions

Variance, Standard Deviation, and Coefficient of Variation

• Variance is the second-order central moment measuring the spreading of a RV over its range,

• Standard deviation (x) is the positive square root of the variance. • Coefficient of variation, x=x/x, is a dimensionless measure; useful for

comparing the degree of uncertainty of two RVs with different units. • Three important properties of the variance are:

– (1) Var[c] = 0 when c is a constant.– (2) Var[X] = E[X2] E2[X]– For multiple independent random variables,

where ak =a constant and k = standard deviation of Xk, k=1,2, ..., K.

2 2

1 1

K K

k k k kk= k=

Var a X = a

2 222 x x x x

-

Var X = = E X μ = x μ f x dx

Page 15: FREQUENCY ANALYSIS Basic Problem: To relate the magnitude of extreme events to their frequency of occurrence through the use of probability distributions

Skewness Coefficient • Measures asymmetry of the PDF of a random variable• Skewness coefficient, x, defined as

• The sign of the skewness coefficient indicates the degree of symmetry of the probability distribution function.

• Pearson skewness coefficient –

• In practice, product-moments higher than 3rd-order are less used because they are unreliable and inaccurate when estimated from a small number of samples

• See Table for equations to compute the sample product-moments.

3

31 5 32

x

x .x

E X μμ = =

μ

1mox

x

μ x=

Page 16: FREQUENCY ANALYSIS Basic Problem: To relate the magnitude of extreme events to their frequency of occurrence through the use of probability distributions

Relative locations of mean, median, and mode for positively-skewed, symmetric, and negatively-skewed distributions.

x x mo x md

(a) Positively Skewed, x>0

fx(x)

x

xxmo=xmd

(b) Symmetric, x=0

fx(x)

x

x md x mo x

(c) Negatively skewed, x<0

fx(x)

x

Page 17: FREQUENCY ANALYSIS Basic Problem: To relate the magnitude of extreme events to their frequency of occurrence through the use of probability distributions

Product-moments of random variables

Moment Measure of Definition Continuous Variable Discrete Variable Sample Estimator

First Central Location

Mean, Expected value E(X)=x

( )x xx f x dx

( )x k kall x's

μ x p x ix x / n

Second Dispersion

Variance, Var(X)=2= x2

Standard deviation, x

Coefficient of variation,x

2 2( ) ( )x x xx μ f x dx

( )x Var X

x = xx

22

'k x x kx P x

x all x s

( )x Var X

x = xx

2 211 is x x

n

211 is x x

n

vC s x

Third Asymmetry

Skewness Skewness coefficient, x

33 ( ) ( )x xμ x μ f x dx

x = 3 / x

3

33 ( ) ( )k x x k

all x's

μ x μ p x x = 3 / x

3

3

3 ( 1)( 2) i

nm x x

n n

g = m3 / s

3

Fourth Peakedness

Kurtosis, x

Excess coefficient, x

44 ( ) ( )x xμ x μ f x dx

x = 4 / x

4

xx

44 ( ) ( )k x x k

all x's

μ x μ p x x = 4 / x

4

xx

4

4

( 1)

( 1)( 2)( 3) i

n nm x x

n n n

k = m4 / s

4

Page 18: FREQUENCY ANALYSIS Basic Problem: To relate the magnitude of extreme events to their frequency of occurrence through the use of probability distributions

Kurtosis (x)

• Measure of the peakedness of a distribution. • Related to the 4th central product-moment as

• For a normal RV, its kurtosis is equal to 3. Sometimes, coefficient of excess, x=x3, is used.

• All feasible distribution functions, skewness coefficient and kurtosis must satisfy

4

42 42

x

xx

E X μμ = =

μ

2 1x x

Page 19: FREQUENCY ANALYSIS Basic Problem: To relate the magnitude of extreme events to their frequency of occurrence through the use of probability distributions

Some Commonly Used Distributions

• NORMAL DISTRIBUTION

Standardized Variable:

Z has mean 0 and standard deviation 1.

X

Z

x < < ,

σ

μx -

π =x f

x

x

x

xxN for2

1exp

2

1 ,|

2

2

Page 20: FREQUENCY ANALYSIS Basic Problem: To relate the magnitude of extreme events to their frequency of occurrence through the use of probability distributions

Some Commonly Used Distributions

• STANDARD NORMAL DISTRIBUTION:

0

0.1

0.2

0.3

0.4

0.5

-3 -2 -1 0 1 2 3

z

( z )

Page 21: FREQUENCY ANALYSIS Basic Problem: To relate the magnitude of extreme events to their frequency of occurrence through the use of probability distributions

Some Commonly Used Distributions • LOG-NORMAL DISTRIBUTION

0.0

0.2

0.40.60.8

1.01.2

1.41.6

0 1 2 3 4 5 6x

fLN(x)

x=0.6 x=1.3

x=0.3

(a) x = 1.0

0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0 1 2 3 4 5 6x

fLN(x)

x =1.65

x =2.25

x =4.50

(b) x = 1.30

2

2 lnlnln

lnln

ln( ) -1 1 | , = exp - , > 0

22x

xxLNxx

xx xf

x

Page 22: FREQUENCY ANALYSIS Basic Problem: To relate the magnitude of extreme events to their frequency of occurrence through the use of probability distributions

Some Commonly Used Distributions

• Gumbel (Extreme-Value Type I) Distribution

0

0.1

0.2

0.3

0.4

-4 -3 -2 -1 0 1 2 3 4

y

Max

Min

fEV1(y)

1 | , exp exp for maxima

1 exp exp for minima

EVx

x = F

x ξ = +

β

1

1| , exp exp for maxima

1exp exp for minima

EV

x x x = f

x x = + +

Page 23: FREQUENCY ANALYSIS Basic Problem: To relate the magnitude of extreme events to their frequency of occurrence through the use of probability distributions

Some Commonly Used Distributions • Log-Pearson Type 3 Distribution

e

β

ξx

Γβ = xf x-

α

P

/

1

3

1,,|

0.00

0.05

0.10

0.15

0.20

0.25

0.30

0 2 4 6 8 10 12 14

x

fG(x)

Page 24: FREQUENCY ANALYSIS Basic Problem: To relate the magnitude of extreme events to their frequency of occurrence through the use of probability distributions

Some Commonly Used Distributions • Log-Pearson Type 3 Distribution

e

β

ξx

Γβ x = xf x-

α

LP

/)ln(

1

3

ln1,,|

with >0, xe when >0 and with >0, xe when <0