25
An Introduction to Extreme Value Theory Petra Friederichs Meteorological Institute University of Bonn COPS Summer School, July/August, 2007

An Introduction to Extreme Value Theory

  • Upload
    others

  • View
    12

  • Download
    0

Embed Size (px)

Citation preview

Page 1: An Introduction to Extreme Value Theory

An Introduction to Extreme Value Theory

Petra Friederichs

Meteorological InstituteUniversity of Bonn

COPS Summer School, July/August, 2007

Page 2: An Introduction to Extreme Value Theory

Applications of EVT

Finance• distribution of income has so called fat tails• value-at-risk: maximal daily lost• re-assurance

Hydrology• protection against flood• Q100: maximal flow that is expected once every 100 years

Meteorology• extreme winds • risk assessment (e.g. ICE, power plants)• heavy precipitation events• heat waves, hurricanes, droughts• extremes in a changing climate

Page 3: An Introduction to Extreme Value Theory

In classical statistics: focus on AVERAGE behavior of stochastic process

central limit theorem

In extreme value theory:focus on extreme and rare events

Fisher-Tippett theorem

What is an extreme?

from Ulrike Schneider

Application

Page 4: An Introduction to Extreme Value Theory

Extreme Value Theory

Block Maximum

for

follows a Generalized Extreme Value (GEV) distribution

Peak over Threshold (POT)

very large threshold u

follow a Generalized Pareto Distribution (GPD)

Poisson-Point GPD Process

combines POT with Poisson point process

M n=max {X 1 , , X n}

M n

n∞

{X i−u∣X iu}

Page 5: An Introduction to Extreme Value Theory

Extreme Value Theory

Block Maximum

Example: station precipitation (DWD) at Dresden

1948 – 2004: 57 seasonal (Nov-March and May-Sept) (maxima over approximately 150 days)

M n=max {X 1 , , X n}

DresdenSummer

Winter

Page 6: An Introduction to Extreme Value Theory

GEV – Fisher-Tippett Theorem

G y=exp −[1y−]−1 /

The distribution of

converges to ( )

which is called the Generalized Extreme Value (GEV) distribution.

It has three parameters

location parameter

scale parameter

shape parameter

M n=max {X 1 , , X n}

n∞

G y=exp −exp −y−

≠0

=0

Page 7: An Introduction to Extreme Value Theory

GEV – Types of Distributions

GEV has 3 types depending on shape parameter

Gumbel

Fréchet

Weibull

=0

G x =exp−exp[−x ]

x=y−

=1 /0

G x =exp−[1x]−

=−1 /0

G x =exp−[1−x]

Page 8: An Introduction to Extreme Value Theory

GEV – Types of Distributions

GEV has 3 types depending on shape parameter

Gumbel

exponential tail

Fréchet

so called „fat tail“

Weibull

upper finite endpoint

=0

x=y−

=1 /0

=−1 /0

Page 9: An Introduction to Extreme Value Theory

GEV – Types of Distribution

Conditions to the sample from which the maxima are drawn

must be independently identically distributed (i.i.d.)

Let be the distribution of

is in the domain of attraction of a Gumbel type GEV iff

for all

Exponential decay in the tail of

{X 1 , , X n}

X i

X iF x

F x

limx∞

1−F xtb x 1−F x

=e−t

t0F x

F(x)p=0.9p=0.99

x0.9 x0.99

Page 10: An Introduction to Extreme Value Theory

GEV – Types of Distribution

Conditions to the sample from which the maxima are drawn

must be independently identically distributed (i.i.d.)

Let be the distribution of

is in the domain of attraction of a Frechet type GEV iff

for all

polynomial decay in the tail of

{X 1 , , X n}

X i

X iF x

F x

limx∞

1−F x 1−F x

=−1/

0F x

Page 11: An Introduction to Extreme Value Theory

GEV – Types of Distribution

Conditions to the sample from which the maxima are drawn

must be independently identically distributed (i.i.d.)

Let be the distribution of

is in the domain of attraction of a Weibull type GEV iff

there exists with

and

for all

has a finite upper end point

{X 1 , , X n}

X i

X iF x

F x

limx∞1−F F−

1 x1−F F−

1x−1

=1/

0F x

F F =1F

F

Page 12: An Introduction to Extreme Value Theory

GEV – return level

zm= − log −log 1−1/m

Of interest often is return level

value expected every m observation (block maxima)

calculated using invers distribution function (quantile function)

can be estimated empirically as the

zm

Prob yzm=1−G yzm=1m

zm= G−11−1m = inf y ∣F y

1m

zm=G−11−

1m = −

1−−log 1−1/m−

Page 13: An Introduction to Extreme Value Theory

DresdenSummer

Winter

Maximum daily precipitation for Nov-March (Winter) and May-Sept (Summer)

Block Maxima

Page 14: An Introduction to Extreme Value Theory

Block Maxima

Page 15: An Introduction to Extreme Value Theory

Block Maxima

Page 16: An Introduction to Extreme Value Theory

Peak over Threshold

Peak over Threshold (POT)

very large threshold u

follow a Generalized Pareto Distribution (GPD)

{X i−u∣X iu}

Daily precipitation for Nov-March (green) and May-Sept (red) Dresden

Page 17: An Introduction to Extreme Value Theory

Peak over Threshold

Peak over Threshold (POT)

The distribution of

exceedances over large threshold u

are asymptotically distributed following a

Generalized Pareto Distribution (GPD)

two parameters

scale parameter

shape parameter

advantage: more efficient use of data

disadvantage: how to choose threshold not evident

Y i :=X i−u∣X iu

H y∣X iu=1−1yu−1/

Page 18: An Introduction to Extreme Value Theory

POT – Types of Distribution

GDP has same 3 types as GEV depending on shape parameter

Gumbel

exponential tail

Pareto (Fréchet)

polynomial tail behavior

Weibull

has upper end point

=0

H y =1−exp−yu

0

1−H y ~c y−1/

0

F=u∣∣

Page 19: An Introduction to Extreme Value Theory

Peak over Threshold

POT: threshold u = 15mm

Page 20: An Introduction to Extreme Value Theory

Poisson Point – GPD Process

A= t 2−t1 [1y−u u

]−1/

Poisson point – GPD process with intensity

on

A= t1, t2 × y ,∞

Page 21: An Introduction to Extreme Value Theory

Estimation and Uncertainty

General concept of estimating parameters from a sample

Maximum Likelihood (ML) Method

Assume are draw from a GEV (GPD,...) with unknown parameters

and PDF

{ yi} , i=1, , n

yi~F y∣ , ,

{ yi}

f y∣ , ,=F ' y∣ , ,

Page 22: An Introduction to Extreme Value Theory

Maximum Likelihood Method

L , ,=∏i=1

nf yi∣ , ,

The likelihood L of the sample is then

It is easier to minimize the negative logarithm of the likelihood

in general there is no analytical solution for the minimum with respect to the parameters

Minimize using numerical algorithms.

The estimates maximize the likelihood of the data.

l , ,=−∑i=1

nlog f yi∣ , ,

, ,

Page 23: An Introduction to Extreme Value Theory

Profile log Likelihood

Page 24: An Introduction to Extreme Value Theory

To Take Home

There exists a well elaborated statistical theory for extreme values.

It applies to (almost) all (univariate) extremal problems.

EVT: extremes from a very large domain of stochastic processes follow one of the three types: Gumbel, Frechet/Pareto, or Weibull

Only those three types characterize the behavior of extremes!

Note: Data need to be in the asymptotic limit of a EVD!

Page 25: An Introduction to Extreme Value Theory

References

Coles, S (2001): An Introduction to Statistical Modeling of Extreme Values. Springer Series in Statistics. Springer Verlag London. 208p

Beirlant, J; Y. Goegebeur; J. Segers; J. Teugels (2005): Statistics of Extremes. Theory and Applications. John Wiley & Sons Ltd. 490p

Embrechts, Küppelberg, Mikosch (1997): Modelling Extremal Events for Insurance and Finance. Springer Verlag Heidelberg.648p

Gumbel, E.J. (1958): Statistics of Extremes. (Dover Publication, New York 2004)

R Development Core Team (2003): R: A language and environment for statistical computing, available at http://www.R-project.org

The evd and ismev Packages by Alec Stephenson and Stuard Coles