18
Introduction Spatial Point Pattern Analysis Applications of Clustering Measures Conclusions Spatial Statistics & R Our recent developments C. Vega Orozco, J. Golay, M. Tonini, M. Kanevski Center for Research on Terrestrial Environment Faculty of Geosciences and Environment University of Lausanne June 26, 2013

Spatial Statistics & R - EPFL · Introduction Spatial Point Pattern Analysis Applications of Clustering Measures Conclusions Point Processes Point Processes ß The events are represented

  • Upload
    others

  • View
    5

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Spatial Statistics & R - EPFL · Introduction Spatial Point Pattern Analysis Applications of Clustering Measures Conclusions Point Processes Point Processes ß The events are represented

Introduction Spatial Point Pattern Analysis Applications of Clustering Measures Conclusions

Spatial Statistics & ROur recent developments

C. Vega Orozco, J. Golay, M. Tonini, M. Kanevski

Center for Research on Terrestrial EnvironmentFaculty of Geosciences and Environment

University of Lausanne

June 26, 2013

Page 2: Spatial Statistics & R - EPFL · Introduction Spatial Point Pattern Analysis Applications of Clustering Measures Conclusions Point Processes Point Processes ß The events are represented

Introduction Spatial Point Pattern Analysis Applications of Clustering Measures Conclusions

1 IntroductionSpatial StatisticsMethods of investigation

2 Spatial Point Pattern AnalysisPoint ProcessesValidity Domain

3 Applications of Clustering MeasuresFractal DimensionMultifractal MeasuresThe Morisita Index

4 Conclusions

Page 3: Spatial Statistics & R - EPFL · Introduction Spatial Point Pattern Analysis Applications of Clustering Measures Conclusions Point Processes Point Processes ß The events are represented

Introduction Spatial Point Pattern Analysis Applications of Clustering Measures Conclusions

Outline

1 IntroductionSpatial StatisticsMethods of investigation

2 Spatial Point Pattern AnalysisPoint ProcessesValidity Domain

3 Applications of Clustering MeasuresFractal DimensionMultifractal MeasuresThe Morisita Index

4 Conclusions

Page 4: Spatial Statistics & R - EPFL · Introduction Spatial Point Pattern Analysis Applications of Clustering Measures Conclusions Point Processes Point Processes ß The events are represented

Introduction Spatial Point Pattern Analysis Applications of Clustering Measures Conclusions

Spatial Statistics

Introduction

Spatial Statisticsß It concerns the analysis of how spatial objects and its attributes are

distributed in space and space-time.

ß Three main branches of spatial statistics are commonly distinguished:geostatistics, lattice statistics and spatial point pattern analysis.

ß Spatial point pattern analysis: distribution of point objects in spaceand analysis and modelling of the associated spatial pattern.

Motivation

ß Many environmental, socio-economic and other data can be treated asstochastic point processes.

ß Frequently, such events exhibit scaling behaviour indicating clustering oftheir spatial point pattern.

Page 5: Spatial Statistics & R - EPFL · Introduction Spatial Point Pattern Analysis Applications of Clustering Measures Conclusions Point Processes Point Processes ß The events are represented

Introduction Spatial Point Pattern Analysis Applications of Clustering Measures Conclusions

Methods of investigation

Methods of investigation

Space-Timepattern analysis

Globalmeasures

Timeanalysis

AllanFactor

MorisitaIndex

(Multi)-fractal

dimension

Lacunarity

Ripley’s K -function

Spaceanalysis

Voronoïpolygons

MorisitaIndex

(Multi)-fractal

dimension

Lacunarity

Ripley’s K -function

Space-Time

K -function

MorisitaIndexLocal

measures

GeostatisticsMultivariate

Geo-simulations

Space-timevariogramLocal fractal

measuresSandboxcounting

Space-time scanstatistics

Retros-pective

ProspectiveRegularshape

Irregularshape

Susceptibilitymapping(MachineLearning)

Featureselection

RandomForest

ArtificialNeural

NetworkSOM

SupportVector

Machine

Page 6: Spatial Statistics & R - EPFL · Introduction Spatial Point Pattern Analysis Applications of Clustering Measures Conclusions Point Processes Point Processes ß The events are represented

Introduction Spatial Point Pattern Analysis Applications of Clustering Measures Conclusions

Outline

1 IntroductionSpatial StatisticsMethods of investigation

2 Spatial Point Pattern AnalysisPoint ProcessesValidity Domain

3 Applications of Clustering MeasuresFractal DimensionMultifractal MeasuresThe Morisita Index

4 Conclusions

Page 7: Spatial Statistics & R - EPFL · Introduction Spatial Point Pattern Analysis Applications of Clustering Measures Conclusions Point Processes Point Processes ß The events are represented

Introduction Spatial Point Pattern Analysis Applications of Clustering Measures Conclusions

Point Processes

Point Processes

ß The events are represented by points (geographical coordinates) andmarks (attributes).

ß It aims at analysing the geometrical structure of patterns formed by ob-jects that are distributed in 1-, 2- or 3-dimensional space.

ß Analyses concern the degree of clustering of the points and the spatialscale at which these operate.

ß It provides information on the geometrical properties of the point structureand on underlying processes that have caused the patterns.

ß Methods:[ Topological measures[ Statistical measures: the Morisita index[ (Multi)Fractal measures: the box-counting method and the general-

ized Rényi dimensions

Page 8: Spatial Statistics & R - EPFL · Introduction Spatial Point Pattern Analysis Applications of Clustering Measures Conclusions Point Processes Point Processes ß The events are represented

Introduction Spatial Point Pattern Analysis Applications of Clustering Measures Conclusions

Validity Domain

Validity Domains & Simulations

The concept of Validity Domain (VD) is applied to take into account thenatural constraints of the geographical space.

Within the VD, it is possible to simulate random point patterns (e.g. CSR orother patterns with known properties).

These patterns are compared to the real phenomena allowing quantifying thereal clustering.

Example of simulations in VD:

Regular pattern in Bounding Box

Random pattern in Bounding Box

Random pattern in Canton Ticino

Random pattern in Forest area

Raw pattern of forest fires

Page 9: Spatial Statistics & R - EPFL · Introduction Spatial Point Pattern Analysis Applications of Clustering Measures Conclusions Point Processes Point Processes ß The events are represented

Introduction Spatial Point Pattern Analysis Applications of Clustering Measures Conclusions

Outline

1 IntroductionSpatial StatisticsMethods of investigation

2 Spatial Point Pattern AnalysisPoint ProcessesValidity Domain

3 Applications of Clustering MeasuresFractal DimensionMultifractal MeasuresThe Morisita Index

4 Conclusions

Page 10: Spatial Statistics & R - EPFL · Introduction Spatial Point Pattern Analysis Applications of Clustering Measures Conclusions Point Processes Point Processes ß The events are represented

Introduction Spatial Point Pattern Analysis Applications of Clustering Measures Conclusions

Fractal Dimension

Fractal Dimension: the box-counting method

A regular grid of boxes of size δ is superimposed on the study area and thenumber of boxes, N(δ), necessary to cover the point set is counted.Then, the size of the boxes is reduced and the number of boxes is countedagain. The algorithm goes on until a minimum δ size is reach.For a fractal point pattern, the scales (δ) and the number of boxes (N(δ)) followa power law:

N(δ) ∝ δ−dfbox

quadrats: 2 x 2

x

quadrats: 8 x 8

x

y

quadrats: 32 x 32 quadrats: 64 x 64

y

log(

N(δ

))log(δ)

log(

N(δ

))

log(δ)lo

g(N

(δ))

log(δ)

log(

N(δ

))

log(δ)

Page 11: Spatial Statistics & R - EPFL · Introduction Spatial Point Pattern Analysis Applications of Clustering Measures Conclusions Point Processes Point Processes ß The events are represented

Introduction Spatial Point Pattern Analysis Applications of Clustering Measures Conclusions

Fractal Dimension

Regular pattern in Bounding Box

Random pattern in Bounding Box

Random pattern in Canton Ticino

Random pattern in Forest area

Raw pattern of forest fires

log(δ)

log(

N(δ

))

12.5 13 13.5 14 14.5 15 15.5 16 16.5

0.0

1.0

2.0

3.0

4.0

5.0

6.0

7.0

8.0

9.0

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

Legend

Raw Pattern = −1.68Random Pattern in Forest = −1.74Random Pattern in Ticino = −1.80Random Pattern in Box = −1.99Regular Pattern in Box = −2.00

Page 12: Spatial Statistics & R - EPFL · Introduction Spatial Point Pattern Analysis Applications of Clustering Measures Conclusions Point Processes Point Processes ß The events are represented

Introduction Spatial Point Pattern Analysis Applications of Clustering Measures Conclusions

Multifractal Measures

Multifracal measures: the generalized Rényi dimensions

A regular grid of boxes of size δ is superimposed on the study area. Let N(δ)be the number of non-overlapping boxes of size δ needed to cover the fractaland pi(δ) the mass probability function within the i th box.The generalized Rényi dimensions, Dq , is computed through the parameter q:

Dq =1

(1− q)limδ→0

log(∑N(δ)

i=1 pi(δ)q)

log(1/δ)

quadrats: 8 x 8

x

quadrats: 8 x 8

x

y

quadrats: 8 x 8 quadrats: 8 x 8

y

Rény

i dim

ensi

onlog(δ) log(δ)

log(δ) log(δ)

Rény

i dim

ensi

on

Rény

i dim

ensi

on

Rény

i dim

ensi

on

Page 13: Spatial Statistics & R - EPFL · Introduction Spatial Point Pattern Analysis Applications of Clustering Measures Conclusions Point Processes Point Processes ß The events are represented

Introduction Spatial Point Pattern Analysis Applications of Clustering Measures Conclusions

Multifractal Measures

Regular pattern in Bounding Box

Random pattern in Bounding Box

Random pattern in Canton Ticino

Random pattern in Forest area

Raw pattern of forest fires

q

Dq

0 1 2 3 4 5 6 7 8 9 10

1.0

1.1

1.2

1.3

1.4

1.5

1.6

1.7

1.8

1.9

2.0

●●

●●

●●

●● ●

●●

● ● ● ● ● ● ● ● ●

●●

●●

●● ● ● ● ● ●

●●

●●

●●

●●

●●

●●

Legend

Raw PatternRandom Pattern in ForestRandom Pattern in TicinoRandom Pattern in BoxRegular Pattern in Box

Page 14: Spatial Statistics & R - EPFL · Introduction Spatial Point Pattern Analysis Applications of Clustering Measures Conclusions Point Processes Point Processes ß The events are represented

Introduction Spatial Point Pattern Analysis Applications of Clustering Measures Conclusions

The Morisita Index

The Morisita index

It measures how many times more likely it is to randomly select two pointsbelonging to the same box than it would be if the points were randomly dis-tributed.

The Morisita index, Iδ, for a chosen box size δ is computed as follows:

Iδ = Q∑Q

i=1 ni(ni − 1)N(N − 1)

Mor

isita

Inde

x

1

One box Four boxes

Nine boxes Sixteen boxes

α

ni

Mor

isita

Inde

x

1

δ

Mor

isita

Inde

x

1

δ δ

δ

Mor

isita

Inde

x1

δ

δ δ

δ

L

ni ni

ni

Page 15: Spatial Statistics & R - EPFL · Introduction Spatial Point Pattern Analysis Applications of Clustering Measures Conclusions Point Processes Point Processes ß The events are represented

Introduction Spatial Point Pattern Analysis Applications of Clustering Measures Conclusions

The Morisita Index

Regular pattern in Bounding Box

Random pattern in Bounding Box

Random pattern in Canton Ticino

Random pattern in Forest area

Raw pattern of forest fires

δ

Mor

isita

Inde

x

0 20000 40000 60000 80000 1e+05

0

1

2

3

4

5

6

7

8

●●

●●●

●●●●●●

●●●●

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

●●●

●●●

●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●● ●

●●

●●

●●●

●●

●●

●●●

●●

●●

●●

●●

●●●●●

●●

●●

●●

Legend

Raw PatternRandom Pattern in ForestRandom Pattern in TicinoRandom Pattern in BoxRegular Pattern in Box

Page 16: Spatial Statistics & R - EPFL · Introduction Spatial Point Pattern Analysis Applications of Clustering Measures Conclusions Point Processes Point Processes ß The events are represented

Introduction Spatial Point Pattern Analysis Applications of Clustering Measures Conclusions

Outline

1 IntroductionSpatial StatisticsMethods of investigation

2 Spatial Point Pattern AnalysisPoint ProcessesValidity Domain

3 Applications of Clustering MeasuresFractal DimensionMultifractal MeasuresThe Morisita Index

4 Conclusions

Page 17: Spatial Statistics & R - EPFL · Introduction Spatial Point Pattern Analysis Applications of Clustering Measures Conclusions Point Processes Point Processes ß The events are represented

Introduction Spatial Point Pattern Analysis Applications of Clustering Measures Conclusions

Conclusions

ß The clustering measures allowed characterizing the spatial pattern ofcomplex point distributions such as environmental data.

ß Simulations within validity domains allowed quantifying the real clustering.No need for statistical significance tests.

ß R software allows creating new functions and developing and customisingexisting functions.

ß Developed functions:[ Morisita index and k -Morisita index[ Multifractal dimensions: generalized Rényi dimensions[ Multifractal dimensions: multifractal singularity spectrum[ Box-counting fractal dimension[ Allan factor[ Envelopes for CSR simulation patterns and permutations

Page 18: Spatial Statistics & R - EPFL · Introduction Spatial Point Pattern Analysis Applications of Clustering Measures Conclusions Point Processes Point Processes ß The events are represented

Introduction Spatial Point Pattern Analysis Applications of Clustering Measures Conclusions

Thank you for your attention!

Acknowledgements

This research was partly supported by the Swiss NFS project No. 200021-140658, "Analysis and modelling of space-time patterns in complex regions".