90
Outline Preliminaries Random patterns Estimating intensities Second order properties Case study: Sea turtle nesting MODULE 14: Spatial Statistics in Epidemiology and Public Health Lecture 3: Point Processes Jon Wakefield and Lance Waller 1 / 90

MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

MODULE 14: Spatial Statistics in Epidemiologyand Public Health

Lecture 3: Point Processes

Jon Wakefield and Lance Waller

1 / 90

Page 2: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

PreliminariesRandom patterns

CSRMonte Carlo testingHeterogeneous Poisson process

Estimating intensitiesSecond order properties

K functionsEstimating K functionsEdge correctionMonte Carlo envelopes

Case study: Sea turtle nestingSea Turtle BiologyJuno Beach, FloridaDataComparing Nesting patternsPre- vs. post-nourishmentCase study conclusion

2 / 90

Page 3: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

References

I Baddeley, A., Rubak, E., and Turner. R. (2015) Spatial PointPatterns: Methodology and Applications in R. Boca Raton,FL: CRC/Chapman & Hall.

I Diggle, P.J. (1983) Statistical Analysis of Spatial PointPatterns. London: Academic Press.

I Diggle, P.J. (2013) Statistical Analysis of Spatial andSpatio-Temporal Point Patterns, Third EditionCRC/Chapman & Hall.

I Waller and Gotway (2004, Chapter 5) Applied SpatialStatistics for Public Health Data. New York: Wiley.

I Møller, J. and Waagepetersen (2004) Statistical Inference andSimulation for Spatial Point Processes. Boca Raton, FL:CRC/Chapman & Hall.

3 / 90

Page 4: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

Goals

I Describe basic types of spatial point patterns.

I Introduce mathematical models for random patterns of events.

I Introduce analytic methods for describing patterns in observedcollections of events.

I Illustrate the approaches using examples from archaeology,conservation biology, and epidermal neurology.

4 / 90

Page 5: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

CSRMonte Carlo testingHeterogeneous Poisson process

Terminology

I Event: An occurrence of interest (e.g., disease case).

I Event location: Where an event occurs.

I Realization: An observed set of event locations (a data set).

I Point: Any location in the study area.

I Point: Where an event could occur.

I Event: Where an event did occur.

5 / 90

Page 6: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

CSRMonte Carlo testingHeterogeneous Poisson process

Random patterns

I We use probability models to generate patterns so, in effect,all of the patterns we consider are “random”.

I Usually, “random pattern” refers to a pattern not influencedby the factors under investigation.

6 / 90

Page 7: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

CSRMonte Carlo testingHeterogeneous Poisson process

Complete spatial randomness (CSR)

I Start with a model of “lack of pattern”.

I Events equally likely to occur anywhere in the study area.

I Event locations independent of each other.

7 / 90

Page 8: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

CSRMonte Carlo testingHeterogeneous Poisson process

Six realizations of CSR

●●

0.0 0.2 0.4 0.6 0.8 1.0

0.0

0.2

0.4

0.6

0.8

1.0

u

v

●●

●●

●●

0.0 0.2 0.4 0.6 0.8 1.00.

00.

20.

40.

60.

81.

0

u

v

●●

●●

0.0 0.2 0.4 0.6 0.8 1.0

0.0

0.2

0.4

0.6

0.8

1.0

u

v

●●

●●

● ●

0.0 0.2 0.4 0.6 0.8 1.0

0.0

0.2

0.4

0.6

0.8

1.0

u

v ●

●●

●●

0.0 0.2 0.4 0.6 0.8 1.0

0.0

0.2

0.4

0.6

0.8

1.0

u

v

●●

●●

0.0 0.2 0.4 0.6 0.8 1.0

0.0

0.2

0.4

0.6

0.8

1.0

u

v

8 / 90

Page 9: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

CSRMonte Carlo testingHeterogeneous Poisson process

CSR as a boundary condition

CSR serves as a boundary between:

I Patterns that are more “clustered” than CSR.

I Patterns that are more “regular” than CSR.

9 / 90

Page 10: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

CSRMonte Carlo testingHeterogeneous Poisson process

Too Clustered (top), Too Regular (bottom)

0.0 0.2 0.4 0.6 0.8 1.0

0.0

0.2

0.4

0.6

0.8

1.0

u

v

Clustered

0.0 0.2 0.4 0.6 0.8 1.0

0.0

0.2

0.4

0.6

0.8

1.0

u

v

Clustered

●●

●●

●●

0.0 0.2 0.4 0.6 0.8 1.0

0.0

0.2

0.4

0.6

0.8

1.0

u

v

Clustered

0.0 0.2 0.4 0.6 0.8 1.0

0.0

0.2

0.4

0.6

0.8

1.0

u

v

Regular

●●

●●

●●

0.0 0.2 0.4 0.6 0.8 1.0

0.0

0.2

0.4

0.6

0.8

1.0

u

v

Regular

●●

●●

●●

0.0 0.2 0.4 0.6 0.8 1.0

0.0

0.2

0.4

0.6

0.8

1.0

u

v

Regular

10 / 90

Page 11: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

CSRMonte Carlo testingHeterogeneous Poisson process

The role of scale

I “Eyeballing” clustered/regular sometimes difficult.

I In fact, an observed pattern may be clustered at one spatialscale, and regular at another.

I Scale is an important idea and represents the distances atwhich the underlying process generating the data operates.

I Many ecology papers on estimating scale of a process (e.g.,plant disease, animal territories) but little in public healthliterature (so far).

11 / 90

Page 12: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

CSRMonte Carlo testingHeterogeneous Poisson process

Clusters of regular patterns/Regular clusters

●●

●●

●●

●●

0.0 0.2 0.4 0.6 0.8 1.0

0.0

0.2

0.4

0.6

0.8

1.0

u

v

● ●

●●

●●

●●

●●●

●●●

●●

●●

●●●

●●

●●

●●

●●

●●●

●●●

●●

●●

●●

Regular pattern of clusters

● ●

●● ●

0.0 0.2 0.4 0.6 0.8 1.0

0.0

0.2

0.4

0.6

0.8

1.0

u

v

● ●

●● ●● ●

●● ●

● ●

●● ●

● ●

●● ●

● ●

●● ●

● ●

●● ●

● ●

●● ●

● ●

●● ●

● ●

●● ●

● ●

●● ●

● ●

●● ●

● ●

●● ●

● ●

●● ●

● ●

●● ●

● ●

●● ●

● ●

●● ●

● ●

●● ●

● ●

●● ●

● ●

●● ●

Cluster of regular patterns

12 / 90

Page 13: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

CSRMonte Carlo testingHeterogeneous Poisson process

Spatial Point Processes

I Mathematically, we treat our point patterns as realizations ofa spatial stochastic process.

I A stochastic process is a collection of random variablesX1,X2, . . . ,XN .

I Examples: Number of people in line at grocery store.

I For us, each random variable represents an event location.

13 / 90

Page 14: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

CSRMonte Carlo testingHeterogeneous Poisson process

Stationarity/Isotropy

I Stationarity: Properties of process invariant to translation.

I Isotropy: Properties of process invariant to rotation around anorigin.

I Why do we need these? They provide a sort of replication inthe data that allows statistical estimation.

I Not required, but development easier.

14 / 90

Page 15: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

CSRMonte Carlo testingHeterogeneous Poisson process

CSR as a Stochastic Process

Let N(A) = number of events observed in region A, and λ = apositive constant.

A homogenous spatial Poisson point process is defined by:

(a) N(A) ∼ Pois(λ|A|)(b) given N(A) = n, the locations of the events are uniformly

distributed over A.

λ is the intensity of the process (mean number of events expectedper unit area).

15 / 90

Page 16: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

CSRMonte Carlo testingHeterogeneous Poisson process

Is this CSR?

I Criterion (b) describes our notion of uniform andindependently distributed in space.

I Criteria (a) and (b) give a “recipe” for simulating realizationsof this process:

* Generate a Poisson random number of events.* Distribute that many events uniformly across the study area.runif(n,min(x),max(x))

runif(n,min(y),max(y))

16 / 90

Page 17: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

CSRMonte Carlo testingHeterogeneous Poisson process

Compared to temporal Poisson process

Recall the three “magic” features of a Poisson process in time:

I Number of events in non-overlapping intervals are Poissondistributed,

I Conditional on the number of events, events are uniformlydistributed within a fixed interval, and

I Interevent times are exponentially distributed.

In space, no ordering so no (uniquely defined) interevent distances.

17 / 90

Page 18: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

CSRMonte Carlo testingHeterogeneous Poisson process

Spatial Poisson process equivalent

Criteria (a) and (b) equivalent to

1. # events in non-overlapping areas is independent

2. Let A = region, |A| = area of A

lim|A|→0

Pr[ exactly one event in A]

|A|= λ > 0

3.

lim|A|→0

Pr[2 or more in A]

|A|= 0

18 / 90

Page 19: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

CSRMonte Carlo testingHeterogeneous Poisson process

Interesting questions

I Test for CSR

I Simulate CSR

I Estimate λ (or λ(s))

I Compare λ1(s) and λ2(s); s ∈ D for two point processes.(same underlying intensity?

e.g. Z1(s) = location of disease cases

Z2(s) = location of population at risk

or environmental exposure levels.))

19 / 90

Page 20: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

CSRMonte Carlo testingHeterogeneous Poisson process

Monte Carlo hypothesis testing

Review of basic hypothesis testing framework:

I T= a random variable representing the test statistic.

I Under H0 : U ∼ F0.

I From the data, observe T = tobs (the observed value of thetest statistic).p-value = 1− F0(tobs)

I Sometimes it is easier to simulate F0(·) than to calculate F0(·)exactly.

Besag and Diggle (1977).

20 / 90

Page 21: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

CSRMonte Carlo testingHeterogeneous Poisson process

Steps in Monte Carlo testing

1. observe u1.

2. simulate u2, ..., um from F0.

3. p.value = rank of u1m .

For tests of CSR (complete spatial randomness), M.C. tests arevery useful since CSR is easy to simulate but distribution of teststatistic may be complex.

21 / 90

Page 22: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

CSRMonte Carlo testingHeterogeneous Poisson process

Monte Carlo tests very helpful

1. when distribution of U is complex but the spatial distributionassociated with H0 is easy to simulate.

2. for permutation tests

e.g., 592 cases of leukemia in ∼ 1 million people in 790regionsI permutation test requires all possible permutations.I Monte Carlo assigns cases to regions under H0.

22 / 90

Page 23: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

CSRMonte Carlo testingHeterogeneous Poisson process

Testing for CSR

A good place to start analysis because:

1. rejecting CSR a minimal prerequisite to model observedpattern (i.e. if not CSR, then there is a pattern to model)

2. tests aid in formulation of possible alternatives to CSR (e.g.regular, clustered).

3. attained significance levels measure strength of evidenceagainst CSR.

4. informal combination of several complementary tests to showhow pattern departs from CSR (although this has not beenexplored in depth).

23 / 90

Page 24: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

CSRMonte Carlo testingHeterogeneous Poisson process

Final CSR notes

CSR:

1. is the “white noise” of spatial point processes.

2. characterizes the absence of structure (signal) in data.

3. often the null hypothesis in statistical tests to determine ifthere is structure in an observed point pattern.

4. not as useful in public health? Why not?

24 / 90

Page 25: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

CSRMonte Carlo testingHeterogeneous Poisson process

Heterogeneous population density

●●

●●

●●

●●

●●

●●

0.0 0.2 0.4 0.6 0.8 1.0

0.0

0.2

0.4

0.6

0.8

1.0

u

v

●●

●●

●●

●●

●●

●●

0.0 0.2 0.4 0.6 0.8 1.0

0.0

0.2

0.4

0.6

0.8

1.0

u

v

25 / 90

Page 26: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

CSRMonte Carlo testingHeterogeneous Poisson process

What if intensity not constant?

Let d(x , y) = tiny region around (x , y)

λ(x , y) = lim|d(x ,y)|→0

E [N(d(x , y))]

|d(x , y)|

≈ lim|d(x ,y)|→0

Pr[N(d(x , y)) = 1]

|d(x , y)|

If we use λ(x , y) instead of λ, we get aHeterogeneous (Inhomogeneous) Poisson Process

26 / 90

Page 27: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

CSRMonte Carlo testingHeterogeneous Poisson process

Heterogeneous Poisson Process

1. N(A) = Pois(∫

(x ,y)∈A λ(x , y)d(x , y))

(|A| =∫(x ,y)∈A d(x , y))

2. Given N(A) = n, events distributed in A as an independentsample from a distribution on A with p.d.f. proportional toλ(x , y).

27 / 90

Page 28: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

CSRMonte Carlo testingHeterogeneous Poisson process

Example intensity function

u

v

lambda

u

v

0 5 10 15 20

05

1015

20

28 / 90

Page 29: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

CSRMonte Carlo testingHeterogeneous Poisson process

Six realizations

●●

● ●

●●

0 5 10 15 20

05

1015

20

u

v

●●

●●

●●

●●

● ●

● ●

●●

●●

●●

0 5 10 15 200

510

1520

u

v ●

● ●

●●

●●

● ●

0 5 10 15 20

05

1015

20

u

v●

●●

●●

●●

●●

●●

●●

● ●

● ●

0 5 10 15 20

05

1015

20

u

v

●●

● ●

●●

●●

0 5 10 15 20

05

1015

20

u

v

●●

● ●

● ●

● ●

●●

● ●

●●

●●

0 5 10 15 20

05

1015

20

u

v

29 / 90

Page 30: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

CSRMonte Carlo testingHeterogeneous Poisson process

IMPORTANT FACT!

Without additional information, no analysis can differentiatebetween:

1. independent events in a heterogeneous (non-stationary)environment

2. dependent events in a homogeneous (stationary) environment

30 / 90

Page 31: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

How do we estimate intensities?

For an heterogeneous Poisson process, λ(s) is closely related to thedensity of events over A (λ(s) ∝ density).

So density estimators provide a natural approach (for details ondensity estimation see Silverman (1986) and Wand and Jones(1995, KernSmooth R library)).

Main idea: Put a little “kernel” of density at each data point, thensum to give the estimate of the overall density function.

31 / 90

Page 32: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

What you need

In order to do kernel estimation, you need to choose:

1. kernel (shape) - Epeuechnikov (1986) shows any reasonablekernel gives near optimal results.

2. “bandwidth” (range of influence). The larger b, thebandwidth, the smoother the estimated function.

32 / 90

Page 33: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

Kernels and bandwidths

s

0.0 0.2 0.4 0.6 0.8 1.0

Kernel variance = 0.02

s

0.0 0.2 0.4 0.6 0.8 1.0

Kernel variance = 0.03

s

0.0 0.2 0.4 0.6 0.8 1.0

Kernel variance = 0.04

s

0.0 0.2 0.4 0.6 0.8 1.0

Kernel variance = 0.1

33 / 90

Page 34: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

1-dim kernel estimation

If we have observations u1, u2, . . . , uN (in one dimension), thekernel density estimate is

f (u) =1

Nb

N∑i=1

Kern

(u − ui

b

)(1)

where Kern(·) is a kernel function satisfying∫D

Kern(s)ds = 1

and b the bandwidth. To estimate the intensity function, replaceN−1 by |D|−1.

34 / 90

Page 35: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

What do we do with this?

I Evaluate intensity (density) at each of a grid of locations.

I Make surface or contour plot.

35 / 90

Page 36: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

How do we pick the bandwidth?

I Circular question...

I Minimize the Mean Integrated Squared Error (MISE)

I Cross validation

I Scott’s rule (good rule of thumb).

bu = σuN−1/(dim+4) (2)

where σu is the sample standard deviation of theu-coordinates, N represents the number of events in the dataset (the sample size), and dim denotes the dimension of thestudy area.

36 / 90

Page 37: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

Kernel estimation in R

base

I density() one-dimensional kernel

library(MASS)

I kde2d(x, y, h, n = 25, lims = c(range(x),

range(y)))

library(KernSmooth)

I bkde2D(x, bandwidth, gridsize=c(51, 51),

range.x=<<see below>>, truncate=TRUE) block kerneldensity estimation

I dpik() to pick bandwidth

37 / 90

Page 38: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

More kernel estimation in R

library(splancs)

I kernel2d(pts,poly,h0,nx=20,

ny=20,kernel=’quartic’)

library(spatstat)

I ksmooth.ppp(x, sigma, weights, edge=TRUE)

38 / 90

Page 39: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

Data Break: Early Medieval Grave Sites

What do we have?

I Alt and Vach (1991). (Data sent from Richard WrightEmeritus Professor, School of Archaeology, University ofSydney.)

I Archeaological dig in Neresheim, Baden-Wurttemberg,Germany.

I The anthropologists and archaeologists involved wonder if thisparticular culture tended to place grave sites according tofamily units.

I 143 grave sites.

39 / 90

Page 40: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

What do we want?

I 30 with missing or reduced wisdom teeth (“affected”).

I Does the pattern of “affected” graves (cases) differ from thepattern of the 113 non-affected graves (controls)?

I How could estimates of the intensity functions for the affectedand non-affected grave sites, respectively, help answer thisquestion?

40 / 90

Page 41: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

Outline of analysis

I Read in data.

I Plot data (axis intervals important!).

I Call 2-dimensional kernel smoothing functions (choose kerneland bandwidth).

I Plot surface (persp()) and contour (contour()) plots.

I Visual comparison of two intensities.

41 / 90

Page 42: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

Plot of the data

*

*

**

**

*

** * *

**

*

*

*

*

*

*

**

*

*

**

**

*

*

*

***

**

*

*

*

*

*

*

*

**

*

*

*

*

*

*

*

**

*

*

*

**

**

*

**

*

*

*

**

*

*

*

**

*

*

*

***

* *

**

*

*

*

*

**

**

*

**

*

*

*

**

*

*

*

*

**

*

*

*

*

*

*

**

** *

*

*

*

*

**

*

*

*

*

****

*

*

*

**

*

*

*

***

*

*

4000 6000 8000 10000

4000

6000

8000

10000

u

v

Grave locations (*=grave, O=affected)

OO

O

O

O

O

O

O

OO

OOO

O

O

OO

O

O

O

O

O

O

O

O

OO

OO

O

42 / 90

Page 43: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

Case intensity

u

v

Intensity

Estimated intensity function

**

*

*

*

*

*

*

* *

***

*

*

**

**

*

*

*

**

*** **

*

4000 8000

4000

8000

u

v

Affected grave locations

43 / 90

Page 44: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

Control intensity

u

v

Intensity

Estimated intensity function

*

*

**

*

** * *

**

*

*

*

*

*

**

*

*

**

**

*

*

***

***

*

**

**

*

*

*

*

*

*

**

*

*

*

*

*

**

*

*

**

*

*

*

*

*

** *

***

*

*

***

*

*

*

**

*

*

*

*

**

*

*

*

*

*

*

** *

*

*

*

**

*

*

**

****

*

*

**

*

*

*

*

4000 6000 8000 10000

4000

6000

8000

1000

0

u

v

Non−affected grave locations

44 / 90

Page 45: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

What we have/don’t have

I Kernel estimates suggest similarities and differences.

I Suggest locations where there might be differences.

I No significance testing (yet!)

45 / 90

Page 46: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

K functionsEstimating K functionsEdge correctionMonte Carlo envelopes

First and Second Order Properties

I The intensity function describes the mean number of eventsper unit area, a first order property of the underlying process.

I What about second order properties relating to thevariance/covariance/correlation between event locations (ifevents non independent...)?

46 / 90

Page 47: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

K functionsEstimating K functionsEdge correctionMonte Carlo envelopes

Ripley’s K function

Ripley (1976, 1977 introduced) the reduced second momentmeasure or K function

K (h) =E [# events within h of a randomly chosen event]

λ,

for any positive spatial lag h.

NOTE: Use of λ implies assumption of stationary process!

47 / 90

Page 48: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

K functionsEstimating K functionsEdge correctionMonte Carlo envelopes

Properties of K (h)

I Ripley (1977) shows specifying K (h) for all h > 0, equivalentto specifying Var[N(A)] for any subregion A.

I Under CSR, K (h) = πh2 (area of circle of with radius h).

I Clustered? K (h) > πh2.

I Regular? K (h) < πh2.

48 / 90

Page 49: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

K functionsEstimating K functionsEdge correctionMonte Carlo envelopes

Second order intensity?

I K (h) not universally hailed as the way (or even a good way)to describe second order properties.

I Provides a nice introduction for us, but another relatedproperty is...

I The second order intensity, λ2(s,u),

λ2(s,u) = lim|d(s )|→0

|d(u)|→0

E (N(d(s))N(d(u))

|d(s)||d(u)|

49 / 90

Page 50: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

K functionsEstimating K functionsEdge correctionMonte Carlo envelopes

Relationship to K (h)

I How does K (·) relate to λ2(s,u)?

In <2, for a stationary, isotropic processλ2(s,u) = λ2(‖s − u‖). Then,

λK (h) =2π

λ

∫ h

0

uλ2(u)du

So

λ2(h) =λ2K ′(h)

2πh.

I Which to use (K (·) or λ2(·))?

I In theory, λ2(h) often used but K (h) is easier to estimatefrom a set of data.

50 / 90

Page 51: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

K functionsEstimating K functionsEdge correctionMonte Carlo envelopes

Estimating K (h)

Start with definition, replacing expectation with average yields

K (h) =1

λN

N∑i=1

N∑j=1

j 6=i

δ(d(i , j) < h),

for N events, where

I d(i , j) = distance between events i and j

I δ(d(i , j) < h) = 1 if d(i , j) < h, 0 otherwise.

51 / 90

Page 52: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

K functionsEstimating K functionsEdge correctionMonte Carlo envelopes

Edges

I Think about events near “edges” of study area.

I How do we count events within h if distance between edgeand observed event is < h?

I Unobservable data drive increasing.

I More of a problem as h increases...

I “Edge effects” call for “edge correction”.

52 / 90

Page 53: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

K functionsEstimating K functionsEdge correctionMonte Carlo envelopes

Edge correction

I Add “guard area” around study area and only calculate K (h)for h < width of guard area.

I Toroidal correction.

I Ripley’s (1976) edge correction

Kec(h) = λ−1N∑i=1

N∑j=1

j 6=i

w−1ij δ(d(i , j) < h)

where wij = proportion of the circumference of the circlecentered at event i with radius d(i , j) within the study area.

53 / 90

Page 54: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

K functionsEstimating K functionsEdge correctionMonte Carlo envelopes

Ripley’s weights

I Conceptually, conditional probability of observing an event atdistance d(i , j) given an event occurs d(i , j) from event i .

I Works for “holes” in study area too. (Astronomy application).

I Requires definition of study boundary (and way of calculatingwij).

54 / 90

Page 55: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

K functionsEstimating K functionsEdge correctionMonte Carlo envelopes

Calculating K (h) in R

library(splancs)

I khat(pts,poly,s,newstyle=FALSE)

I poly defines polygon boundary (important!!!).

library(spatstat)

I Kest(X, r, correction=c("border", "isotropic",

"Ripley", "translate"))

I Boundary part of X (point process “object”).

55 / 90

Page 56: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

K functionsEstimating K functionsEdge correctionMonte Carlo envelopes

Plots with K (h)

I Plotting (h,K (h)) for CSR is a parabola.

I K (h) = πh2 implies (K (h)

π

)1/2

= h.

I Besag (1977) suggests plotting

h versus L(h)

where

L(h) =

(Kec(h)

π

)1/2

− h

56 / 90

Page 57: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

K functionsEstimating K functionsEdge correctionMonte Carlo envelopes

Variability and Envelopes

Are there any distributional results for K (h)?

Some, but mostly for particular region shapes. Monte Carloapproaches more general.

I Observe K (h) from data.

I Simulate a realization of events from CSR.

I Find K (h) for the simulated data.

I Repeat simulations many times.

I Create simulation “envelopes” from simulation-based K (h)’s.

57 / 90

Page 58: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

K functionsEstimating K functionsEdge correctionMonte Carlo envelopes

Example: Regular clusters and clusters of regularity

0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7

−0.

10.

00.

10.

20.

3

Distance (h)

sqrt

(Kha

t/pi)

− h

Estimated K function, regular pattern of clusters

0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7

−0.

10.

00.

10.

20.

3

Distance (h)

sqrt

(Kha

t/pi)

− h

Estimated K function, cluster of regular patterns

58 / 90

Page 59: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

K functionsEstimating K functionsEdge correctionMonte Carlo envelopes

Data break: Medieval gravesites

0 1000 2000 3000 4000 5000

−40

040

0

Distance

Lhat

− d

ista

nce

L plot for all gravesites, rectangle

0 1000 2000 3000 4000 5000

−40

040

0

Distance

Lhat

− d

ista

nce

L plot for affected gravesites, rectangle

0 1000 2000 3000 4000 5000

−40

040

0

Distance

Lhat

− d

ista

nce

L plot for non−affected gravesites, rectangle

59 / 90

Page 60: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

K functionsEstimating K functionsEdge correctionMonte Carlo envelopes

Clustering!

I Looks like strong clustering, but wait...

I Compares all, cases, controls to CSR.

I Oops, forgot to adjust for polygon, clustering inside squarearea!

I Significant clustering, but interesting clustering?

I Let’s try again with polygon adjustment.

60 / 90

Page 61: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

K functionsEstimating K functionsEdge correctionMonte Carlo envelopes

Medieval graves: K functions with polygon adjustment

0 1000 2000 3000 4000 5000

−40

040

0

Distance

Lhat

− d

ista

nce

L plot for all gravesites, polygon

0 1000 2000 3000 4000 5000

−40

040

0

Distance

Lhat

− d

ista

nce

L plot for affected gravesites, polygon

0 1000 2000 3000 4000 5000

−40

040

0

Distance

Lhat

− d

ista

nce

L plot for non−affected gravesites, polygon

61 / 90

Page 62: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

K functionsEstimating K functionsEdge correctionMonte Carlo envelopes

Clustering?

I Clustering of cases at very shortest distances.

I Likely due to two coincident-pair sites (both cases in bothpairs).

I Could we try random labelling here?

I Construct envelopes based on random samples of 30 ”cases”from set of 143 locations.

62 / 90

Page 63: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

K functionsEstimating K functionsEdge correctionMonte Carlo envelopes

A few notes

I Since K (h) is a function of distance (h), it can indicateclustering at one scale and regularity at another.

I K (h) measures cumulative aggregation (# events up to h).May result in delayed response to change from clustering toregularity.

I λ2 (or similar measures) based on K ′(h) may respond moreinstantaneously.

I Often see “envelopes” based on extremes, but what does thismean with respect to increasing numbers of simulations?Quantiles may be better (at least they will converge tosomething).

63 / 90

Page 64: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

K functionsEstimating K functionsEdge correctionMonte Carlo envelopes

Testing

I Envelopes provide pointwise inference at particular values ofh, but not correct to find deviation then assess whether it issignificant.

I Can conduct test of H0 : L(h) = h (equivalent toH0 : K (h) = πh2) for a predefined interval for h (Stoyan etal., 1995, p. 51).

I Monte Carlo test based on

T = max0≤h≤hmax

∣∣∣L(h)− h∣∣∣ .

64 / 90

Page 65: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

K functionsEstimating K functionsEdge correctionMonte Carlo envelopes

Notes

I Just as the first and second moments do not uniquely define adistribution, λ(s) and K (h) do not uniquely define a spatialpoint pattern (Baddeley and Silverman 1984, and in Section5.3.4 ).

I Analyses based on λ(s) typically assume independent events,describing the observed pattern entirely through aheterogeneous intensity.

I Analyses based on K (h) typically assume a stationary process(with constant λ), describing the observed pattern entirelythrough correlations between events.

I Remember Barlett’s (1964) result (IMPORTANT FACT!above).

65 / 90

Page 66: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

K functionsEstimating K functionsEdge correctionMonte Carlo envelopes

What questions can we answer?

I Are events uniformly distributed in space?I Test CSR.

I If not, where are events more or less likely?I Intensity esimtation.

I Do events tend to occur near other events, and, if so, at whatscale?I K functions with Monte Carlo envelopes.

66 / 90

Page 67: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

Sea Turtle BiologyJuno Beach, FloridaDataComparing Nesting patternsPre- vs. post-nourishmentCase study conclusion

Sea turtle biology

I Existed (at least) since Jurassic period (∼ 200-120 mya).

I Air breathing reptile.

I Spends life in ocean, returns to natal beach to lay eggs.

I Lays eggs on land (∼ 100 eggs per clutch, buried in sand).

I Hatchlings appear en masse, head to the ocean.

67 / 90

Page 68: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

Sea Turtle BiologyJuno Beach, FloridaDataComparing Nesting patternsPre- vs. post-nourishmentCase study conclusion

Sea Turtle Species

68 / 90

Page 69: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

Sea Turtle BiologyJuno Beach, FloridaDataComparing Nesting patternsPre- vs. post-nourishmentCase study conclusion

Consider the hatchling

I Life fraught with peril.

I On menu for crabs, gulls, etc.

I Goal 1: Get to the ocean. Follow the light!

69 / 90

Page 70: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

Sea Turtle BiologyJuno Beach, FloridaDataComparing Nesting patternsPre- vs. post-nourishmentCase study conclusion

Sea Turtles and Humans: Regulatory implications

I All species “threatened” or “endangered”.

I Regulations in place for fisheries (TEDs), beach-front lighting,beach-front development.

I East coast of Florida (USA), prime nesting site for loggerheadturtles.

I Also popular among green turtles (biannual cycle).

I We consider impact of two different construction projects onJuno Beach, Florida.

70 / 90

Page 71: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

Sea Turtle BiologyJuno Beach, FloridaDataComparing Nesting patternsPre- vs. post-nourishmentCase study conclusion

Two construction projects

I Construction of 990-foot fishing pier in 1998.

I Environmental impact statement approved if collect data todetermine impact on sea turtle nesting.

I Beach nourishment project 2000.

I Dredge up sand offshore and rebuild beach.

I Past studies on impact of beach nourishment on turtle nesting(reduced for 2 nesting seasons, off-shore slope impact?)

71 / 90

Page 72: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

Sea Turtle BiologyJuno Beach, FloridaDataComparing Nesting patternsPre- vs. post-nourishmentCase study conclusion

“Standard” approach

I Patrol beach each morning during nesting season.

I Count trackways in each Index Nesting Beach Zone.

I Compare distribution of counts between years.

72 / 90

Page 73: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

Sea Turtle BiologyJuno Beach, FloridaDataComparing Nesting patternsPre- vs. post-nourishmentCase study conclusion

Juno Beach data

I Marinelife Center of Juno Beach volunteers patrol beach eachmorning.

I Differential GPS coordinates for apex of each “crawl”.

I Store: Date. Nest? Species. Predation?

I Approximately 10,000 points per year.

73 / 90

Page 74: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

Sea Turtle BiologyJuno Beach, FloridaDataComparing Nesting patternsPre- vs. post-nourishmentCase study conclusion

Juno Beach Data

74 / 90

Page 75: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

Sea Turtle BiologyJuno Beach, FloridaDataComparing Nesting patternsPre- vs. post-nourishmentCase study conclusion

Loggerhead Displacement from Beach

75 / 90

Page 76: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

Sea Turtle BiologyJuno Beach, FloridaDataComparing Nesting patternsPre- vs. post-nourishmentCase study conclusion

Displacement of Green Emergences

−20000 −15000 −10000 −5000 0 5000 10000

−10

00

100

200

Feet along beach

Dis

plac

emen

t fro

m lo

ess

Displacement of nest sites to loess beach: Greens

1 2 3 4 5 6 7 8 9 10 11

Nourishment ProjectPre−pier (1997−1998)Pre−Nourishment (1999−2000)Post−Nourishment (2001−2002)

76 / 90

Page 77: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

Sea Turtle BiologyJuno Beach, FloridaDataComparing Nesting patternsPre- vs. post-nourishmentCase study conclusion

“Linearizing” the beach

I While we have sub-meter accurate GPS locations, depth onbeach of secondary interest.

I We want to know where differences in nesting pattern occurand whether these observed differences are significant.

I Treat emergence locations as realization of a spatial pointprocess in one dimension (events on a line).

I Ignore depth on beach by projecting all points to loess curverepresenting beach.

I Three time periods of interest: Pre-pier (1997-1998),pre-nourishment (post-pier) (1999-2000), post-nourishment(2001-2002).

77 / 90

Page 78: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

Sea Turtle BiologyJuno Beach, FloridaDataComparing Nesting patternsPre- vs. post-nourishmentCase study conclusion

Kernel estimates for each time period

I Let λ(s) denote intensity function of point process (meannumber of events per unit distance).

I Intensity proportional to pdf.

I Data: locations s1, . . . , sn.

I Estimate via kernel estimation:λ(s) = 1

nb

∑ni=1 Kern

[(s−s i )

b

]I Kern(·) is kernel function, b bandwidth (governing

smoothness).

78 / 90

Page 79: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

Sea Turtle BiologyJuno Beach, FloridaDataComparing Nesting patternsPre- vs. post-nourishmentCase study conclusion

Ratio of kernel estimates

I Kelsall and Diggle (1995, Stat in Med) consider ratio of twointensity estimates as spatial measure of relative risk.

I Suppose we have n1 “before” events, n2 “after” events.

I λ1(s), λ2(s) associated before/after intensity functions.

I Conditional on n1 and n2, data equivalent to two randomsamples from pdfs:f1(s) = λ1(s)/

∫λ1(u)du and

f2(s) = λ2(s)/∫λ2(u)du.

79 / 90

Page 80: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

Sea Turtle BiologyJuno Beach, FloridaDataComparing Nesting patternsPre- vs. post-nourishmentCase study conclusion

Ratio of kernel estimates

I Consider log relative risk

r(s) = logf1(s)

f2(s).

I Some algebra yields:

r(s) = log

[λ1(s)

λ2(s)

]− log

[∫λ1(u)du∫λ2(u)du

].

I H0 : r(s) = 0 for all s.

I How do we test this?

80 / 90

Page 81: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

Sea Turtle BiologyJuno Beach, FloridaDataComparing Nesting patternsPre- vs. post-nourishmentCase study conclusion

Monte Carlo inference

I Conditional on all n1 + n2 locations, randomize n1 to“before”, n2 to “after”.

I Calculate f ∗(s) and g∗(s), get ratio r∗(s).

I Repeat many (999) times.

I Calculate pointwise 95% tolerance region.

81 / 90

Page 82: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

Sea Turtle BiologyJuno Beach, FloridaDataComparing Nesting patternsPre- vs. post-nourishmentCase study conclusion

Densities

0 5000 10000 15000 20000 25000 30000

0 e

+00

2 e

−05

4 e

−05

6 e

−05

x=Distance along beach (ft)

dens

ity

Nesting locations along beach

pre−pier CC (loggerhead)pre−pier CM (green)pre−nourishment CC (loggerhead)pre−nourishment CM (green)post−nourishment CC (loggerhead)post−nourishment CM (green)

1 2 3 4 5 6 7 8 9 10 11

Nourishment Project

82 / 90

Page 83: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

Sea Turtle BiologyJuno Beach, FloridaDataComparing Nesting patternsPre- vs. post-nourishmentCase study conclusion

Pre- vs. post-pier, Loggerheads

0 5000 10000 15000 20000 25000 30000

0 e

+00

5 e

−05

Feet along beach

Den

sitie

s

Pre−pier vs. pre−nourishment, Loggerheads

Pre−pier (1997−1998)Pre−nourishment (1999−2000)

0 5000 10000 15000 20000 25000 30000

0.0

1.0

2.0

Feet along beach

Rat

io o

f Den

sitie

s

1 2 3 4 5 6 7 8 9 10 11

Nourishment Project

83 / 90

Page 84: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

Sea Turtle BiologyJuno Beach, FloridaDataComparing Nesting patternsPre- vs. post-nourishmentCase study conclusion

Pre- vs. post-pier, Greens

0 5000 10000 15000 20000 25000 30000

0 e

+00

5 e

−05

Feet along beach

Den

sitie

s

Pre−pier vs. pre−nourishment, Greens

Pre−pier (1997−1998)Pre−nourishment (1999−2000)

0 5000 10000 15000 20000 25000 30000

0.0

1.0

2.0

Feet along beach

Rat

io o

f Den

sitie

s

1 2 3 4 5 6 7 8 9 10 11

Nourishment Project

84 / 90

Page 85: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

Sea Turtle BiologyJuno Beach, FloridaDataComparing Nesting patternsPre- vs. post-nourishmentCase study conclusion

Differences

I Green densities smoother (wider bandwidth due to smallersample size).

I Clear impact of pier on loggerheads (but very local).

I Conservation strategy? Maintain protected areas nearby.

I Note: impact on all crawls (nesting and non-nesting).

85 / 90

Page 86: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

Sea Turtle BiologyJuno Beach, FloridaDataComparing Nesting patternsPre- vs. post-nourishmentCase study conclusion

Pre- vs. post-nourishment, Loggerheads

0 5000 10000 15000 20000 25000 30000

0 e

+00

5 e

−05

Feet along beach

Den

sitie

s

Pre−nourishment vs. post−nourishment, Loggerheads

Pre−pier (1997−1998)Pre−nourishment (1999−2000)

0 5000 10000 15000 20000 25000 30000

0.0

1.0

2.0

Feet along beach

Rat

io o

f Den

sitie

s

1 2 3 4 5 6 7 8 9 10 11

Nourishment Project

86 / 90

Page 87: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

Sea Turtle BiologyJuno Beach, FloridaDataComparing Nesting patternsPre- vs. post-nourishmentCase study conclusion

Pre- vs. post-nourishment, Greens

0 5000 10000 15000 20000 25000 30000

0 e

+00

5 e

−05

Feet along beach

Den

sitie

s

Pre−nourishment vs. post−nourishment, Greens

Pre−pier (1997−1998)Pre−nourishment (1999−2000)

0 5000 10000 15000 20000 25000 30000

0.0

1.0

2.0

Feet along beach

Rat

io o

f Den

sitie

s

1 2 3 4 5 6 7 8 9 10 11

Nourishment Project

87 / 90

Page 88: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

Sea Turtle BiologyJuno Beach, FloridaDataComparing Nesting patternsPre- vs. post-nourishmentCase study conclusion

Differences

I Reduced nesting in nourishment zone, increases just to south.

I Offshore current suggests approach from north (left on plot).

I Impact on all crawls (nesting and non-nesting).

I Suggests impact of nourishment before turtle emerges. Slope?

88 / 90

Page 89: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

Sea Turtle BiologyJuno Beach, FloridaDataComparing Nesting patternsPre- vs. post-nourishmentCase study conclusion

Conclusions

Pairwise comparisons suggest:

I Local impact of pier for loggerheads, but not for greens.

I Shift to south in emergences (nesting and non-nesting) due tonourishment for both loggerheads and greens.

89 / 90

Page 90: MODULE 14: Spatial Statistics in Epidemiology and Public ...faculty.washington.edu/jonno/SISMIDmaterial/2019_SISMID_14_3.pdfI Diggle, P.J. (2013) Statistical Analysis of Spatial and

OutlinePreliminaries

Random patternsEstimating intensities

Second order propertiesCase study: Sea turtle nesting

Sea Turtle BiologyJuno Beach, FloridaDataComparing Nesting patternsPre- vs. post-nourishmentCase study conclusion

Questions?

90 / 90