23
Disease Prevalence Estimates for Neighbourhoods: Combining Spatial Interpolation and Spatial Factor Models Peter Congdon, Queen Mary University of London [email protected] http://www.geog.qmul.ac.uk/staff/congdonp.html http://webspace.qmul.ac.uk/pcongdon/ 1

Disease Prevalence Estimates for Neighbourhoods: Combining Spatial Interpolation and Spatial Factor Models Peter Congdon, Queen Mary University of London

Embed Size (px)

Citation preview

Page 1: Disease Prevalence Estimates for Neighbourhoods: Combining Spatial Interpolation and Spatial Factor Models Peter Congdon, Queen Mary University of London

1

Disease Prevalence Estimates for Neighbourhoods: Combining

Spatial Interpolation and Spatial Factor Models

Peter Congdon, Queen Mary University of [email protected]

http://www.geog.qmul.ac.uk/staff/congdonp.htmlhttp://webspace.qmul.ac.uk/pcongdon/

Page 2: Disease Prevalence Estimates for Neighbourhoods: Combining Spatial Interpolation and Spatial Factor Models Peter Congdon, Queen Mary University of London

2

Data on disease prevalence

Health data may be collected across one spatial framework (e.g. health providers), but policy interest may be contrasts in health over another spatial framework (e.g. neighbourhoods).

Seek to use data for one framework to provide spatially interpolated estimates of disease prevalence for the other.

But also incorporate neighbourhood morbidity indicators that may also provide information on prevalence

Page 3: Disease Prevalence Estimates for Neighbourhoods: Combining Spatial Interpolation and Spatial Factor Models Peter Congdon, Queen Mary University of London

3

Data Framework

Focusing on England, prevalence totals for chronic diseases maintained by 8200 general practices for their populations (subject to measurement error, excess or deficits in “case-finding”). See Prevalence data tables at http://www.ic.nhs.uk/qof

These data not provided for any small area populations, e.g. 32000 neighbourhoods across England (Lower Super Output Areas or LSOAs)

Study focus: GP populations and LSOAs in Outer NE London (970K population) and on estimating neighbourhood psychosis prevalence

Page 4: Disease Prevalence Estimates for Neighbourhoods: Combining Spatial Interpolation and Spatial Factor Models Peter Congdon, Queen Mary University of London

4

London Borough Map

Page 5: Disease Prevalence Estimates for Neighbourhoods: Combining Spatial Interpolation and Spatial Factor Models Peter Congdon, Queen Mary University of London

5

Discrete Process Convolution

Use principles of discrete process convolution to estimate neighbourhood prevalence.

Geostatistical techniques (multivariate Gaussian process) computationally demanding for large number of units involved

Base Framework: Prevalence for GP Populations

Target Framework: Prevalence for Neighbourhoods

Page 6: Disease Prevalence Estimates for Neighbourhoods: Combining Spatial Interpolation and Spatial Factor Models Peter Congdon, Queen Mary University of London

6

Discrete Process Model

Page 7: Disease Prevalence Estimates for Neighbourhoods: Combining Spatial Interpolation and Spatial Factor Models Peter Congdon, Queen Mary University of London

7

Model for Base Framework, Study Data

Page 8: Disease Prevalence Estimates for Neighbourhoods: Combining Spatial Interpolation and Spatial Factor Models Peter Congdon, Queen Mary University of London

8

Model for Target Framework

Page 9: Disease Prevalence Estimates for Neighbourhoods: Combining Spatial Interpolation and Spatial Factor Models Peter Congdon, Queen Mary University of London

9

INCORPORATING OBSERVED INDICATORS of NEIGHBOURHOOD PREVALENCE

Page 10: Disease Prevalence Estimates for Neighbourhoods: Combining Spatial Interpolation and Spatial Factor Models Peter Congdon, Queen Mary University of London

10

SCHEMATIC REPRESENTATION

Page 11: Disease Prevalence Estimates for Neighbourhoods: Combining Spatial Interpolation and Spatial Factor Models Peter Congdon, Queen Mary University of London

11

LIKELIHOOD: REFLEXIVE INDICATORS

Page 12: Disease Prevalence Estimates for Neighbourhoods: Combining Spatial Interpolation and Spatial Factor Models Peter Congdon, Queen Mary University of London

12

PARAMETER IDENTIFICATION

Page 13: Disease Prevalence Estimates for Neighbourhoods: Combining Spatial Interpolation and Spatial Factor Models Peter Congdon, Queen Mary University of London

13

POTENTIAL SENSITIVITY IN INFERENCES & FIT

Sensitivity to kernel density choice Sensitivity to constraint adopted

(kernel scale set or known; process variance set or unknown)

Sensitivity to form of process effects: e.g. wj normal vs Student t

Sensitivity to density of discrete grid

Page 14: Disease Prevalence Estimates for Neighbourhoods: Combining Spatial Interpolation and Spatial Factor Models Peter Congdon, Queen Mary University of London

14

SPATIAL SENSITIVITY IN INTERPOLATED NEIGHBOURHOOD PREVALENCE

Can compare models in terms of localised hot spot probabilities of high psychosis risk Pr(k>1|y,h)>0.9

Or compare clustering of excess psychosis risk. Define binary indicators Jk=I(k>1)

Over MCMC iterations monitor excess risk in both neighbourhood k and its adjacent neighbourhoods l=1,..,Lk.

Ck is probability indicator of high risk cluster centred on neighbourhood k.

Page 15: Disease Prevalence Estimates for Neighbourhoods: Combining Spatial Interpolation and Spatial Factor Models Peter Congdon, Queen Mary University of London

15

Study Specifications

Locations: population centroids for GP populations and LSOAs

Grid set at 2km spacing, no grid point more than 2km from any neighbourhood centroid

Kernel form as in seed dispersal literature (e.g. Austerlitz et al, 2004; Clark et al, 1999), e.g. bivariate exponential with scale a and with distance d (from GP population or neighbourhood to discrete grid point) as argument is P(d|a)=)

Compare four models out of wide possible range of options

Page 16: Disease Prevalence Estimates for Neighbourhoods: Combining Spatial Interpolation and Spatial Factor Models Peter Congdon, Queen Mary University of London

16

Fit Comparisons

Page 17: Disease Prevalence Estimates for Neighbourhoods: Combining Spatial Interpolation and Spatial Factor Models Peter Congdon, Queen Mary University of London

17

Comparing Neighbourhood Spatial Risk Patterns

Page 18: Disease Prevalence Estimates for Neighbourhoods: Combining Spatial Interpolation and Spatial Factor Models Peter Congdon, Queen Mary University of London

18

OVERLAP AT NEIGHBOURHOOD LEVEL (K=562)

Page 19: Disease Prevalence Estimates for Neighbourhoods: Combining Spatial Interpolation and Spatial Factor Models Peter Congdon, Queen Mary University of London

19

Density plot (M4), prevalence rate

Page 20: Disease Prevalence Estimates for Neighbourhoods: Combining Spatial Interpolation and Spatial Factor Models Peter Congdon, Queen Mary University of London

20

Map of Interpolated Neighbourhood Prevalence under M4

Page 21: Disease Prevalence Estimates for Neighbourhoods: Combining Spatial Interpolation and Spatial Factor Models Peter Congdon, Queen Mary University of London

21

Map of Clustering Probabilities under M4 (posterior means of Ck)

Page 22: Disease Prevalence Estimates for Neighbourhoods: Combining Spatial Interpolation and Spatial Factor Models Peter Congdon, Queen Mary University of London

22

Future Research

Modify interpolation to include “formative” influences on prevalence (e.g. area deprivation)

How does model work with other chronic diseases, or with jointly dependent disease outcomes (e.g. diabetes, obesity)

Space-time prevalence models, etc

Page 23: Disease Prevalence Estimates for Neighbourhoods: Combining Spatial Interpolation and Spatial Factor Models Peter Congdon, Queen Mary University of London

23

References

Austerlitz C et al (2004) Using genetic markers to estimate the pollen dispersal curve Molecular Ecology, 13, 937–954

Clark J et al (1999) Seed dispersal near and far: patterns across temperate and tropical forests. Ecology, 80, 1475–1494.