37
Fine-scale spatiotemporal air pollution analysis using mobile monitors on Google Street View vehicles Brian Reich Department of Statistics, NCSU Department of Biological and Agricultural Engineering, NCSU, 2020 Brian Reich, NC State Spatiotemporal methods for mobile monitoring data 1 / 37

Fine-scale spatiotemporal air pollution analysis using ... · New sources of monitoring data I Air pollution epidemiology relies on a few stationary monitors per city I The field

  • Upload
    others

  • View
    0

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Fine-scale spatiotemporal air pollution analysis using ... · New sources of monitoring data I Air pollution epidemiology relies on a few stationary monitors per city I The field

Fine-scale spatiotemporal air pollution analysisusing mobile monitors on Google Street View

vehicles

Brian Reich

Department of Statistics, NCSU

Department of Biological and AgriculturalEngineering, NCSU, 2020

Brian Reich, NC State Spatiotemporal methods for mobile monitoring data 1 / 37

Page 2: Fine-scale spatiotemporal air pollution analysis using ... · New sources of monitoring data I Air pollution epidemiology relies on a few stationary monitors per city I The field

Collaborators

Yawen Guan, Margaret Johnson, Matthias Katzfuss, ElizabethMannshardt, Kyle Messier and Joon Jin Song

Brian Reich, NC State Spatiotemporal methods for mobile monitoring data 2 / 37

Page 3: Fine-scale spatiotemporal air pollution analysis using ... · New sources of monitoring data I Air pollution epidemiology relies on a few stationary monitors per city I The field

Classic air pollution monitoring scheme

I Since the clean air act of 1970, the US EPA has beenmonitoring and regulating air pollution

I They rely on a small number of stationary monitorscollecting daily data

I These data are used to

I Uphold national air quality standards

I Follow trends over space and time

I Study health effects of air pollution

Brian Reich, NC State Spatiotemporal methods for mobile monitoring data 3 / 37

Page 4: Fine-scale spatiotemporal air pollution analysis using ... · New sources of monitoring data I Air pollution epidemiology relies on a few stationary monitors per city I The field

Stationary N02 monitors in NC1

1https://deq.nc.gov/about/divisions/air-quality/air-quality-monitoringBrian Reich, NC State Spatiotemporal methods for mobile monitoring data 4 / 37

Page 5: Fine-scale spatiotemporal air pollution analysis using ... · New sources of monitoring data I Air pollution epidemiology relies on a few stationary monitors per city I The field

US average NO2 by year2

2https://www.epa.gov/air-trends/nitrogen-dioxide-trendsBrian Reich, NC State Spatiotemporal methods for mobile monitoring data 5 / 37

Page 6: Fine-scale spatiotemporal air pollution analysis using ... · New sources of monitoring data I Air pollution epidemiology relies on a few stationary monitors per city I The field

Nonattainment counties3

3https://www3.epa.gov/airquality/greenbook/mapnpoll.htmlBrian Reich, NC State Spatiotemporal methods for mobile monitoring data 6 / 37

Page 7: Fine-scale spatiotemporal air pollution analysis using ... · New sources of monitoring data I Air pollution epidemiology relies on a few stationary monitors per city I The field

Epidemiological studies

I A common approach is to regress daily health outcomesonto city average daily air pollution4

I A more expensive approach is to estimate individualexposure for a small number of subjects5

I Health effects are also studied using controlledexperiments6

4e.g., Dominici et al, JAMA, 20065e.g., Larkin and Hystad, CEHR, 20176https://www.nap.edu/catalog/24618/controlled-human-inhalation-

exposure-studies-at-epaBrian Reich, NC State Spatiotemporal methods for mobile monitoring data 7 / 37

Page 8: Fine-scale spatiotemporal air pollution analysis using ... · New sources of monitoring data I Air pollution epidemiology relies on a few stationary monitors per city I The field

Epidemiological studies

These studies have established links between air pollution andseveral adverse health outcomes including:

I Cardiovascular diseases

I Respiratory diseases

I Cancer

I Pre-term birth

An estimated 4.2 million premature deaths globally are linked toambient air pollution7

7https://www.who.int/airpollution/ambient/health-impacts/en/Brian Reich, NC State Spatiotemporal methods for mobile monitoring data 8 / 37

Page 9: Fine-scale spatiotemporal air pollution analysis using ... · New sources of monitoring data I Air pollution epidemiology relies on a few stationary monitors per city I The field

New sources of monitoring data

I Air pollution epidemiology relies on a few stationarymonitors per city

I The field is undergoing a paradigm shift due tofine-resolution mobile monitors

I We analyze data collected from a car driving around thecity and continuously measuring air pollution

I Some cities now have thousands of low-cost stationarysensors

I Phone apps are under development

Brian Reich, NC State Spatiotemporal methods for mobile monitoring data 9 / 37

Page 10: Fine-scale spatiotemporal air pollution analysis using ... · New sources of monitoring data I Air pollution epidemiology relies on a few stationary monitors per city I The field

Data from Google StreetView cars8

8Photo: Apte (2017)Brian Reich, NC State Spatiotemporal methods for mobile monitoring data 10 / 37

Page 11: Fine-scale spatiotemporal air pollution analysis using ... · New sources of monitoring data I Air pollution epidemiology relies on a few stationary monitors per city I The field

Mobile data: Oakland NO2

I Two cars were deployed from June 2015 to May 2016

I Starts on weekdays at ≈ 9am, drove ≈ 6-8 hours each day

I Measurements are taken at roughly every second.

I Large missing data (car maintenance, sensor failure, etc.)

Brian Reich, NC State Spatiotemporal methods for mobile monitoring data 11 / 37

Page 12: Fine-scale spatiotemporal air pollution analysis using ... · New sources of monitoring data I Air pollution epidemiology relies on a few stationary monitors per city I The field

Example daily observations of log(NO2)

I Car A and B drove from 8am - 2pmI 12,389 observations, covered less than a third of Oakland

Brian Reich, NC State Spatiotemporal methods for mobile monitoring data 12 / 37

Page 13: Fine-scale spatiotemporal air pollution analysis using ... · New sources of monitoring data I Air pollution epidemiology relies on a few stationary monitors per city I The field

Objectives

Develop a statistical model for real-time, high resolutionforecasting of air pollution

I Can we develop accurate maps for epi studies?

I Can we make forecasts that help people avoid exposure?

I How far ahead can we reasonably forecast air pollution?

I How many cars should be deployed future studies?

I Are cars more efficient than stationary monitors?

Brian Reich, NC State Spatiotemporal methods for mobile monitoring data 13 / 37

Page 14: Fine-scale spatiotemporal air pollution analysis using ... · New sources of monitoring data I Air pollution epidemiology relies on a few stationary monitors per city I The field

Statistical challenges

I Data are large (n ≈900,000 observations)

I Data are streaming

I Data are extremely sparse in space and time

I Data are noisy and subject to outliers

I Process is likely dynamic and nonstationary

Brian Reich, NC State Spatiotemporal methods for mobile monitoring data 14 / 37

Page 15: Fine-scale spatiotemporal air pollution analysis using ... · New sources of monitoring data I Air pollution epidemiology relies on a few stationary monitors per city I The field

Temporal aggregation

We took temporal block medians to dampen effects of extremes

Brian Reich, NC State Spatiotemporal methods for mobile monitoring data 15 / 37

Page 16: Fine-scale spatiotemporal air pollution analysis using ... · New sources of monitoring data I Air pollution epidemiology relies on a few stationary monitors per city I The field

Example landuse covariates

Brian Reich, NC State Spatiotemporal methods for mobile monitoring data 16 / 37

Page 17: Fine-scale spatiotemporal air pollution analysis using ... · New sources of monitoring data I Air pollution epidemiology relies on a few stationary monitors per city I The field

Principal components of landuse variables

Brian Reich, NC State Spatiotemporal methods for mobile monitoring data 17 / 37

Page 18: Fine-scale spatiotemporal air pollution analysis using ... · New sources of monitoring data I Air pollution epidemiology relies on a few stationary monitors per city I The field

Non-spatial landuse regression

Let Yt(s) be the log(NO2) at time t and location s

Yt(s) = Xt(s)Tβ + εt(s), εt(s)iid∼ N (0, τ2)

where Xt(s) contains

I The first seven PCs

I Four trig functions for hourly diurnal cycle

I Interactions between the PCs and trig functions

R2 ≈ 0.16 and residuals are correlated

Brian Reich, NC State Spatiotemporal methods for mobile monitoring data 18 / 37

Page 19: Fine-scale spatiotemporal air pollution analysis using ... · New sources of monitoring data I Air pollution epidemiology relies on a few stationary monitors per city I The field

Results from landuse regressionObserved vs. Predicted, Oct. 29, 2015 – Dec. 18, 2015

Brian Reich, NC State Spatiotemporal methods for mobile monitoring data 19 / 37

Page 20: Fine-scale spatiotemporal air pollution analysis using ... · New sources of monitoring data I Air pollution epidemiology relies on a few stationary monitors per city I The field

Results from landuse regressionObserved vs. Predicted, Oct. 29, 2015 – Dec. 18, 2015

Brian Reich, NC State Spatiotemporal methods for mobile monitoring data 20 / 37

Page 21: Fine-scale spatiotemporal air pollution analysis using ... · New sources of monitoring data I Air pollution epidemiology relies on a few stationary monitors per city I The field

Spatiotemporal landuse regression model

I We add a spatiotemporal process to capture dependence

Yt(s) = Xt(s)β + ηt(s) + εt(s), εt(s)iid∼ N(0, τ2)

I The Gaussian process η has covariance

Cov{ηt(s), ηt ′(s′)

}= σ2exp

{−√||s− s′||2/ρ+ |t − t ′|2/φ

}I The range parameters ρ and φ determine the extent of

spatial and temporal dependence

I The covariance parameters to be estimated areθ = {σ2, τ2, ρ, φ}

Brian Reich, NC State Spatiotemporal methods for mobile monitoring data 21 / 37

Page 22: Fine-scale spatiotemporal air pollution analysis using ... · New sources of monitoring data I Air pollution epidemiology relies on a few stationary monitors per city I The field

Computation

I Maximum likelihood analysis is impossible

I The likelihood depends on function of the spatialcovariance matrix, which is huge (n × n)

I Overcoming this computational bottleneck is one of themain challenges in spatial statistics

I There are now many approaches9

9e.g., Heaton et al, 2019, JABESBrian Reich, NC State Spatiotemporal methods for mobile monitoring data 22 / 37

Page 23: Fine-scale spatiotemporal air pollution analysis using ... · New sources of monitoring data I Air pollution epidemiology relies on a few stationary monitors per city I The field

Computation

I We use the Veccia approximation to estimate thecovariance parameters

I Training the model based on the joint distribution of all theobservations is too slow

I Instead we regress the current observation Yi onto therecent past Ni

I If Ni = {Y1, ...,Yn−1} this is exact and slow

I If Ni ⊂ {Y1, ...,Yn−1} this is approximate and fast

Brian Reich, NC State Spatiotemporal methods for mobile monitoring data 23 / 37

Page 24: Fine-scale spatiotemporal air pollution analysis using ... · New sources of monitoring data I Air pollution epidemiology relies on a few stationary monitors per city I The field

Computation

I The neighboring sets are observations in the recent past

Ni = {obs between l and l + u minutes prior to obs i}

I The results are insensitive to u so we set u = 60

I Taking l = 0 gives the best approximation to the likelihood

I Often it ts better to include some distant neighbors10

I We pick l by cross-validation

10Gramacy and Apley, 2015, TechnometricsBrian Reich, NC State Spatiotemporal methods for mobile monitoring data 24 / 37

Page 25: Fine-scale spatiotemporal air pollution analysis using ... · New sources of monitoring data I Air pollution epidemiology relies on a few stationary monitors per city I The field

Cross validation design

I Train the model using both cars’ data prior to time t

I Predict the value for both cars at time t + h forh ∈ {5,15,60} minutes

I We compare the non-spatial (“X”), spatial (“S”) andspatiotemporal (“ST”) methods

I We also compare fitting the model using one car’s dataand predicting the other car (“Car AB”)

I Methods are compared using the correlation betweenobserved and predicted

I Repeat using raw data (1 sec) and 1-minute block medians

Brian Reich, NC State Spatiotemporal methods for mobile monitoring data 25 / 37

Page 26: Fine-scale spatiotemporal air pollution analysis using ... · New sources of monitoring data I Air pollution epidemiology relies on a few stationary monitors per city I The field

Prediction correlation using 1 sec block medians

Prediction lagModel l 5 mins 15 mins 60 mins Car AB

X - 0.18 0.18 0.18 0.08S - 0.27 0.27 0.27 0.09

ST 0 0.45 0.25 0.18 0.09ST 5 0.58 0.36 0.28 0.10ST 15 0.57 0.36 0.31 0.10ST 60 0.55 0.38 0.28 0.09

Brian Reich, NC State Spatiotemporal methods for mobile monitoring data 26 / 37

Page 27: Fine-scale spatiotemporal air pollution analysis using ... · New sources of monitoring data I Air pollution epidemiology relies on a few stationary monitors per city I The field

Prediction correlation using 1 min block medians

Prediction lagModel l 5 mins 15 mins 60 mins Car AB

X - 0.28 0.28 0.28 0.19S - 0.34 0.34 0.34 0.21

ST 0 0.59 0.44 0.29 0.26ST 5 0.64 0.56 0.46 0.26ST 15 0.64 0.56 0.45 0.26ST 60 0.63 0.55 0.45 0.26

Brian Reich, NC State Spatiotemporal methods for mobile monitoring data 27 / 37

Page 28: Fine-scale spatiotemporal air pollution analysis using ... · New sources of monitoring data I Air pollution epidemiology relies on a few stationary monitors per city I The field

Estimated diurnal trends

Brian Reich, NC State Spatiotemporal methods for mobile monitoring data 28 / 37

Page 29: Fine-scale spatiotemporal air pollution analysis using ... · New sources of monitoring data I Air pollution epidemiology relies on a few stationary monitors per city I The field

Correlation parameter estimates

Block Spatial Spatial Temporalsize l R2 ratio range (km) range (hr)1 sec 0 1.00 0.95 0.19

5 0.63 4.82 3.8315 0.54 3.95 9.5360 0.77 1.38 2.32

1 min 0 0.92 3.52 0.235 0.64 5.21 9.2415 0.57 5.43 28.7260 0.60 3.62 4.19

Brian Reich, NC State Spatiotemporal methods for mobile monitoring data 29 / 37

Page 30: Fine-scale spatiotemporal air pollution analysis using ... · New sources of monitoring data I Air pollution epidemiology relies on a few stationary monitors per city I The field

15 minutes ahead forecasts of NO2

I Forecast for 15:00 using the data from 13:45 to 14:45 onMay 5, 2016

I As expected, standard errors are lowest where data hasbeen obtained most recently from the two cars.

Brian Reich, NC State Spatiotemporal methods for mobile monitoring data 30 / 37

Page 31: Fine-scale spatiotemporal air pollution analysis using ... · New sources of monitoring data I Air pollution epidemiology relies on a few stationary monitors per city I The field

Dynamic model

I We envision the model being refitted periodically to adaptto evolving environmental, traffic and emissions patterns

I We refit the model in a sliding window of training data tostudy changes in parameter estimates and performance

I For each week from 12/07/2015 to 05/13/2016, we use thedata from the previous w weeks to train the model

I We compute 15-minute ahead prediction mean squarederror for that week

I We use l = 60 and 15-second block median data

Brian Reich, NC State Spatiotemporal methods for mobile monitoring data 31 / 37

Page 32: Fine-scale spatiotemporal air pollution analysis using ... · New sources of monitoring data I Air pollution epidemiology relies on a few stationary monitors per city I The field

Dynamic model – Spatial range estimates (w = 21)

Blue dots are estimates, black dots are bootstrap samples

Brian Reich, NC State Spatiotemporal methods for mobile monitoring data 32 / 37

Page 33: Fine-scale spatiotemporal air pollution analysis using ... · New sources of monitoring data I Air pollution epidemiology relies on a few stationary monitors per city I The field

Dynamic model – Temporal range estimates (w = 21)

Blue dots are estimates, black dots are bootstrap samples

Brian Reich, NC State Spatiotemporal methods for mobile monitoring data 33 / 37

Page 34: Fine-scale spatiotemporal air pollution analysis using ... · New sources of monitoring data I Air pollution epidemiology relies on a few stationary monitors per city I The field

Dynamic model – prediction MSE

Brian Reich, NC State Spatiotemporal methods for mobile monitoring data 34 / 37

Page 35: Fine-scale spatiotemporal air pollution analysis using ... · New sources of monitoring data I Air pollution epidemiology relies on a few stationary monitors per city I The field

Network designHow many cars should be deployed? How many fixed-locationsensors would provide the same quality of prediction?

Deploy mobile and stationary sensors and simulate data

Brian Reich, NC State Spatiotemporal methods for mobile monitoring data 35 / 37

Page 36: Fine-scale spatiotemporal air pollution analysis using ... · New sources of monitoring data I Air pollution epidemiology relies on a few stationary monitors per city I The field

Network design

Brian Reich, NC State Spatiotemporal methods for mobile monitoring data 36 / 37

Page 37: Fine-scale spatiotemporal air pollution analysis using ... · New sources of monitoring data I Air pollution epidemiology relies on a few stationary monitors per city I The field

Summary and future projects

I Our work11 shows that short-term forecasting of airpollution at a high spatial resolution is possible

I Future work:

I Fuse mobile and stationary sensorsI Model extremesI Multi-city analysisI Design efficient sampling routes

I Works supported by NIH and NSF

I THANKS!

11Guan et al (2020). Fine-scale spatiotemporal air pollution analysis usingmobile monitors on Google Street View vehicle. In press, Journal of theAmerican Statistical Association.

Brian Reich, NC State Spatiotemporal methods for mobile monitoring data 37 / 37