18
data assimilation for dynamic systems oksana chkrebtii Data Assimilation for Dynamic Systems Models Discussion of the paper “A Statistical Overview and Perspectives on Data Assimilation for Marine Biogeochemical Models” by Dowd, Jones, Parslow Computer Experiments Reading Group November 3, 2015 — Slide 1/12

Data Assimilation for Dynamic Systems Models · data assimilation for dynamic systemsoksana chkrebtii Data Assimilation for Dynamic Systems Models Discussion of the paper \A Statistical

  • Upload
    others

  • View
    4

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Data Assimilation for Dynamic Systems Models · data assimilation for dynamic systemsoksana chkrebtii Data Assimilation for Dynamic Systems Models Discussion of the paper \A Statistical

data a s s im i lat i on for dynam i c sy stems ok sana chkrebt i i

Data Assimilation for Dynamic Systems Models

Discussion of the paper “A Statistical Overview and Perspectives on Data

Assimilation for Marine Biogeochemical Models” by Dowd, Jones, Parslow

Computer Experiments Reading GroupNovember 3, 2015

— Slide 1/12

Page 2: Data Assimilation for Dynamic Systems Models · data assimilation for dynamic systemsoksana chkrebtii Data Assimilation for Dynamic Systems Models Discussion of the paper \A Statistical

data a s s im i lat i on for dynam i c sy stems ok sana chkrebt i i

The Plan

“Data assimilation refers to the quantitative methods by which theinformation in dynamic models and data are combined to provideestimates of the state and its key parameters”

This discussion will focus specifically on the use of data assimilation forinference and prediction for nonlinear dynamic systems models.

Slide 2/12

Page 3: Data Assimilation for Dynamic Systems Models · data assimilation for dynamic systemsoksana chkrebtii Data Assimilation for Dynamic Systems Models Discussion of the paper \A Statistical

data a s s im i lat i on for dynam i c sy stems ok sana chkrebt i i

The Model

Interested in inference and prediction for large-scale spatio-temporalmodels defined implicitly as partial differential equations (PDE).

An example from biogeochemistry (BGC):

∂x (i)

∂t+ u · ∇x (i) −∇ ·

(κ∇x (i)

)= fi (x , θ, γ) , i = 1, . . . ,m.

where

• x = [x (1), · · · , x (m)]> – 3-d spatial field

• u – 3-d current field

• κ – matrix of diffusion coefficients

• fi – governing equations (photosynthesis, predation, etc.)

• θ – biological parameters (may be high dimensional)

• γ – forcing fields (light, temperature, etc.)

Slide 3/12

Page 4: Data Assimilation for Dynamic Systems Models · data assimilation for dynamic systemsoksana chkrebtii Data Assimilation for Dynamic Systems Models Discussion of the paper \A Statistical

Global Ocean Circulation Model: SST snapshot

chkrebtii.1
Text Box
Slide from "Miscellaneous Thoughts on Ocean Data Assimilation" by M. Dowd, presented at the 2015 STATMOS/NCAR Workshop on Data Assimilation
Page 5: Data Assimilation for Dynamic Systems Models · data assimilation for dynamic systemsoksana chkrebtii Data Assimilation for Dynamic Systems Models Discussion of the paper \A Statistical

data a s s im i lat i on for dynam i c sy stems ok sana chkrebt i i

The Model

What we need to keep in mind about these models

Issues:

• The BGC parameters, θ, are uncertain and vary by season etc.

• The BGC governing equations (RHS) are uncertain (they are notperfect models of reality)

• Ocean circulation models (LHS) are uncertain (spatial resolutionlimitations, etc.)

• Forcing fields, γ, are uncertain

• No closed form expression for the states, x , means we make inferenceon a discrete approximation of the PDE and not the PDE itself

(Stochastic models?)

Slide 4/12

Page 6: Data Assimilation for Dynamic Systems Models · data assimilation for dynamic systemsoksana chkrebtii Data Assimilation for Dynamic Systems Models Discussion of the paper \A Statistical

data a s s im i lat i on for dynam i c sy stems ok sana chkrebt i i

The Data

Data, yt , t = 1, . . . ,T for these applications can be quite complex

Examples:

• Indirect measurements of x at different locations and water depths(usually systematic sampling)

• Tracer data

• Remote sensing

Issues:

• Uncertain error models yt = h(xt , θ, εt)

• Indirect observations

• Combining different data types

Slide 5/12

Page 7: Data Assimilation for Dynamic Systems Models · data assimilation for dynamic systemsoksana chkrebtii Data Assimilation for Dynamic Systems Models Discussion of the paper \A Statistical

Measurements of the Physical Ocean  

chkrebtii.1
Text Box
Slide from "Miscellaneous Thoughts on Ocean Data Assimilation" by M. Dowd, presented at the 2015 STATMOS/NCAR Workshop on Data Assimilation
Page 8: Data Assimilation for Dynamic Systems Models · data assimilation for dynamic systemsoksana chkrebtii Data Assimilation for Dynamic Systems Models Discussion of the paper \A Statistical

Measurements of Ocean Biology  

5me  series  (moored  instruments    -­‐  plankton,  nutrients)  

5me-­‐space  series  (glider  robots  -­‐  plankton)  

spa5al  data  (satellite  chorophyll)  

•  OBSERVATION  REVOLUTION  IN  OCEAN  SCIENCES,    •  STILL  A  PARTIALLY  OBSERVED  SYSTEM  AND  UNDERSAMPLED  •  COMPLEX  SPATIO-­‐TEMPORAL  SAMPLING  PROTOCOLS  

chkrebtii.1
Text Box
Slide from "Miscellaneous Thoughts on Ocean Data Assimilation" by M. Dowd, presented at the 2015 STATMOS/NCAR Workshop on Data Assimilation
Page 9: Data Assimilation for Dynamic Systems Models · data assimilation for dynamic systemsoksana chkrebtii Data Assimilation for Dynamic Systems Models Discussion of the paper \A Statistical

data a s s im i lat i on for dynam i c sy stems ok sana chkrebt i i

Inference and Prediction

We are interested in the joint distribution of the state and theparameters conditional on the observations:

[x1:T , θ | y1:T ]

Hierarchical model:

[x1:T , θ | y1:T ] ∝ [y1:T | x1:T , θ]︸ ︷︷ ︸data model

[x1:T | θ]︸ ︷︷ ︸PDE forward model

[θ]︸︷︷︸prior

Slide 6/12

Page 10: Data Assimilation for Dynamic Systems Models · data assimilation for dynamic systemsoksana chkrebtii Data Assimilation for Dynamic Systems Models Discussion of the paper \A Statistical

data a s s im i lat i on for dynam i c sy stems ok sana chkrebt i i

Data Assimilation

Two main ways we could do (usually approximate) inference andprediction on x1:T and θ

1 Variational approaches (optimization based, “fast”, MAPestimation)

2 Sampling based methods (sampling from exact or approximatetarget posterior, “slow”, posterior uncertainty)

Slide 7/12

Page 11: Data Assimilation for Dynamic Systems Models · data assimilation for dynamic systemsoksana chkrebtii Data Assimilation for Dynamic Systems Models Discussion of the paper \A Statistical

data a s s im i lat i on for dynam i c sy stems ok sana chkrebt i i

Variational Methods

Optimization problem of the following objective:

J =T∑t=1

||yt − h(xt , θ, εt)||2Σ−1yt−h(xt ,θ,εt )

with respect to θ and subject to the constraint

xt = d(xt−1, θ, γt),

which is a discretized transition model for the state (approximate PDEsolution obtained using numerical time-stepping from the previouslyestimated state xt−1).

Slide 8/12

Page 12: Data Assimilation for Dynamic Systems Models · data assimilation for dynamic systemsoksana chkrebtii Data Assimilation for Dynamic Systems Models Discussion of the paper \A Statistical

data a s s im i lat i on for dynam i c sy stems ok sana chkrebt i i

Variational Methods

Examples:

Algorithms include 3DVAR, 4DVAR

Issues:

Minimization of J is challenging and computationally expensive

• objective multimodality

• θ has high dimension

• discretization method used to obtain transitions of x from one timepoint to the other are computationally expensive

Slide 9/12

Page 13: Data Assimilation for Dynamic Systems Models · data assimilation for dynamic systemsoksana chkrebtii Data Assimilation for Dynamic Systems Models Discussion of the paper \A Statistical

data a s s im i lat i on for dynam i c sy stems ok sana chkrebt i i

Sampling-based Estimation

The target density for the DA problem is [x1:T , θ | y1:T ]

Examples:

MCMC, particle filter, ensemble Kalman filter

Issues:

• MCMC targets the posterior directly, but it is slow

• Particle filtering can be faster. Iterates between two steps:prediction xt | y1:t−1, θ and measurement update xt | y1:t , θ but doesnot incorporate prior information on θ

• Kalman filter is exact for linear, additive Gaussian models, buttargets a rough approximation of the posterior when the model isnonlinear

Slide 10/12

Page 14: Data Assimilation for Dynamic Systems Models · data assimilation for dynamic systemsoksana chkrebtii Data Assimilation for Dynamic Systems Models Discussion of the paper \A Statistical

time

measurement

nowcast forecast

time = t-1 time = t

prediction

nowcast [xt−1,θ | y1:t−1]

xt = d(xt−1,θ ) +ν t yt

[xt ,θ | y1:t−1] [xt ,θ | y1:t ]

-  Recursive estimation of system state through time

-  Forecast and Measurement steps

Single stage transition of system from time t-1 to time t Sequential Methods for DA

chkrebtii.1
Text Box
Slide from "Miscellaneous Thoughts on Ocean Data Assimilation" by M. Dowd, presented at the 2015 STATMOS/NCAR Workshop on Data Assimilation
Page 15: Data Assimilation for Dynamic Systems Models · data assimilation for dynamic systemsoksana chkrebtii Data Assimilation for Dynamic Systems Models Discussion of the paper \A Statistical

Approaches for Parameter Estimation

xt=xtθt

⎝⎜⎜

⎠⎟⎟

1. State Augmentation: append parameters to the state

Can use ‘standard’ sequential MC methods with iterative filtering

2. Likelihood Methods: Use sample based likelihoods

[y1:T |θ ] = L(θ | y1:T ) = [yt | xt,θ ][xt | y1:t−1,θ ]dxt∫t=1

T

3.  Bayesian  Hierarchical: particle MCMC, SMC^2

chkrebtii.1
Rectangle
chkrebtii.1
Text Box
Slide from "Miscellaneous Thoughts on Ocean Data Assimilation" by M. Dowd, presented at the 2015 STATMOS/NCAR Workshop on Data Assimilation
Page 16: Data Assimilation for Dynamic Systems Models · data assimilation for dynamic systemsoksana chkrebtii Data Assimilation for Dynamic Systems Models Discussion of the paper \A Statistical

Sample Based Likelihood Surface L(θ | y1:T ) ≈

1n

[yt | xt|t−1(i) ,θ ]

i=1

n∑( )t=1

T

chkrebtii.1
Text Box
Slide from "Miscellaneous Thoughts on Ocean Data Assimilation" by M. Dowd, presented at the 2015 STATMOS/NCAR Workshop on Data Assimilation
Page 17: Data Assimilation for Dynamic Systems Models · data assimilation for dynamic systemsoksana chkrebtii Data Assimilation for Dynamic Systems Models Discussion of the paper \A Statistical

data a s s im i lat i on for dynam i c sy stems ok sana chkrebt i i

Challenges

The paper identifies a number of open problems:

• High dimensional states: xt (often m > 106); variational methodsand Kalman filter are used, but rely on approximation

• Numerical resolution limits the number of ensemble sizes (usually< 1000); surrogate models and emulators are often used

• Most DA techniques do not incorporate prior information on θ,which can be very informative

Slide 11/12

Page 18: Data Assimilation for Dynamic Systems Models · data assimilation for dynamic systemsoksana chkrebtii Data Assimilation for Dynamic Systems Models Discussion of the paper \A Statistical

data a s s im i lat i on for dynam i c sy stems ok sana chkrebt i i

Challenges

The paper identifies a number of open problems:

• Dynamic model complexity and model selection

• Can we obtain useful information from fitting a nonlinear modelwith so many sources of uncertainty?

• Specifying the data model: this modeling step is often neglected inBGC applications

• Sampling design

Slide 12/12