37
Tutorial on data assimilation Marc Bocquet CEREA, joint lab ´ Ecole des Ponts ParisTech and EdF R&D, Universit´ e Paris-Est, France Institut Pierre-Simon Laplace ([email protected]) M. Bocquet Journ´ ee du groupe SAMA de l’IPSL, ´ Ecole Normale Sup´ erieure, Paris, France, 13 March 2018 1 / 37

Tutorial on data assimilation - Sciencesconf.org€¦ · Data assimilation: principles Outline 1 Data assimilation: principles De nition Mathematical framework 2 Main techniques 3D-Var

  • Upload
    others

  • View
    27

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Tutorial on data assimilation - Sciencesconf.org€¦ · Data assimilation: principles Outline 1 Data assimilation: principles De nition Mathematical framework 2 Main techniques 3D-Var

Tutorial on data assimilation

Marc Bocquet

CEREA, joint lab Ecole des Ponts ParisTech and EdF R&D, Universite Paris-Est, FranceInstitut Pierre-Simon Laplace

([email protected])

M. Bocquet Journee du groupe SAMA de l’IPSL, Ecole Normale Superieure, Paris, France, 13 March 2018 1 / 37

Page 2: Tutorial on data assimilation - Sciencesconf.org€¦ · Data assimilation: principles Outline 1 Data assimilation: principles De nition Mathematical framework 2 Main techniques 3D-Var

Data assimilation: principles

Outline

1 Data assimilation: principlesDefinitionMathematical framework

2 Main techniques3D-Var and optimal interpolationThe Kalman filterThe ensemble Kalman filter4D-Var

3 Advanced techniquesEnsemble variational methodsParticle filters

4 Uncertainty quantification of the best estimate

M. Bocquet Journee du groupe SAMA de l’IPSL, Ecole Normale Superieure, Paris, France, 13 March 2018 2 / 37

Page 3: Tutorial on data assimilation - Sciencesconf.org€¦ · Data assimilation: principles Outline 1 Data assimilation: principles De nition Mathematical framework 2 Main techniques 3D-Var

Data assimilation: principles Definition

Data assimilation: definition

IData assimilation is the set of techniques that allow to optimally combineobservations of a physical system with numerical models and prior information of thatsystem, so as to get an estimate of the state of the system.

I In the geosciences: Numerical models are often computationally costly. They areoften dynamical.

I In the geosciences: The state space and observations space are huge (up to 109/107

for operational systems, up to 107/105 for research systems). A big data problem withcostly dynamical models.

IWhat for?: estimate initial state of chaotic systems for forecasting, re-analysis,estimate parameters (∼ inverse modelling).

IExample: Data assimilation for prediction.

Observation

Model (forecast)

H

Analysis

Observation

Model (forecast)

H

Analysis

Observation

Model (forecast)

H

Analysis

M. Bocquet Journee du groupe SAMA de l’IPSL, Ecole Normale Superieure, Paris, France, 13 March 2018 3 / 37

Page 4: Tutorial on data assimilation - Sciencesconf.org€¦ · Data assimilation: principles Outline 1 Data assimilation: principles De nition Mathematical framework 2 Main techniques 3D-Var

Data assimilation: principles Definition

Data assimilation system

IData assimilation system = observation and evolution models + statistics of theerrors. Typically:

xk = Mk:k−1(xk−1) + ηk

yk = Hk(xk) + εk

with ηk ∼N (0,Qk) and εk ∼N (0,Rk).

t1 t2 tK tK+1 tK+2

Past Future

IDenoting xK :1 = x1,x2, . . . ,xK , yK :1 = y1,y2, . . . ,yK :

Prediction: Estimate xk for k > K , knowing yK :1;

Filtering: Estimate xK , knowing yK :1;

Smoothing: Estimate xK :1, knowing yK :1.

M. Bocquet Journee du groupe SAMA de l’IPSL, Ecole Normale Superieure, Paris, France, 13 March 2018 4 / 37

Page 5: Tutorial on data assimilation - Sciencesconf.org€¦ · Data assimilation: principles Outline 1 Data assimilation: principles De nition Mathematical framework 2 Main techniques 3D-Var

Data assimilation: principles Mathematical framework

The ideal mathematical framework

IBayes/Laplace approach:

p(x|y) =p(y|x)p(x)

p(y)

with p(y|x) the likelihood of the observations, p(x) the prior/background on thesystem’s state, and p(y) the evidence. The evidence is a normalisation that does notdepend on x:

p(y) =∫

dxp(y|x)p(x) .

IThis is a probabilistic approach. It quantifies the uncertainty/the information. It doesnot provide a deterministic estimator. This would require to make a choice on top ofBayes’ rule.

IThe Bayesian approach is very satisfactorily [Jaynes et al., 2003]. Most DA methods canbe derived or comply with Bayes’ rule.

IBut it does not lend to a closed form analytically tractable solution.

M. Bocquet Journee du groupe SAMA de l’IPSL, Ecole Normale Superieure, Paris, France, 13 March 2018 5 / 37

Page 6: Tutorial on data assimilation - Sciencesconf.org€¦ · Data assimilation: principles Outline 1 Data assimilation: principles De nition Mathematical framework 2 Main techniques 3D-Var

Data assimilation: principles Mathematical framework

Gaussian approximation

IA key to obtain a (approximate) solution is to truncate the errors to second-ordermoments ∼ the Gaussian approximation. Most of DA methods are fully or partiallybased on this assumption.

IThe elementary building block of DA schemes is the statistical BLUE (for Best LinearUnbiased Estimator) analysis. Time is considered fixed. H is assumed linear.

y = Hx+ εo x = xb + ε

b

where εo ∼N (0,R), and εb ∼N (0,B).

ISolution:

xa = xb +K(y−Hxb

)K = BHT

(R+HBHT

)−1

Pa = (I−KH)B.

yxb xa

M. Bocquet Journee du groupe SAMA de l’IPSL, Ecole Normale Superieure, Paris, France, 13 March 2018 6 / 37

Page 7: Tutorial on data assimilation - Sciencesconf.org€¦ · Data assimilation: principles Outline 1 Data assimilation: principles De nition Mathematical framework 2 Main techniques 3D-Var

Main techniques

Outline

1 Data assimilation: principlesDefinitionMathematical framework

2 Main techniques3D-Var and optimal interpolationThe Kalman filterThe ensemble Kalman filter4D-Var

3 Advanced techniquesEnsemble variational methodsParticle filters

4 Uncertainty quantification of the best estimate

M. Bocquet Journee du groupe SAMA de l’IPSL, Ecole Normale Superieure, Paris, France, 13 March 2018 7 / 37

Page 8: Tutorial on data assimilation - Sciencesconf.org€¦ · Data assimilation: principles Outline 1 Data assimilation: principles De nition Mathematical framework 2 Main techniques 3D-Var

Main techniques 3D-Var and optimal interpolation

3D-Var and optimal interpolation

IVariational formulation of the same problem

J(x) =1

2‖x−xb‖2

B−1 +1

2‖y−Hx‖2

R−1

where ‖x‖2A = xTAx, which is equivalent to

BLUE.

IProbabilistic/Bayesian interpretation:

p(x|y) ∝ e−J(x) yxb xa

ICapable of handling nonlinear observation operator using standard nonlinearoptimisation methods:

J(x) =1

2‖x−xb‖2

B−1 +1

2‖y−H(x)‖2

R−1 .

M. Bocquet Journee du groupe SAMA de l’IPSL, Ecole Normale Superieure, Paris, France, 13 March 2018 8 / 37

Page 9: Tutorial on data assimilation - Sciencesconf.org€¦ · Data assimilation: principles Outline 1 Data assimilation: principles De nition Mathematical framework 2 Main techniques 3D-Var

Main techniques 3D-Var and optimal interpolation

Chaining the analyses in time

IChaining the BLUE/3D-Var cycles:

1 Analysis with a forecast at tk : xfk and with static information B: xa

k ;

2 Forecast to tk+1: xfk+1 = Mk+1:k(xa

k).

IAlso known as optimal interpolation(if the analysis step is BLUE).

IRelatively cheap. Used in oceanogra-phy, atmospheric chemistry. Requiresa smart construction of B.

IBut the information about the errorsis not propagated in time . . .

t1 t2 t3 t4 t5

0.0

0.2

0.4

0.6

0.8

1.0

1.2truth3D-Varobservationanalysisforecast

Observation y,R

xf

B

Analysis xa

Observation

Model

y,R

xf

B

Analysis xa

Observation

Model

y,R

xf

B

Analysis xa

M. Bocquet Journee du groupe SAMA de l’IPSL, Ecole Normale Superieure, Paris, France, 13 March 2018 9 / 37

Page 10: Tutorial on data assimilation - Sciencesconf.org€¦ · Data assimilation: principles Outline 1 Data assimilation: principles De nition Mathematical framework 2 Main techniques 3D-Var

Main techniques The Kalman filter

The Kalman filter

ISimilar to optimal interpolation. But, now, we want to replace the static B with adynamic Pf which needs updating and propagating.

IAnalysis step:

xak = xf

k +Kk

(yk −Hkx

fk

),

Kk = PfkH

Tk

(Rk +HkP

fHTk

)−1,

Pak = (I−KkHk)Pf

k .

IForecast step:

xfk+1 = Mk+1:kx

ak ,

Pfk+1 = Mk+1:kP

akM

Tk+1:k +Qk+1.

Observation y,R

xf

Pf

Analysis xa

Pa

Observation

Model

TLM

y,R

xf

Pf

Analysis xa

Pa

Observation

Model

TLM

y,R

xf

Pf

Analysis xa

Pa

M. Bocquet Journee du groupe SAMA de l’IPSL, Ecole Normale Superieure, Paris, France, 13 March 2018 10 / 37

Page 11: Tutorial on data assimilation - Sciencesconf.org€¦ · Data assimilation: principles Outline 1 Data assimilation: principles De nition Mathematical framework 2 Main techniques 3D-Var

Main techniques The Kalman filter

The extended Kalman filter

IOptimal if the models are linear and if all the initial and observations errors areGaussian: it gives the perfect Gaussian solution of Bayes’ rule.

ICan be extended to nonlinear models: then

xfk+1 = Mk+1:k(xa

k),

Pfk+1 = M′k+1:kP

fkM′Tk+1:k +Qk+1,

where M′k+1:k is the tangent linear model.

IExtremely costly for large geophysical models: storage space (storage of Pf) and

computations (M′k+1:kPfkM′Tk+1:k requires 2n integrations of the model).

M. Bocquet Journee du groupe SAMA de l’IPSL, Ecole Normale Superieure, Paris, France, 13 March 2018 11 / 37

Page 12: Tutorial on data assimilation - Sciencesconf.org€¦ · Data assimilation: principles Outline 1 Data assimilation: principles De nition Mathematical framework 2 Main techniques 3D-Var

Main techniques The ensemble Kalman filter

The ensemble Kalman filter

IThe idea [Evensen, 1994; Houtekamer and Mitchell, 1998] is to make the KF work in highdimensions and replace P (Pa or Pf) with an ensemble of states x1, x2, . . . , xm. Themoments of the error could theoretically be approximated by the sample/empiricalmoments:

xf =1

m

m

∑i=1

x, Pf =1

m−1

m

∑i=1

(x(i)−x

)(x(i)−x

)T.

IAnalysis step: Similar to the KF but Pf explicitly or implicitly taken as the samplecovariance estimator.

IForecast step: The ensemble is propagated using the full nonlinear model (not thetangent linear model!)

x(i),fk+1 = Mk+1:k

(x

(i),ak

).

M. Bocquet Journee du groupe SAMA de l’IPSL, Ecole Normale Superieure, Paris, France, 13 March 2018 12 / 37

Page 13: Tutorial on data assimilation - Sciencesconf.org€¦ · Data assimilation: principles Outline 1 Data assimilation: principles De nition Mathematical framework 2 Main techniques 3D-Var

Main techniques The ensemble Kalman filter

The ensemble Kalman filter

ITwo main flavors of EnKFs: stochastic and deterministic, but many variants.IThe stochastic EnKF is the closest to traditional KF, but adds stochasticperturbations to the observations of each members to properly account for theobservation errors [Burgers et al., 1998]:

xa(i) = xf

(i) +K(y+ ε(i)−Hxf

(i)

).

IThe deterministic EnKF avoids the introduction of the stochastic perturbations byupdating the square root of Pf = XfX

Tf , i.e. Xf . One of the variant (ETKF, [Hunt et al.,

2007]) operates the linear algebra in the space of the perturbations:

xa = xf +Xfwa.

The analysis in the perturbation space is given by

wa =(Im +YT

f R−1Yf

)−1YT

f R−1(y−Hxf

)where Yf = Hxf. This updates the mean state via xa = xf +Xfw

a. The perturbationsaround it are updated via

Xa = Xf

(Im−YT

f (YfYTf +R)−1Yf

) 12U, where U ∈ O(N) and U1 = 1.

M. Bocquet Journee du groupe SAMA de l’IPSL, Ecole Normale Superieure, Paris, France, 13 March 2018 13 / 37

Page 14: Tutorial on data assimilation - Sciencesconf.org€¦ · Data assimilation: principles Outline 1 Data assimilation: principles De nition Mathematical framework 2 Main techniques 3D-Var

Main techniques The ensemble Kalman filter

The downside of the EnKF: rank-deficiency

IThere is a heavy price to pay for replacing the Pf n×n covariance matrix of the KFwith the Xf m×n anomaly matrix: spurious correlations for distant state components.If P = XfX

Tf and B is the true error covariance matrix of a Gaussian process:

Cov([P]ii , [P]jj

)=

2

N−1[B]2ij , Cov

([P]ij , [P]ij

)=

1

N−1

([B]2ij + [B]ii [B]jj

).

IBut, for geophysical systems, we know that most long-range correlations aredampened exponentially. Consequently, the covariances are misestimated (too lowvariances, too high long-range covariances) and leads to divergence of the EnKF.−→ Practically, this is solved using two fixes: inflation and localisation.

I Inflation consists in inflating the covariances by a scalar in the hope to compensatefor the underestimation of the error statistics [Pham et al., 1998, Anderson et al., 1999]:

x(i)←− x(i) + λ

(x(i)−x

).

Can be avoided in a perfect-model context: finite-size EnKF (EnKF-N) [Bocquet et al.,

2011-2018].

M. Bocquet Journee du groupe SAMA de l’IPSL, Ecole Normale Superieure, Paris, France, 13 March 2018 14 / 37

Page 15: Tutorial on data assimilation - Sciencesconf.org€¦ · Data assimilation: principles Outline 1 Data assimilation: principles De nition Mathematical framework 2 Main techniques 3D-Var

Main techniques The ensemble Kalman filter

Localisation

ITwo flavors of localisation: domain localisation and covariance localisation.

IDomain localisation: divide and conquer.The DA analysis is performed in parallel in localdomains. The outcomes of these analyses arelater sewed together. This is applicable only ifthe long-range error correlations are negligible.

x

Local update

Observation

ICovariance localisation: killing off spurious correlation explicitly: Pf = ρ ◦(XfX

Tf).

0 50 100 150

0

50

100

150

True B

-1.00

-0.80

-0.60

-0.40

-0.20

0.00

0.20

0.40

0.60

0.80

1.00

0 50 100 150

0

50

100

150

Sampled B

-1.00

-0.80

-0.60

-0.40

-0.20

0.00

0.20

0.40

0.60

0.80

1.00

0 50 100 150

0

50

100

150

Regularized sampled B

-1.00

-0.80

-0.60

-0.40

-0.20

0.00

0.20

0.40

0.60

0.80

1.00

IThese strategies have successfully been applied to the EnKF [Hamill et al, 2001;

Houtekamer and Mitchell, 2001; Evensen, 2003; Hunt et al., 2007].

M. Bocquet Journee du groupe SAMA de l’IPSL, Ecole Normale Superieure, Paris, France, 13 March 2018 15 / 37

Page 16: Tutorial on data assimilation - Sciencesconf.org€¦ · Data assimilation: principles Outline 1 Data assimilation: principles De nition Mathematical framework 2 Main techniques 3D-Var

Main techniques The ensemble Kalman filter

Nonlinear chaotic models: the Lorenz-95 low-order model

0 100 200 300 400

Time

0

5

10

15

20

25

30

35x

7.5

5.0

2.5

0.0

2.5

5.0

7.5

10.0

I It represents a mid-latitude zonal circle of the global atmosphere.ISet of n = 40 ordinary differential equations [Lorenz and Emmanuel 1998]:

dxidt

= (xi+1−xi−2)xi−1−xi +F , (1)

where F = 8, and the boundary is cyclic.IConservative system except for a forcing term F and a dissipation term −xi .IChaotic dynamics, 13 positive and 1 neutral Lyapunov exponents, a doubling time ofabout 0.42 time units.

M. Bocquet Journee du groupe SAMA de l’IPSL, Ecole Normale Superieure, Paris, France, 13 March 2018 16 / 37

Page 17: Tutorial on data assimilation - Sciencesconf.org€¦ · Data assimilation: principles Outline 1 Data assimilation: principles De nition Mathematical framework 2 Main techniques 3D-Var

Main techniques The ensemble Kalman filter

Illustration with the Lorenz-95 model

5 6 7 8 9 10 15 20 25 30 35 40 45 50

Ensemble size

0.2

0.3

0.4

0.5

5

1

2

3

4A

ver

age

anal

ysi

s ro

ot

mea

n s

qu

are

erro

r

EnKF no. loc. no infl.EnKF no loc. opt. infl.

EnKF opt. loc. no. infl.

EnKF opt. loc. opt. infl.

IPerformance of the EnKF in the absence/presence of inflation/localisation.

M. Bocquet Journee du groupe SAMA de l’IPSL, Ecole Normale Superieure, Paris, France, 13 March 2018 17 / 37

Page 18: Tutorial on data assimilation - Sciencesconf.org€¦ · Data assimilation: principles Outline 1 Data assimilation: principles De nition Mathematical framework 2 Main techniques 3D-Var

Main techniques The ensemble Kalman filter

What about smoothing?

IThere are smoothing variants of theKalman filter [Anderson & Moore, 1979], theKalman smoother used in the geosciences[Cohn et al., 1994]

IAnd they have been adapted to the EnKFand variants [Evensen & van Leeuwen, 2000],

[Evensen, 2009], [Cosme et al., 2012], [Bocquet &

Sakov, 2014], etc.0 10 20 30 40 50

Lag

0

0.02

0.04

0.06

0.08

0.1

0.12

0.14

0.16

0.18

0.2

0.22

An

aly

sis

roo

t m

ean

sq

uar

e er

ror

EnKF, EnKS filtering

EnKS smoothing

ISometimes called asynchronous data assimilation [Sakov et al., 2010; Sakov & Bocquet, 2018].

IWith the notable exception of the IEnKS, these smoothers relies on Gaussianassumptions within the DAW.

I 4D-Var is a more natural method to handle nonlinearity within the DAW.

M. Bocquet Journee du groupe SAMA de l’IPSL, Ecole Normale Superieure, Paris, France, 13 March 2018 18 / 37

Page 19: Tutorial on data assimilation - Sciencesconf.org€¦ · Data assimilation: principles Outline 1 Data assimilation: principles De nition Mathematical framework 2 Main techniques 3D-Var

Main techniques 4D-Var

4D-Var

IStrongly constrained 4D-Var, i.e. assuming the model is perfect

J(x0) =1

2‖x0−xb0‖2

B−1 +1

2

K

∑k=1

‖yk −Hk(xk)‖2R−1k

under the constraints that xk+1 = Mk+1:k(xk) for k = 0, . . . ,K −1.

IFits a model trajectory through the 4D datapoints.

I In high-dimensional spaces, requires ∇x0J foran efficient minimisation. But ∇x0J dependson the adjoint of Mk+1:k and Hk . This canbe a very difficult technical task if the modelis a huge piece of code for a nonlinear high-dimensional model.

t1 t2 t3 t4 t5

0.0

0.2

0.4

0.6

0.8

1.0truth4D-Varobservationanalysis

IWeakly constrained 4D-Var, i.e. assuming the model is imperfect

J(xK :0) =1

2‖x0−xb0‖2

B−1 +1

2

K

∑k=0

‖yk −Hk(xk)‖2R−1k

+1

2

K

∑k=1

‖xk −Mk:k−1(xk−1)‖2Q−1

k.

M. Bocquet Journee du groupe SAMA de l’IPSL, Ecole Normale Superieure, Paris, France, 13 March 2018 19 / 37

Page 20: Tutorial on data assimilation - Sciencesconf.org€¦ · Data assimilation: principles Outline 1 Data assimilation: principles De nition Mathematical framework 2 Main techniques 3D-Var

Advanced techniques

Outline

1 Data assimilation: principlesDefinitionMathematical framework

2 Main techniques3D-Var and optimal interpolationThe Kalman filterThe ensemble Kalman filter4D-Var

3 Advanced techniquesEnsemble variational methodsParticle filters

4 Uncertainty quantification of the best estimate

M. Bocquet Journee du groupe SAMA de l’IPSL, Ecole Normale Superieure, Paris, France, 13 March 2018 20 / 37

Page 21: Tutorial on data assimilation - Sciencesconf.org€¦ · Data assimilation: principles Outline 1 Data assimilation: principles De nition Mathematical framework 2 Main techniques 3D-Var

Advanced techniques Ensemble variational methods

Hybridising ensemble and variational methods

IHybrid: Use flow-dependent statistics froman EnKF into 3D-Var [Hamil & Snyder 2000;

Wang et al. 2007].

I 4D-LETKF [Hunt et al., 2004; Fertig et al., 2007]

IEDA: ensemble of 4D-Var (ECMWF, Meteo-France) [Raynaud et al., 2009; Bonavita et al., 2012;

Berre et al., 2015; Jardak & Talagrand 2018]

I 4DEnVar: Adjoint-less 4D-Var [Liu et al., 2008; Buehner et al. 2010; Zhang and Zhang, 2012;

Fairbairn et al. 2014, Desroziers et al. 2014], but ensemble update and nonlinearity still notcompletely addressed.

I IEnKS: has it all [Sakov et al. 2012, Bocquet & Sakov 2012-2016].

IAs ensemble methods, they all require localisation, which is more difficult toimplement in a 4D context [Bocquet, 2016] except if the adjoint is available.−→ For a review on EnVar methods, see Chapter 7 of the new book [Asch et al., 2016].

M. Bocquet Journee du groupe SAMA de l’IPSL, Ecole Normale Superieure, Paris, France, 13 March 2018 21 / 37

Page 22: Tutorial on data assimilation - Sciencesconf.org€¦ · Data assimilation: principles Outline 1 Data assimilation: principles De nition Mathematical framework 2 Main techniques 3D-Var

Advanced techniques Ensemble variational methods

The iterative ensemble Kalman smoother (IEnKS)

IReduced scheme in ensemble space, x0 = x0 +X0w, where X0 is the ensembleanomaly matrix:

J(w) = J(x0 +X0w) .

IAnalysis IEnKS cost function in ensemble space:

J(w) =1

2

L

∑k=1

‖yk −Hk ◦Mk:0 (x0 +X0w)‖2βkR

−1k

+1

2(N−1)‖w‖2 .

{β0,β1, . . . ,βL} weight the observations impact within the window.

IAs a variational reduced method, one can use Gauss-Newton [Sakov et al., 2012],Levenberg-Marquardt [Bocquet & Sakov, 2012], quasi-Newton, trust region, etc.,minimisation schemes.

IPerturbation update: same as the ETKF

E?0 = x?01

T +√N−1X0

[∇

2wJ]−1/2

?U where U ∈ O(N) and U1 = 1 .

−→ Cecile Defforge’s talk.

M. Bocquet Journee du groupe SAMA de l’IPSL, Ecole Normale Superieure, Paris, France, 13 March 2018 22 / 37

Page 23: Tutorial on data assimilation - Sciencesconf.org€¦ · Data assimilation: principles Outline 1 Data assimilation: principles De nition Mathematical framework 2 Main techniques 3D-Var

Advanced techniques Ensemble variational methods

Chaining 4D analyses in time

IThe IEnKS opens up new perspectives on the chaining of DA cycles which was littlerelevant for either the EnKF or 4D-Var.

I L: length of the data assimilation window,

IS : shift of the data assimilation window in between two updates.

tL−3 tL−2

yL−3 yL−2

tL−1 tL

yL−1 yL

tL+1 tL+2

yL+1 yL+2

tL−2

tL

tL+2

S∆t

S∆t

L∆t

Variational analysis in ens. space → Posterior ens. generation → Ens. forecast

M. Bocquet Journee du groupe SAMA de l’IPSL, Ecole Normale Superieure, Paris, France, 13 March 2018 23 / 37

Page 24: Tutorial on data assimilation - Sciencesconf.org€¦ · Data assimilation: principles Outline 1 Data assimilation: principles De nition Mathematical framework 2 Main techniques 3D-Var

Advanced techniques Ensemble variational methods

Performance comparison with Lorenz-95

1 5 10 15 20 25 30 35 40 45 50

Data assimilation window length L

0.15

0.16

0.17

0.18

0.19

0.20

0.21

0.22

0.24

0.26

0.28

0.30

0.32

Fil

teri

ng a

nal

ysi

s R

MS

E

4D-VarEnKFIEnKS

1 5 10 15 20 25 30 35 40 45 50

Data assimilation window length L

0.04

0.05

0.06

0.07

0.08

0.09

0.10

0.12

0.14

0.16

0.18

0.20

0.24

0.28

0.32

Sm

ooth

ing a

nal

ysi

s R

MS

E

4D-VarEnKSIEnKS S=1

IComparing 4D-Var, the EnKF, the EnKS and the IEnKS.

M. Bocquet Journee du groupe SAMA de l’IPSL, Ecole Normale Superieure, Paris, France, 13 March 2018 24 / 37

Page 25: Tutorial on data assimilation - Sciencesconf.org€¦ · Data assimilation: principles Outline 1 Data assimilation: principles De nition Mathematical framework 2 Main techniques 3D-Var

Advanced techniques Particle filters

Taking the bull by the horns: the particle filter

IThe particle filter is the Monte-Carlo solution of the Bayes’ equation. This is asequential Monte Carlo method.

IThe most simple algorithm of Monte Carlo type that solves the Bayesian filteringequations is called the bootstrap particle filter [Gordon et al. 1993] .

Sampling: Particles {x1,x2, . . . ,xM}.Pdf at time tk : pk(x)' ∑

Mi=1 ωk

i δ (x−xik).

Forecast: Particles propagated by

pk+1(x)'M

∑i=1

ωikδ (x−xik+1)

with xik+1 = Mk+1(xk).

Analysis: Weights updated according to

ωa,ik+1 ∝ ω

f,ik+1p(yk+1|xik+1) .

prior

likelihood

posterior

IAnalysis is carried out with only a few multiplications. No matrix inversion!

M. Bocquet Journee du groupe SAMA de l’IPSL, Ecole Normale Superieure, Paris, France, 13 March 2018 25 / 37

Page 26: Tutorial on data assimilation - Sciencesconf.org€¦ · Data assimilation: principles Outline 1 Data assimilation: principles De nition Mathematical framework 2 Main techniques 3D-Var

Advanced techniques Particle filters

Taking the bull by the horns: the particle filter

IThese normalised statistical weights have a potentially large amplitude of fluctuation.One particle (one trajectory of the model) will stand out among the others. Its weightwill largely dominate the others (ωi . 1). Then the particle filter becomes veryinefficient as an estimating tool since it has lost its variability. This phenomenon iscalled degeneracy of the particle filter [Kong et al. 1994].

Resampling One way to mitigate this phenomenon is to resample the particles byredrawing a sample with uniform weights from the degenerate distribution. Afterresampling, all particles have the same weight: ω i

k = 1/M.

IHandles very well, very nonlinear low-dimensional systems. But, without modification,very inefficient for high-dimensional models. Avoiding degeneracy requires a greatnumber of particles that scales exponentially with the size of the system. This is amanifestation of the curse of dimensionality.

M. Bocquet Journee du groupe SAMA de l’IPSL, Ecole Normale Superieure, Paris, France, 13 March 2018 26 / 37

Page 27: Tutorial on data assimilation - Sciencesconf.org€¦ · Data assimilation: principles Outline 1 Data assimilation: principles De nition Mathematical framework 2 Main techniques 3D-Var

Advanced techniques Particle filters

Application of the particle filter in the geosciences

IThe applicability of particle filters to high-dimensional models has been investigatedin the geosciences [van Leeuwen, 2009; Bocquet, 2010]. The impact of the curse ofdimensionality has been quantitatively studied in [Snyder et al., 2008]. It was known [Mackay

et al., 2003] that using an importance proposal to guide the particles towards regions ofhigh probability will not change this trend, albeit with a reduced exponential scaling,which was confirmed by [Snyder et al., 2015]: optimal importance sampling particle filter[Doucet et al., 2000; Bocquet, 2010; Snyder; 2011].

IParticle smoother over a data assimilation window, alternative and more efficientparticle filters can be designed, such as the implicit particle filter [Morzfeld et al., 2012].

IParticle filters can nevertheless be useful for high-dimensional models if the significantdegrees of nonlinearity are confined to a small subspace of the state space, e.g.Lagrangian data assimilation [Slivinski et al., 2015] .

I It is possible possible to design nonlinear filters for high-dimensional models such asthe equal-weight particle filter [van Leeuwen & Ades, 2010-2017].

I Localisation can be (should be?) used in conjunction with the particle filter [Reich et al.

2013; Potterjoy, 2016; Penny & Miyoshi, 2016; Farchi & Bocquet, 2018].−→ Alban Farchi’s talk.

I It has been applied in hydrology, nivology, climate, etc [Goosse, Dubinkina et al.].M. Bocquet Journee du groupe SAMA de l’IPSL, Ecole Normale Superieure, Paris, France, 13 March 2018 27 / 37

Page 28: Tutorial on data assimilation - Sciencesconf.org€¦ · Data assimilation: principles Outline 1 Data assimilation: principles De nition Mathematical framework 2 Main techniques 3D-Var

Uncertainty quantification of the best estimate

Outline

1 Data assimilation: principlesDefinitionMathematical framework

2 Main techniques3D-Var and optimal interpolationThe Kalman filterThe ensemble Kalman filter4D-Var

3 Advanced techniquesEnsemble variational methodsParticle filters

4 Uncertainty quantification of the best estimate

M. Bocquet Journee du groupe SAMA de l’IPSL, Ecole Normale Superieure, Paris, France, 13 March 2018 28 / 37

Page 29: Tutorial on data assimilation - Sciencesconf.org€¦ · Data assimilation: principles Outline 1 Data assimilation: principles De nition Mathematical framework 2 Main techniques 3D-Var

Uncertainty quantification of the best estimate

Case study: Chernobyl and Fukushima accidents

Figure XI. Surface ground deposition of caesium-137 released in Europe after the Chernobyl accident [D13].

I 30 deaths in the first days of the accidentI 200 000 evacueesI 30 km exclusion zoneIMid and long term sanitary impact: thyroid cancer (thousands of cases).

M. Bocquet Journee du groupe SAMA de l’IPSL, Ecole Normale Superieure, Paris, France, 13 March 2018 29 / 37

Page 30: Tutorial on data assimilation - Sciencesconf.org€¦ · Data assimilation: principles Outline 1 Data assimilation: principles De nition Mathematical framework 2 Main techniques 3D-Var

Uncertainty quantification of the best estimate

Case study: Chernobyl and Fukushima accidents

Caesium-137 deposition [IRSN database] Air quality monitoring network

M. Bocquet Journee du groupe SAMA de l’IPSL, Ecole Normale Superieure, Paris, France, 13 March 2018 30 / 37

Page 31: Tutorial on data assimilation - Sciencesconf.org€¦ · Data assimilation: principles Outline 1 Data assimilation: principles De nition Mathematical framework 2 Main techniques 3D-Var

Uncertainty quantification of the best estimate

Case study: Chernobyl and Fukushima accidents

turbulent

diffusion

wet deposition

dry deposition

radioactive decay

and physicochemical

transformations

advection

emissions

IModelled by PDEs of the transport processes and physical and chemicalparametrisations.ISource term usually unknown.IParameters of the physical parametrisations often poorly know (effective turbulentdiffusion, scavenging and dry deposition parameters).

M. Bocquet Journee du groupe SAMA de l’IPSL, Ecole Normale Superieure, Paris, France, 13 March 2018 31 / 37

Page 32: Tutorial on data assimilation - Sciencesconf.org€¦ · Data assimilation: principles Outline 1 Data assimilation: principles De nition Mathematical framework 2 Main techniques 3D-Var

Uncertainty quantification of the best estimate

Case study: Cost function

ISource-receptor relationship: H. Linear model.

IProblem usually solved using 4D-Var [Bocquet, 2012] or methods equivalent to therepresenter technique. Here, study focused on UQ of the best estimate [Liu et al., 2017].

I Log-normal errors for the prior and for the observations. Non-Gaussian statistics.

ICost function from Bayes’ rule:

L (z;θ) =− lnp(z|y,θ) =− lnp(y|z,θ)− lnp(z|θ) + lnp(y|θ)

=1

2‖lny− lnHxez‖2

R−1 +1

2‖z‖2

B−1 +1

2ln |R|+ 1

2ln |B|+ ξ .

ITwo strategies to quantify the uncertainty of the best estimate:

Bayesian hierarchy (HB):

p(x,θ |y) =p(y|x,θ)p(x|θ)p(θ)

p(y), p(x|y) =

∫dθ p(x,θ |y). (2)

Empirical Bayes (EB):p(x|y)≈ p(x|y,θ?). (3)

θ? here estimated by the Expectation-Maximisation (EM) algorithm.

M. Bocquet Journee du groupe SAMA de l’IPSL, Ecole Normale Superieure, Paris, France, 13 March 2018 32 / 37

Page 33: Tutorial on data assimilation - Sciencesconf.org€¦ · Data assimilation: principles Outline 1 Data assimilation: principles De nition Mathematical framework 2 Main techniques 3D-Var

Uncertainty quantification of the best estimate

Case study: Inversions (EB)

IUniform hyperparameters: R = r?I, B = b?I, where r? and b? are obtained from EM.

0 4 8 12 16 20 24 28 32 36 40date [h] from 25/04/1986 21:30 UTC

1012

1013

1014

1015

1016

releas

e rate [B

q/h]

(a)

CNPP source term (137Cs)Lognormal retrievalGaussian retrievalUnscear

11/03 13/03 15/03 17/03 19/03 21/03 23/03 25/03 27/03 29/03 31/03date [h]

1011

1012

1013

1014

1015

rele

ase

rate

[Bq/

h]

(c)

FDNPP source term (137Cs)Lognormal retrievalGaussian retrievalreferenceKatataTerada

IChernobyl and Fukushima-Daiichi source terms with Gaussian and lognormalassumptions on the observation errors. Comparison with the Unscear reference sourceterm.

M. Bocquet Journee du groupe SAMA de l’IPSL, Ecole Normale Superieure, Paris, France, 13 March 2018 33 / 37

Page 34: Tutorial on data assimilation - Sciencesconf.org€¦ · Data assimilation: principles Outline 1 Data assimilation: principles De nition Mathematical framework 2 Main techniques 3D-Var

Uncertainty quantification of the best estimate

Case study: UQ of the retrieved total radioactivity (EB)

IProbability density function of the total released activity for Chernobyl andFukushima-Daiichi.

75 80 85 90 95 100Total 137Cs released activity (PBq)

0

1

2

3

4

5

Dens

ity

(a)

UQ of the CNPP total released activityLaplace proposalnaive RTOunbiased RTOMCMC

12 14 18 20 22 24 26 28 30 32Total 137Cs released activity (PBq)

0.0

0.5

1.0

1.5

2.0

2.5

3.0

3.5

4.0

4.5

Density

(b)

UQ of the FDNPP total released activityLaplace proposalnaive RTOunbiased RTOMCMC

IEB: optimal hyperparameters are first determined. Followed by nonlinear sampling ofthe total activity using three methods: with a Laplace proposal, a random-then-optimisesampling, an unbiased random-then-optimise sampling and a basic MCMC.

M. Bocquet Journee du groupe SAMA de l’IPSL, Ecole Normale Superieure, Paris, France, 13 March 2018 34 / 37

Page 35: Tutorial on data assimilation - Sciencesconf.org€¦ · Data assimilation: principles Outline 1 Data assimilation: principles De nition Mathematical framework 2 Main techniques 3D-Var

Uncertainty quantification of the best estimate

Case study: Inversion (HB)

IFull solution of the Bayesian hierarchy (HB)

IObtained from a Monte Carlo Markov Chain (MCMC)

ITransdimensional analysis (adaptive grid). Here using only 20 grid cells for Chernobyland 40 grid cells for Fukushima.

0 4 8 12 16 20 24 28 32 36 40date [h] from 25/04/1986 21:30 UTC

1012

1013

1014

1015

1016

release rate [Bq/h]

(a)

CNPP source term (137Cs)Mean retrievalMedian retrievalUnscear

11/03 13/03 15/03 17/03 19/03 21/03 23/03 25/03 27/03 29/03 31/03date [h]

1011

1012

1013

1014

1015

release rate [Bq/h]

(c)

FDNPP source term (137Cs)Mean retrievalMedian retrievalReferenceKatataTerada

M. Bocquet Journee du groupe SAMA de l’IPSL, Ecole Normale Superieure, Paris, France, 13 March 2018 35 / 37

Page 36: Tutorial on data assimilation - Sciencesconf.org€¦ · Data assimilation: principles Outline 1 Data assimilation: principles De nition Mathematical framework 2 Main techniques 3D-Var

Uncertainty quantification of the best estimate

Case study: UQ of the retrieved total radioactivity (HB)

IProbability density function of the total released activity for Chernobyl andFukushima-Daiichi.

50 70 90 110 130 150 170Total 137Cs released activity (PBq)

0

1

2

3

4

5

Dens

ity

(b)

UQ of the CNPP total released activity

0 10 20 30 40 50 60Total 137Cs released activity (PBq)

0.0

0.5

1.0

1.5

2.0

2.5

3.0

3.5

4.0

4.5

Dens

ity

UQ of the FDNPP total released activity

IFull solution of the Bayesian hierarchy; obtained from an MCMC.

ITransdimensional analysis (adaptive grid). Here using only 20 grid cells for Chernobyland 40 grid cells for Fukushima.

M. Bocquet Journee du groupe SAMA de l’IPSL, Ecole Normale Superieure, Paris, France, 13 March 2018 36 / 37

Page 37: Tutorial on data assimilation - Sciencesconf.org€¦ · Data assimilation: principles Outline 1 Data assimilation: principles De nition Mathematical framework 2 Main techniques 3D-Var

Uncertainty quantification of the best estimate

Final word

Thank you for your attention!

IPart I: A gentle introduction to DA.

IPart II: More advanced topics including EnKFand EnVar.

IPart III: Applications of DA including emergingones such as: glaciology, biology, geomagnetism,medicine, imaging and acoustics, economics andfinance, traffic control, etc.

M. Bocquet Journee du groupe SAMA de l’IPSL, Ecole Normale Superieure, Paris, France, 13 March 2018 37 / 37