Upload
darrell-sharp
View
218
Download
0
Tags:
Embed Size (px)
Citation preview
Data Assimilation in Meteorology
Chris Budd
Joint work with Chiara Piccolo, Mike Cullen (Met Office)Melina Freitag, Phil Browne, Emily Walsh, Nathan Smith and Sian Jenkins (Bath)
Understanding and forecasting the weather is essential to the future of planet earth and maths place a central role in doing this
Accurate weather forecasting is a mixture of
• Careful modelling of the complex physics of the ocean and atmosphere
• Accurate computations on these models
• Systematic collection of data
• A fusion of data and computation
Data assimilation is the optimal way of combining a complex model with uncertain data
Observations from space
Upper-air observations Weather radar
Surface observations
Integrated forecasting process
NWP forecasts
Analysis Intervention
Fine-tuning
Forecast productsand guidance
Modelling the global atmosphere and ocean
Vertical exchange between layers of momentum, heat and moisture
Horizontal exchange
between columnsof momentum,
heat and moisture
Vertical exchange between layers of momentum, heat and salts by diffusion, convection and upwelling
Orography, vegetation and surface characteristics included at surface on each grid box
Vertical exchange between layersby diffusion and advection
15° W60° N
3.75°
2.5°
11.25° E47.5° N
Met Office Current Configurations
North Atlantic and Europe (NAE)
Met. Office Global/Regional Ensemble Prediction System (MOGREPS) became fully operational in Sep 2008 after 3 years of trials. Focus on short-range out to 72hr.
UK
• Global 25km L70 model (was 40km L50)
• Incremental 4D-Var Data Assimilation
• 60km 24m ETKF ensemble (was 90km L38)
• Regional NAE 12km L70 model
• Incremental 4D-Var DA
• 16km 24m L70 ETKF ensemble (was 24km L38)
• UK 1.5km model (stretched) (was 4km)
• Incremental 3D-Var DA
Data: Sources of observation
Observation Volumes in 6 hours
CategoryCount %
usedCategory Count %
used
TEMPs 637 99% Satwinds: JMA 26103 4%
PILOTs 307 99% Satwinds: NESDIS 142478 3%
Wind Profiler1355 39% Satwinds:
EUMETSAT220957 1%
Land Synops
16551 99% Scatwinds: Seawinds
436566 1%
Ships 3034 84% Scatwinds: ERS 27075 2%
Buoys 8727 63% Scatwinds: ASCAT 241626 4%
Amdars 64147 23% SSMI/S 532140 1%
Aireps 7144 12% SSMI 698048 1%
GPS-RO 776 99% ATOVS 1127224 3%
AIRS 75824 6%
IASI 80280 3%
Performance Improvements
Met Office RMS surface pressure error over the N. Atlantic & W. Europe
“Improved by about a day per decade”
Andrew Lorenc
DA Introduced
What are the causes of improvements to NWP systems?
1. Model improvements, especially resolution and sub grid modelling
2. Better observations
3. Careful use of forecast & observations, allowing for their information content and errors.
Achieved by variational data assimilation
e.g. of satellite radiances.
Basic Idea of Data Assimilation
Numerical Weather Prediction NWP calculation gives a predicted state with an error
True state of the weather is
Make a series of observations y of some function of the true state
Eg. Limited set of temperature measurements with error€
H(x t )€
xb
Now combine the prediction with the observations
€
x t
Both the NWP prediction and the data have errors.
Can we optimally estimate the atmospheric state which is consistent with both the prediction and the data and estimate the resulting error?
NOTE: Approximately
10^9 degrees of freedom
10^6 data points
So significantly underdetermined problem
Assume initially:
1. Errors are unbiased Gaussian variables
2. Data and NWP prediction errors are uncorrelated
3. H(x) is a linear operator
yBest state estimate
analysis
Use to produce forecast
6 hours later
Data
NWP prediction
€
xa
€
xa
€
x f
€
x f
€
xB
€
xB
Can estimate using Bayesian analyis:
Maximum likelihood estimate of data y given
Best RMS unbiased estimate of the true state: BLUE
Minimum error variance
€
P(x t y)
P(x t )P(y) = P(y x t )
LikelihoodPrior
Posterior€
xa
€
x t
Assumptions about the error
Data error: Gaussian, Covariance R
Background (NWP) error: Gaussian, Covariance B
BLUE: Find which minimises
€
J(xa ) =1
2xa − xb( )
TB−1 xa − xb( ) +
1
2(Hxa − y)T R−1(Hxa − y)
€
xa
NOTE:
€
P(x y) /P(x) = e−J (x )
€
xB
If R and B are known then the best estimate of the analysis is
€
xa = xb + K(y − Hxb ), K = BHT (R + HBHT )−1
A = KRKT + (I − KH)B(I − KH)T
Kalman filter: Continuously updates the forecast and its error given the incoming data.
Covariance of the analysis error
€
J(xa ) =1
2xa − xb( )
TB−1 xa − xb( ) +
1
2(Hxa − y)T R−1(Hxa − y)
Implementation:
In the context of minimising the functional
This is implemented as 3D-VAR (since 1999 in the Met Office)
: Background, derived from 6 hour NWP forecast
: Analysis
: NWP forecast using as initial data
€
xB
€
xa
€
x f
€
xa
Ensemble Kalman Filter EnKF
This is a widely used Monte Carlo method that uses an ensemble of forecasts to estimate the terms in the Kalman filter
Idea: Take a large number of initial states and estimate the resulting background states
Estimate
€
x =1
Nxb,i,∑ , B =
1
N −1xB ,i − x ( ) xB ,i − x ( )
T∑
€
x i
€
x i€
x i
€
x i
€
x i
€
x i
€
xB ,i
€
xB ,i
€
xB ,i
€
xB ,i
€
xB ,i
€
xB ,i
y
Basic Filtering Idea
Advantages: Works well with high dimensional systems
Disadvantage: EnKF inaccurate with strong nonlinearity eg. Shear flow [Jones]
€
xa
€
xB
€
xa
€
xB
Correctedforecast
Previousforecast
4D VAR … Preferred variational method
Use window of several observations over 6 hours
Assimilation Window
9 UTC 12 UTC 15 UTC
Obs.
x
Time
xb
xa
Jo
Obs.
Jo
Jb
Obs.
Jo
Obs.
Jo
4D-VAR idea: Evolutionary model M (nonlinear)
Unknown initial state
Times Over a time window
Leads to state estimates
Data yi over window
Find so that the
estimates fit the data
Smoothing
€
x0
€
x1,x2,...
€
x0
€
t = t0, t1, t2,L
€
J(x0) =1
2x0 − xb( )
TB−1 x0 − xb( ) +
1
2(Hx i − y i)
T R−1(Hx i − y i)i= 0
N
∑
Minimise
Subject to the strong model constraint
€
x i+1 = M i(x i)
At present assume perfect model, but can also deal with certain types of model error (both random and systematic) by using a weak constraint instead
€
J(xa ) =1
2x0 − xb( )
TB−1 x0 − xb( ) +
1
2(Hx i − y i)
T R−1(Hx i − y i)i= 0
N
∑
+ λ i
i= 0
N−1
∑ (x i+1 − M x i)
Usually solved by introducing Lagrange multipliers
And solving the adjoint problems
€
0 =∇Ji = HT R−1(Hx i − y i) + λ i−1 − λ i mi x i
€
0 =∇J0 = B−1(x0 − xb ) + HT R−1(Hx0 − y0) − λ 0 m0 x0
Solution: Find to minimise nonlinear function J
Need forward calculation to find and backward solve to find
VERY expensive for high dimensional problems!!! Only have limited time to do the calculation (20 mins)
Incremental 4D-Var: Cheaper!
1. Assume is close to
2. Linearise J about and minimise this function using an iterative method eg. BFGS
3. Repeat if needed (not usually)
BUT: Relies on assumption of near linearity to work well
€
x0
€
x0
€
x i
€
xB
€
xB
€
λi
Very effective method!!
Met Office operational in 2004
[Lorenc, …. ]
Used by many other centres
Estimation of the background and covariance errors
Good estimates of the covariance matrices R and B are important to the effectiveness of 4D-VAR
1. To get the physics correct
2. To avoid spurious correlations between parameters
3. To give well conditioned systems
NOTE: B is a very large matrix, difficult to store and very difficult to update. Impractical to calculate using the Fokker-Plank equation
R Different instrument error characteristics and errors of
representativeness
B Enormous: 10^8 x 10^8
Deduce structure from:
Historical data
Known dynamical and physical structure of the atmosphere eg. Balance relationships
[Bannister]€
1 5 6
2 9 4
8 0 3
⎛
⎝
⎜ ⎜ ⎜
⎞
⎠
⎟ ⎟ ⎟
Build meteorology into the calculation of B through
Control Variable Transformations (CVTs)
IDEA: Choose more ‘natural’ physical variables which have uncorrelated errors so that the transformed covariance matrix is block diagonal or even the identity
Set€
χ
€
δx = Uχ = U pUvUhχ , B = UUT
Reduces the complexity of the system AND gives better conditioning for the linear systems
€
U p−1
€
Uv−1
€
Uh−1
Reduces vertical correlations by projecting onto empirical orthogonal vertical modes
Separates physical parameters into uncorrelated ones eg. temperature, wind, balanced and unbalanced
Effective, but errors arise due to lack of resolution of physical features leading to spurious correlations.
Reduces horizontal correlations by projecting onto spherical harmonics
Eg. Problems with stable boundary and inversion layers and assimilating radiosonde data
Poor resolution leads to inaccurate predictions of fog and ice
Solution one: increase global resolution
VERY EXPENSIVE!!!
Solution two: locally redistribute the computational mesh to resolve the features
Cheap and effective! [Piccolo, Cullen, B,Browne, Walsh]
€
δx = Uχ = U pUaUvUhχ , B = UUT
Adjust the vertical coordinates to concentrate points close to the inversion layer and reduce correlations
Introduce an extra transformation
€
Ua−1
Adaptive mesh transformation applied to latitude-longitude coordinates
Do this by using tools from adaptive mesh generation methods for PDES
€
dz
dξ= M(z)
Set: z original height variable
new ‘computational’ height variable
€
ξ
Relate these via the equation
M called the ‘monitor function’ [B, Huang, Russell, Walsh]
Take M large if there is active meteorology
Eg. High potential vorticity
€
M = 1+ c 2 ∂θ
∂z
⎛
⎝ ⎜
⎞
⎠ ⎟
2
Initially use background state estimate, then update
© Crown copyright Met Office
Monitor function and the Adaptive Grid
Piccolo&Cullen
QJR Met Soc 2011
© Crown copyright Met Office
First calculation
UK4 domain: 3 Jan 2011 00z
Updated calculation
Applied to the Met Office UK4 model
RMS T (K) RH (%) u (m/ s) v (m/s)Control 0.76 0.045 1.32 1.16
Test 0.64 0.045 1.29 1.16
Nobs 1011 901 819 819
Test case: 8th Feb 2010.
Significant reduction in RMS error especially for temperatures Piccolo&Cullen, QJR Met Soc 2011
Particularly effective for the 2m temperatures
Used together with Met Office Open Road software to advise councils on road gritting over Christmas
Adaptive mesh implemented operationally in November 2010.
Now extending it to a fully three dimensional implementation [B,Browne,Piccolo]
Other refinements to 4D-Var
Change in the background norm [Freitag, B, Nichols]
Total variation: Gives significantly better resolution of fronts, shocks and other localised features
But .. Hard to implement in high dimensions!!
€
J(x0) =1
2TV (x0 − xb ) +
1
2(Hx i − y i)
T R−1(Hx i − y i)i= 0
N
∑
Dealing with model error
If model has random errors with Covariance C can extend
4D-Var to find the minimiser of
€
J(x0) =1
2x0 − xb( )
TB−1 x0 − xb( ) +
1
2(Hx i − y i)
T R−1(Hx i − y i)i= 0
N
∑ +
1
2∑ x i+1 − M i(x i)( )
TC−1 x i+1 − M i(x i)( )
€
x0, x1, x2,L
However, most model errors eg. Numerical errors are systematic. Dealing with these and quantifying the uncertainty in DA is an area of active research
[Jenkins, B, Smith, Freitag], [Cullen&Piccolo], [Stuart]
Dealing with nonlinearity
Better use of appropriate (eg. Lagrangian) data
Tuning method to data [Jones, Stuart, Apte]
Use of particle filters
and MCMC methods [Peter Van Leeuwan]
Lot of research into finding a compromise between dealing with the high dimensionality and nonlinearity in the system
Conclusions
Data assimilation is an optimal way of
merging models with data
Useful for model tuning, validation,
evaluation, uncertainty quantification and reduction
Very effective in meteorology
Many other applications to Planet Earth
eg. Climate change, oil reservoir modelling, geophysics, energy management and even crowd behaviour
Problems with the simple Kalman Filter
• Assumption of Gaussian random variables
• Assumption of linearity
• Assumption of known covariances
• Covariance matrix B is VERY large for meteorological problems
• Minimisation is a large complex problem
Problems with 4D Var Accuracy
• Estimation of the background covariance matrix
• Ill conditioning of the linear systems
• Reliance of near linearity
• Inappropriate use of background data
• Incorrect covariance between data
• Unresolved random and systematic model error
• Poor resolution in the model
Improvements subject to much research