Upload
others
View
1
Download
0
Embed Size (px)
Citation preview
THE UNIVERSITY OF READING
Entropy and Potential Vorticity in Dynamical
Core Atmosphere Models
Tim Woollings
A thesis submitted for the degree of Doctor of Philosophy
Department of Meteorology
October 2004
Declaration
I confirm that this is my own work and the use of all material from other sources has been
properly and fully acknowledged.
Tim Woollings
i
Abstract
Firstly, it is shown that the normal modes of atmospheric motion in isentropic coordinates
are best simulated using Charney-Phillips type vertical grids. Then, an isentropic model is
developed which uses a new technique to handle the intersection of model levels with the
ground, and which uses the PV conservation equation as a prognostic equation, so that the PV
evolution can be constrained to be closer to its evolution in the real atmosphere. Unfortunately,
the model is stable only for short periods. Nevertheless, the simulation of most of the life
cycle of an idealised baroclinic wave has been possible, and this is compared in detail with that
simulated by a normalised pressure coordinate model, the IGCM.
Some of the technical problems plaguing previous isentropic models have been overcome
here. For example, the surface flow is well represented, with strong fronts predicted which are
much more similar to those in a high resolution IGCM simulation than those the low resolution
IGCM predicts.
Different properties of the atmosphere are represented well in the two models, indicating
that both perspectives are valuable. Many of the synoptic differences are shown to result from
the contrasting small-scale dissipation mechanisms in the two models. The isentropic model
conserves entropy excellently, but it exhibits a drift in total energy that constitutes a more
serious error than that in entropy in the IGCM. This latter model shows a systematic increase
in global entropy, which is believed to be partly responsible for the cold bias of similar climate
models. The net increase of entropy is shown to be the residual effect of frequent changes
in the entropy of individual air parcels, so that the IGCM is seen to violate the Lagrangian
conservation law for entropy to a much greater extent than suggested by the global drift.
ii
Acknowledgements
I am deeply grateful to my supervisor, John Thuburn; without his enthusiastic support, guid-
ance and encouragement none of this would have happened. I am also very grateful to my thesis
committee, Brian Hoskins and Stephen Belcher, who have shown great interest and given in-
valuable advise. I have benefited greatly from interesting discussions with so many people both
in Reading, and elsewhere. A special mention must go to Mike Blackburn and John Methven
for their help with the IGCM, and to Manuel Pulido for his help with the isentropic model. I’d
also like to thank NERC for my funding.
Many thanks to my fellow PhD students past and present, and to all my friends in Reading,
Victoria and elsewhere, who have helped immensely and ensured that I’ve had a great few
years, and also to my parents for their ongoing support. Special thanks go to Helen for fantastic
proof-reading, and so much more.
iii
“The law that entropy always increases - the second law of thermodynamics - holds, I think,
the supreme position among the laws of nature. If someone points out that your pet theory
of the universe is in disagreement with Maxwell’s equations - then so much the worse for
Maxwell’s equations. If it is found to be contradicted by experiments - well, these experimen-
talists do bungle things sometimes. But if your theory is found to be against the second law
of thermodynamics I can give you no hope; there is nothing for it but to collapse in deepest
humiliation.”
Sir Arthur Stanley Eddington, in The Nature of the Physical World. Maxmillan, New York,
1948, p. 74.
iv
Contents
1 Introduction 1
1.1 The PV-θ Concept . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
1.2 Isentropic Modelling of the Atmosphere . . . . . . . . . . . . . . . . . . . . . 6
1.2.1 Advantages of the Isentropic Coordinate . . . . . . . . . . . . . . . . . 6
1.2.2 Disadvantages of the Isentropic Coordinate . . . . . . . . . . . . . . . 8
1.2.3 Pure Isentropic Models . . . . . . . . . . . . . . . . . . . . . . . . . . 9
1.2.4 Hybrid Models . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2.5 Isopycnal Ocean Models . . . . . . . . . . . . . . . . . . . . . . . . . 16
1.2.6 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
1.3 PV as a Prognostic Variable in Models . . . . . . . . . . . . . . . . . . . . . . 18
1.4 Outline of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
2 Vertical Grid Staggering in Isentropic Coordinates 21
2.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
2.2 The Equation Set . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22
2.3 Analytical Solution . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26
2.4 Numerical Solution . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
2.5 Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
2.6 A Hydrostatic System . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42
2.7 Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 47
2.8 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49
3 The Isentropic Model 50
3.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 50
3.2 Boundary Formulation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 52
3.3 Controlling the Massless Region . . . . . . . . . . . . . . . . . . . . . . . . . 57
3.4 The Advection Scheme . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 58
3.4.1 PV Advection . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 59
3.4.2 Flux Limiter for Density Advection . . . . . . . . . . . . . . . . . . . 59
Contents vi
3.4.3 Sub-gridscale Fit Near the Ground . . . . . . . . . . . . . . . . . . . . 60
3.4.4 Summary of the Mass Advection Step . . . . . . . . . . . . . . . . . . 62
3.5 Summary of the Model Timestep . . . . . . . . . . . . . . . . . . . . . . . . . 63
3.6 Basic Model Tests . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 64
3.6.1 Solid Body Rotation . . . . . . . . . . . . . . . . . . . . . . . . . . . 64
3.6.2 Perturbing the Massless Region . . . . . . . . . . . . . . . . . . . . . 66
3.6.3 Flow over a Hill . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 68
3.7 The Surface Divergence Effect . . . . . . . . . . . . . . . . . . . . . . . . . . 71
3.8 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 77
4 The LC1 Baroclinic Life Cycle 79
4.1 Baroclinic Instability and a History of the LC1 Cycle . . . . . . . . . . . . . . 79
4.2 The IGCM . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 82
4.3 An Overview of the LC1 Cycle in the T42 IGCM . . . . . . . . . . . . . . . . 84
4.4 A High Resolution Version of the IGCM LC1 Cycle . . . . . . . . . . . . . . . 89
4.5 Initial Conditions for the Isentropic Model Simulation . . . . . . . . . . . . . . 92
4.6 Overview of the LC1 Cycle in the Isentropic Model . . . . . . . . . . . . . . . 94
4.7 The IGCM LC1 Cycle with∇4 Hyper-diffusion . . . . . . . . . . . . . . . . . 101
4.8 The Terminal Instability . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 103
4.9 Surface Fronts . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 104
4.10 Surface PV Anomaly . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 111
4.11 Global Conservation Properties of the Two Models . . . . . . . . . . . . . . . 115
4.12 PV Conservation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 119
4.13 Effect of the Relaxation Scheme on the LC1 Cycle . . . . . . . . . . . . . . . 123
4.13.1 The Surface Divergence Effect . . . . . . . . . . . . . . . . . . . . . . 123
4.13.2 Sensitivity to the Relaxation . . . . . . . . . . . . . . . . . . . . . . . 124
4.14 Smoothness of ∇2M . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 126
4.15 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 128
5 Entropy Production in the IGCM 130
5.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 130
5.1.1 Entropy and the Second Law . . . . . . . . . . . . . . . . . . . . . . . 131
5.1.2 The Entropy Budget of the Atmosphere . . . . . . . . . . . . . . . . . 132
Contents vii
5.1.3 Some Specific Motivations . . . . . . . . . . . . . . . . . . . . . . . . 134
5.2 Entropy Production in the LC1 Cycle . . . . . . . . . . . . . . . . . . . . . . 136
5.2.1 Magnitude and Sensitivity of the Source . . . . . . . . . . . . . . . . . 136
5.2.2 Location and Attribution of the Source . . . . . . . . . . . . . . . . . 140
5.2.3 Cross-Isentrope Mass Flow . . . . . . . . . . . . . . . . . . . . . . . . 146
5.3 The Entropy Theory for the Cold Bias . . . . . . . . . . . . . . . . . . . . . . 148
5.3.1 Summary of Johnson (1997) . . . . . . . . . . . . . . . . . . . . . . . 148
5.3.2 Analysis of the IGCM Results . . . . . . . . . . . . . . . . . . . . . . 150
5.4 Alternative Theories . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 157
5.5 A Comment on the MEP Theory . . . . . . . . . . . . . . . . . . . . . . . . . 160
5.6 Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 162
5.7 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 165
6 Conclusions 167
6.1 Summary of Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 168
6.2 Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 170
A The LC2 Cycle 174
References 180
Chapter 1
Introduction
The atmosphere is a three-dimensional fluid system capable of supporting three-dimensional
turbulence, and as such might be expected to exhibit even greater complexity than it actually
does. However, the combined effects of buoyancy stratification and rotation strongly constrain
the flow, acting, for example, to inhibit large-scale three-dimensional turbulence. The potential
complexity is further reduced because the atmosphere is known to satisfy several conservation
laws. Over timescales of the order of a few days, the large-scale motion of the atmosphere is
dominated by its adiabatic component (unless strong latent heating occurs), where an adiabatic
process is one in which an air parcel does not exchange heat with its surroundings. Under
adiabatic flow an air parcel will conserve its potential temperature θ. This quantity is intimately
linked to the stratification, which acts to inhibit vertical motion and thus inhibit mixing across
surfaces of constant θ. These are referred to as isentropic surfaces; the specific entropy is
proportional to ln θ, so a surface of constant potential temperature is also a surface of constant
entropy. These surfaces are nearly horizontal, sloping gradually with respect to the ground but
almost never overturning, only doing so under the action of very small scale three-dimensional
eddies. Since, above the boundary layer, θ almost always increases monotonically with height
it can be used as a vertical coordinate for the atmosphere; it is referred to as an isentropic
coordinate. The adiabatic motion of the atmosphere is then two-dimensional, with individual
air parcels only moving along isentropic surfaces and never across them.
Deeper understanding can be obtained by considering the potential vorticity or PV, another
scalar quantity which is conserved by a fluid parcel under adiabatic, frictionless flow. The es-
sential idea is that there is potential for changing the vorticity of the parcel by changing its
latitude, and hence the size of the effect of the Earth’s rotation, or by adiabatically changing
Introduction 2
the separation between isentropic surfaces, resulting in vortex stretching. The dominant adi-
abatic component of atmospheric flow can therefore be visualised by the evolution of PV on
isentropic surfaces, acting as a tracer for individual air masses. The PV, however, differs from
a passive tracer in that it is a dynamical quantity whose distribution actually determines the
subsequent evolution of a large, and dominant, component of the flow. A given PV anomaly on
an isentropic surface is said to ‘induce’ a wind field which in turn acts to redistribute the PV.
The apparent complexity of the atmosphere is thus greatly reduced by thinking in terms
of the evolution of PV on isentropic surfaces, often referred to as PV-θ or isentropic PV (IPV)
thinking. Use of the PV-θ concept has greatly improved understanding of atmospheric dynam-
ical processes from diagnostic studies, both in synoptic weather systems (e.g. Hoskins et al.
(1985)) and in the general circulation (e.g. Hoskins (1991)). Attempts to make use of this
in the field of numerical modelling by building models based on the PV-θ concept have en-
countered significant problems. To predict the evolution of the PV field from the field itself is
much simpler if the discrete model surfaces are isentropic surfaces. In the troposphere these
slope downwards from the poles towards the equator to the extent that all isentropic surfaces
which are below the tropopause at the poles intersect the Earth’s surface at some point before
the equator (see e.g. Figure 3.4). This implies intersections of model levels with the boundary
in an ‘isentropic’ numerical model, and it is technical difficulties in handling these intersec-
tions that has most hindered the development of weather and climate prediction models which
attempt to take advantage of the PV-θ concept.
The first aim of the work presented in this thesis is to build a numerical model which pre-
dicts PV on isentropic model levels using a new technique to handle the surface intersections,
and to perform a detailed comparison of this model with a more conventional one in the sim-
ulation of an idealised atmospheric disturbance. A further aim is to investigate entropy errors
in the conventional model; are these serious enough to make isentropic models an attractive
alternative, despite the technical difficulties? We begin with a very brief introduction to the
PV-θ concept, concentrating on those aspects which are of most relevance to this project.
1.1 The PV-θ Concept
Potential temperature θ is defined by
θ = T
(p
p0
)−κ, (1.1)
Introduction 3
in terms of the temperature T , pressure p and the constant κ = R/cp, where R is the gas
constant and cp is the specific heat at constant pressure. It is the temperature an air parcel
would have if it were to be transported adiabatically to the reference pressure p0. Potential
temperature is conserved following an adiabatic, frictionless flow, so that
Dθ
Dt= 0. (1.2)
By taking the gradient of (1.2), and using both this and mass conservation, it can be shown
from the vector vorticity equation that the Ertel PV
Q =1
ρζa · ∇θ (1.3)
is conserved following a fluid parcel. Here ρ is the density and the absolute vorticity is
ζa = 2Ω +∇× u, (1.4)
where u is the full three-dimensional velocity vector and Ω is the rotation vector of the Earth.
This conservation is remarkably general, and holds for any adiabatic, frictionless motion. If
we then make the shallow atmosphere approximation, the 2Ω cosφ terms, associated with the
local horizontal component of Ω, are neglected in order to retain conservation of energy and
angular momentum. The planetary term then reduces to fk, where f is the Coriolis parameter
2Ω sinφ in terms of the latitude φ, and k is the unit vector in the vertical. If hydrostatic balance
is then assumed, the horizontal components of the vorticity are also neglected. Even with these
approximations, the definition of PV is still complicated, containing the three-dimensional
product of a curl and a grad, but this simplifies significantly when expressed in isentropic
coordinates as
Q =f + k · ∇θ × v
σ, (1.5)
where
σ = −1
g
∂p
∂θ(1.6)
is termed the isentropic density. Note that here v is the horizontal velocity field. The conser-
vation of PV following an adiabatic, frictionless flow is expressed as
DQ
Dt= 0. (1.7)
Introduction 4
This section is intended as a brief review only, so proofs of these statements are not given
here. More details and proofs are given in the review by Hoskins et al. (1985), who emphasise
two main principles underlying the PV-θ concept. The first is this Lagrangian conservation
principle, and the second is that of invertibility.
Given the mass distribution in isentropic coordinate space and some balance relation be-
tween the mass and wind fields, and provided it is done globally with appropriate boundary
conditions, the PV field can be ‘inverted’ to give the balanced component of the flow. Differ-
ent balance relations can be used. For example, the simplest and least accurate is geostrophic
balance; the PV then contains all the dynamical information about the geostrophic flow. The
inversion corresponds to the mathematical inversion of an elliptic operator and therefore has a
smoothing effect. Discussions often centre around the effect of isolated PV anomalies. These
can be either positive or negative with respect to the background PV, and are generally a com-
bination of anomalies in static stability and vorticity. The wind field induced by a PV anomaly
will have smoother features than the anomaly itself and will also extend over a larger scale,
both horizontally and vertically. This latter feature is known as the scale effect. An analogy
has been drawn between the wind field induced by a PV anomaly and the field induced by an
electrical charge. This analogy was investigated by Bishop and Thorpe (1994, 1995), motivated
by the problem of how to attribute different components of a given wind field to the effects of
a number of different PV anomalies.
The picture given so far is of PV being advected in two-dimensional motion along isen-
tropic surfaces in exactly the same way as an inert, non-diffusive chemical tracer. While useful
to some extent, this analogy can be misleading, as shown by Haynes and McIntyre (1987,
1990). PV can be regarded as the mass mixing ratio of an imaginary ‘PV substance’ or PVS
(that is, a ratio describing the amount of PVS per unit mass which is, however, permitted to
take negative values). Isentropic surfaces act as if they are impermeable to PVS so that there
can be no net transport of PVS across any isentropic surface. PVS is exactly conserved away
from lateral boundaries, i.e. it cannot be created or destroyed in a layer bounded only by two
isentropic surfaces. PVS can only be created or destroyed at lateral boundaries, i.e. at the
Earth’s surface if such a layer intersects it. This theorem is exact in that it holds regardless of
whether frictional or diabatic forces act and regardless of whether hydrostatic balance is as-
sumed, e.g. even in regions of three-dimensional turbulence and associated diabatic mixing. In
such regions mass (and also chemical species) can be transported across isentropic surfaces but
these diabatic mass transports act to concentrate or dilute the PVS on either side of the isen-
Introduction 5
tropic surface rather than to transport PVS across it. Away from the boundary, PVS can only
be transported along isentropic surfaces or, more precisely, along the intersections of isentropic
surfaces and surfaces of constant Bernoulli function B = cpT + 12u2 + gz (Schar (1993)). This
statement holds even for diabatic, frictional flow when the mass flux is no longer along these
intersections.
The rapid formation of thin filaments of vorticity is a well-known feature of fluid dynam-
ics, and in the atmosphere it is the PV which is drawn out into thin filaments in this way. It is
therefore the potential enstrophy Q2, or the variance of the PV field, which is said to cascade
to small length scales (e.g. Salmon (1998)). Filaments of high and low PV on an isentropic
surface are frequently seen to wrap up together before ultimately mixing under the action of
diabatic processes such as heating or turbulence. In this process mass is invariably transported
across isentropic surfaces but PVS is not. As the filaments become narrower their dynamic
effect is reduced in line with the scale effect. When the fine-scale structure is integrated over,
destructive interference results in large cancellations between the effects of positive and neg-
ative PV anomalies. The circulation around an isentropic contour enclosing this fine-scale
structure represents all the dynamical effect of the PV features within it, and the details of the
fine-scale structure can be replaced by a uniform PV value within the contour. This is known
as coarse-grain PV, and it is this quantity which is generally derived from observed or averaged
fields. All of the theorems above can be shown to hold also for coarse-grain PV.
Discrete numerical models by their very nature have difficulty in simulating the potential
enstrophy cascade. Some form of scale-selective dissipation is invariably used to dissipate
potential enstrophy at the gridscale, where it would otherwise accumulate. From the discussion
above it appears that an ideal model would dissipate potential enstrophy by re-arranging PV
on isentropic surfaces only, while simultaneously allowing just the correct amount of mass to
move across these surfaces. In the vast majority of models, however, both PV and θ are merely
diagnostic variables and the cascade is halted by numerical diffusion acting on model variables
such as wind, vorticity or temperature. One of the central aims of this thesis is to improve
understanding of how well this represents the underlying PV-θ dynamics of the atmosphere.
One final development is of relevance. The potential temperature varies across the Earth’s
surface, and as a result the boundary conditions for PV dynamics are inhomogeneous, compli-
cating the application of the PV-θ concept to problems where boundary effects are significant.
In quasi-geostrophic theory a quantity can be derived, referred to as quasi-geostrophic PV or
QGPV, which is analogous to the PV in some ways but is not an approximation to it. One aspect
Introduction 6
of the relationship between the two is that the variation of the QGPV on a surface of constant
pressure or height is proportional to the variation of the Ertel PV on an isentropic surface. A
key difference is that while the Ertel PV is conserved following an air parcel, the QGPV is con-
served following the geostrophic flow so that, for example, there is no vertical advection term
in the conservation equation for QGPV. See Andrews et al. (1987), Section 3.8.3 or Hoskins
et al. (1985) for more details. Bretherton (1966) introduced the concept of boundary QGPV.
The underlying principle is that the dynamical effect of the varying surface θ field is identical
to that induced by replacing it with a constant θ but imposing a QGPV distribution confined
to an infinitesimally thin layer at the surface. When an isentropic surface hits the ground it is
envisioned to extend along just above the surface. Many such isentropic surfaces pile up with
negligible mass separating them and the lowest of these is exactly coincident with the ground.
The boundary condition is then one of constant θ, that of this lowermost surface, and so is
homogeneous. This theory has recently been extended to arbitrary, non-geostrophic flows by
Schneider et al. (2003). In the impermeability theorem of Haynes and McIntyre (1987, 1990),
creation and destruction of PV is permitted where an isentropic surface intersects the ground.
This can be simply interpreted as an exchange of boundary and interior PV.
There is a clear similarity between this view of PV-θ dynamics and the massless layer ap-
proach to handle the lower boundary problem employed in isentropic models. In this approach
an isentropic model level which intersects the ground is extended with negligible mass along
the surface (see Sections 1.2.3 and 3.2). Previous work in the field of isentropic modelling is
now reviewed, starting with a comprehensive list of reasons for using, and for not using, an
isentropic vertical coordinate.
1.2 Isentropic Modelling of the Atmosphere
1.2.1 Advantages of the Isentropic Coordinate
1. An air parcel conserves its potential temperature θ under adiabatic motion. As mentioned
by Bleck (1973), the only significant synoptic-scale process acting in the free atmosphere
which changes the potential temperature of a parcel on timescales of the order of a few
days is latent heating. This means that, except in regions with strong latent heating such
as fronts, isentropic surfaces in the free atmosphere act as material surfaces. When the
atmosphere is viewed in an isentropic framework, the large adiabatic component of its
Introduction 7
seemingly three-dimensional motion is therefore reduced to a stack of two-dimensional
motions. By using this coordinate in an atmospheric model the size of the vertical advec-
tion terms is greatly reduced, resulting in a reduction of the model’s truncation errors;
this was the motivation for the original isentropic model of Eliassen and Raustein (1968).
2. The coordinate is adaptive in nature, in that the model surfaces move close together in re-
gions of strong temperature gradients, which are likely to be of importance. Atmospheric
fronts are represented by the tight packing of isentropic levels between air masses of dif-
ferent temperatures. The front lies nearly parallel to the isentropic surfaces and so the
discontinuities seen in z or p space are much smoother features when viewed in θ space.
Shapiro (1975) quantified this by noting that on z and p surfaces which cut through a
front the variations of temperature and velocity occur on scales of order 100 km, but that
on isentropes contained within the front the variations of pressure and velocity occur
on scales of order 1000 km. There is potential, then, to simulate fronts accurately at a
significantly lower horizontal resolution using the isentropic coordinate.
3. In isentropic coordinates it is easy to satisfy the fourth of the integral constraints of the
continuous governing equations laid out by Arakawa and Lamb (1977), namely that the
integral over the entire mass of any function of θ is conserved under adiabatic processes.
In an isentropic model this reduces to the conservation of mass within each isentropic
level, which is relatively simple to enforce.
4. The conservation of the mass in every isentropic layer also ensures that the unavailable
part of the potential energy of the system is conserved, since this is given by the mass
distribution in θ space (see Section 4.1). The unavailable part is by far the largest pro-
portion of the total potential energy, and this in turn is far larger than the kinetic energy
so that the greater part of the total energy of the system is automatically conserved.
5. Hsu and Arakawa (1990) showed that in isentropic coordinates it is possible to construct
a vertical scheme that formally conserves both energy and angular momentum with-
out resorting to a Lorenz-type grid, which supports potentially damaging computational
modes (see Chapter 2).
6. The definition of Ertel’s PV (after making the hydrostatic and shallow atmosphere ap-
proximations) is simplest in isentropic coordinates (1.5), and does not involve the vertical
Introduction 8
derivative of the velocity. This should make it easier to conserve the PV in a vertically
discrete system.
7. In isentropic coordinates the horizontal pressure gradient force is given by −∇θM ,
where M is the Montgomery potential. This force is therefore represented by an irrota-
tional vector and so does not yield a horizontal circulation along any coordinate surface,
which should reduce the impact of the discretization error on the total circulation. In
other coordinate systems this force is generally expressed as the gradient of two oppos-
ing terms and so is a greater source of numerical error. As an aside it has recently been
suggested by Purser et al. (2004) that this force can be expressed as the gradient of one
‘modified Montgomery’ potential in a special hybrid σ − p coordinate framework.
8. It is believed that moist processes such as condensation heating could be simulated more
accurately in isentropic coordinates. As described by Konor and Arakawa (2000), this is
because these processes are highly dependent on the moisture transport, which is more
accurate in the absence of any truncation errors in vertical advection (see Johnson et al.
(1993, 2000, 2002)). Mahowald et al. (2002) showed that transport of chemical species
is improved when using isentropic coordinates for the same reason. Properties such as
water vapour and chemical species can exhibit large gradients in the vertical and are thus
especially susceptible to spurious numerical dispersion and truncation errors.
1.2.2 Disadvantages of the Isentropic Coordinate
1. The isentropic coordinate surfaces slope in the atmosphere and intersect the ground.
There are technical difficulties associated with representing these intersection points in
a discrete model, and furthermore in allowing them to be advected along the ground as
the surface θ distribution evolves. These intersections have always been the underlying
cause of the greatest problems encountered by isentropic models, in the guise of gravity
wave noise, instabilities or lack of conservation, and are still the source of the outstanding
problems today.
2. In the real atmosphere it is possible for unstable layers with ∂θ∂z< 0 to exist. This, how-
ever, is forbidden in isentropic models because the assumption of a monotonic variation
of θ with z must be made for it to be a valid coordinate choice. Without this assumption
it would be possible for z and p to become multivalued at a point in the model’s (x, y, θ)
Introduction 9
space, a problem referred to as the ‘folding’ of coordinate surfaces. The hybrid models
discussed in Section 1.2.4 are motivated partly by a desire to avoid this limitation.
3. The mass within an isentropic model layer can become very small, and indeed this is to
be expected given the adaptive nature of the coordinate. This has caused computational
problems in the past.
4. In isentropic coordinates some important regions, most notably the tropical troposphere,
are less well resolved in the vertical than the rest of the domain, due to the distribution
of θ throughout the atmosphere. Similarly there will be a lack of vertical resolution in
less stable regions near the surface, potentially making it harder to incorporate standard
boundary layer processes there.
5. If surface heating is included in an isentropic model then the coordinate surfaces may
become vertical near the ground. It therefore seems likely that a separate boundary layer
scheme would be necessary in a model more complete than the one developed here.
6. The pressure on an isentropic model level depends only on the temperature, so if numer-
ical errors introduce noise to the temperature profile, this can result in irregularly spaced
levels, potentially leading to further inaccuracies (see Webster et al. (1999)).
1.2.3 Pure Isentropic Models
Despite the clear advantages to be gained by diagnosing the state of the atmosphere in an isen-
tropic coordinate framework, the approach had only limited success, at least until the 1960s,
and so was used little by the meteorological community. As described by Bleck (1973), in
addition to the difficulties near the surface, this was at least partly due to a flaw in the adopted
procedure for calculating the Montgomery potential M on isentropes (which is analogous to
the geopotential Φ on pressure surfaces). The flaw was eventually discovered by Danielsen
(1959), but by this time pressure was firmly in place as the vertical coordinate of choice.
The first attempt to overcome the anticipated shortcomings of the isentropic coordinate
and create a prognostic, primitive equation isentropic model of the atmosphere was made by
Eliassen and Raustein (1968), and it was remarkably successful. Their model consisted of three
information surfaces: the ground and two isentropic surfaces, the lowest of which intersected
the ground. Horizontal velocities were predicted on all levels, as well as the pressure on the
Introduction 10
isentropic levels, and the surface potential temperature θs. A technical problem had to be over-
come, namely how to calculate horizontal differences at the point where the lowest isentropic
level intersected the ground and so terminated. To do this, imaginary values were created just
below the surface and used in the horizontal differencing. The pressure at these points was ex-
trapolated vertically by prescribing that the static stability in the imaginary underground region
was identical to that in the real air just above the surface.
The results were encouraging. A 72 hour test integration was completed successfully,
though a special non-linear diffusion was needed. Sharp fronts formed in the simulation so that
one of the key advantages of the isentropic coordinate already appeared to have been realised.
In a following paper, however, Eliassen and Raustein (1970) extended the model to include
five isentropic levels and noted that no such clear fronts developed. This they attributed to the
fact that the earlier model highly underestimated the static stability, due to its coarse vertical
resolution, and so frontogenetic circulations were exaggerated.
Results from the extended model, however, were still impressive, and the model performed
well enough to be used by Trevisan (1975), with a slightly modified underground extrapola-
tion, in a study to improve understanding of the influence of orography on cyclone formation.
Shapiro (1975) used the same model in his study of upper level frontogenesis, this time with
20 isentropic levels. The modified underground extrapolation technique used information from
both of the two lowest isentropic surfaces, giving improved results when several isentropic lev-
els intersected the surface in close proximity, and when the lowest isentropic level contained
only a small amount of mass. The model simulated upper level frontogenesis well, so that the
anticipated ability of the isentropic coordinate to adapt to give increased resolution in frontal
regions had indeed been realised.
By this time a predecessor of the isentropic model presented in this thesis had also been
developed. Bleck (1973) recognised not only the promise shown by the isentropic coordinate
but also its close association with the PV, and developed a balanced isentropic model with PV
as the only prognostic variable. It was Bleck (1973) that first described an improved extrap-
olation of the type used later by Trevisan (1975) and Shapiro (1975). The balanced model
performed well, successfully handling four cases of explosive cyclogenesis which ‘had consti-
tuted indisputable forecast failures’.
Later, however, Bleck (1974) abandoned the balanced model in favour of a primitive equa-
tion version which predicted the three variables u, v and p on isentropic surfaces. This version,
surprisingly, was significantly cheaper, and also ‘outperformed the potential vorticity model by
Introduction 11
a wide margin’. In this paper the first detailed discussion was given of an ongoing problem
which appears to have affected all the pure isentropic models mentioned so far. An undesirably
high level of gravity wave noise affected the flow near the surface, and Bleck showed that, as
suspected, the source of this noise was the points where the isentropic model levels intersected
the ground. A further, perhaps more intrinsic, problem with the underground extrapolation
models was that it proved hard to maintain conservation of integrated quantities such as energy
and angular momentum. These problems proved so significant that research focussed on the
possibility of coupling an upper level isentropic coordinate to a terrain-following one at the
surface to form a hybrid model. The next advance in the quest for a pure isentropic model was
not for a further 10 years.
Bleck (1984) identified the fundamental flaw holding back all of the previous models.
Each of these needed to predict the location of the intersection points by solving a prognostic
equation for θs on the regular horizontal grid. Inevitably this θs field would not always agree
exactly with the isentropic level thicknesses predicted by solving the continuity equation, and
imbalances and subsequent gravity wave noise would result. This is clear with hindsight and
in the light of recent results. As discussed in Section 4.9, there appears to be no limit to
the strength of a frontal gradient in adiabatic fluid dynamics, and isentropic models such as the
one described in this thesis, and that of Fulton and Schubert (1991), form fronts with incredibly
large temperature gradients. This happens as the mass between isentropic surfaces in the frontal
region becomes very small. Such gradients cannot be exhibited by the θs field, however, since
the gradient of θs is limited both by the horizontal grid scale and by any smoothing device
employed in the model, as had been noted by Bleck (1973). Disagreement between θs and the
distribution of isentropic levels was therefore inevitable at surface fronts, where large amounts
of noise in the pure isentropic models was reported by Bleck (1978).
Bleck (1984) then pioneered a new method recently enabled by a development in numeri-
cal advection techniques; that of Flux-Corrected Transport (FCT) schemes by Boris and Book
(1973) and Zalesak (1979). The idea behind FCT is to combine the effects of two different
schemes. The first is a low-accuracy scheme which has a diffusive effect but does not produce
unwanted ripples in the advected field; the second is higher-order accurate but prone to produce
ripples. The FCT combination has only a small diffusive effect and is free from ripples, i.e. no
new maxima or minima are produced. The details are similar to the flux-limiter described in
Section 3.4.2 so will not be given here.
The key benefit for isentropic models is that FCT allows the mass within an isentropic
Introduction 12
layer to go to zero anywhere within the domain without ever becoming negative (which would
constitute a new minimum). Bleck (1984) then developed the first prognostic model to use the
so-called massless layer idea first used by Lorenz (1955). An isentropic level which intersects
the ground is then envisioned to extend along just above the surface containing negligible
mass. The FCT algorithm ensures that there is no overshoot as the mass field goes to zero,
and so no negative mass is produced. The horizontal pressure gradient force (HPGF) terms
near the intersections (which previously depended on extrapolated underground values) are
now evaluated using values from the massless region, where the HPGF is formulated in Bleck
(1984) to be identical to the surface HPGF. Crucially, with the massless layer approach there is
no need for a predicted θs field; a method is given in this paper for obtaining it from the mass
distributions in the intersecting isentropic levels.
Bleck achieved good results with this method, and the most significant development in
pure isentropic modelling since, that of Hsu and Arakawa (1990), is based on his technique. In
this model considerable effort was put into the numerical schemes to ensure that the infinite PV
values implied in an isentropic layer of infinitesimal mass did not affect the above ground flow,
and also that dispersion errors were minimised at the intersection of a level with the ground,
where the gradient of the mass is discontinuous. The lower boundary condition is implemented
by defining M on the lowest massy level of a given column, i.e. the lowest level containing
mass. This is used to start the integration of the hydrostatic equation upwards from the surface,
so that θs is not needed and is never explicitly calculated. The same technique is used in the
model presented here, so further details can be found in Section 3.2. In a test run, the model
of Hsu and Arakawa (1990) simulated the evolution of a baroclinic wave with no significant
computational difficulties at the surface. The model is formulated with a strong emphasis on
conservation properties, indicating that use of the coordinate was becoming developed enough
for long-term integrations to be considered. The vertical scheme formally conserves both angu-
lar momentum and energy, and the horizontal momentum differencing conserves energy while
allowing the potential enstrophy to be dissipated at small scales, as suggested by Sadourny and
Basdevant (1985).
However, there were two, somewhat related, outstanding problems, the first of which is
mentioned at the end of Hsu and Arakawa (1990) and later in Arakawa et al. (1992). It was
found that the model had difficulty in representing sufficiently smooth M distributions, at all
levels, in points above the locations of the intersections. This is attributed to a lack of degrees
of freedom in the implied surface temperature distribution, since this is determined solely by
Introduction 13
the number of model levels intersecting the surface. The problem was studied in detail by
Randall et al. (2000) in a model combining the vertical scheme of Hsu and Arakawa (1990)
with a geodesic horizontal grid based on the Z grid of Randall (1994). M is affected by noise
throughout the domain to the extent that the geostrophic vorticity
ζg = ∇.(
1
f∇M
)(1.8)
is almost discontinuous at all points above an intersection, even in a test with a continuous
prescribed pressure distribution. The reason given is that, although θs is never explicitly calcu-
lated, the flow at all levels is affected by the discontinuous nature of the θs field implied by the
intersection with the ground of a number of levels of constant θ. The lower boundary condition
for the hydrostatic equation∂M
∂θ= Π(p) (1.9)
is therefore discontinuous, andM is affected at all levels during the integration of (1.9) up from
the surface. The solution proposed by both Arakawa and Randall is that the vertical resolution
should be matched to the horizontal resolution so that on average there is one isentropic level
intersecting the surface for every horizontal grid cell. Further discussion of this problem is
given in Section 4.14 where results from the isentropic model developed here are presented.
The second outstanding difficulty is attributed to essentially the same cause: the lack of
degrees of freedom in the implied θs field. Arakawa et al. (1992) studied fronts at mid and
upper levels in their model, as had Shapiro (1975) for the same reason, which was simply
that surface fronts were not well simulated by either model. However, while Shapiro’s model
failed because it explicitly predicted the θs field (as described above), Arakawa et al. (1992)
attributed the failure of their model to the fact that it did not. The θs field implied by the
intersecting of model levels effectively had too low a horizontal resolution to simulate a front,
they concluded, because the vertical resolution of 10 K level spacing used was too low. The
surface fronts predicted by the model described here are discussed in Section 4.9.
As a final comment in this section it should be noted that one of the major early concerns
over the isentropic coordinate was unfounded. It was anticipated that the coordinate surface
might overturn so that the pressure would become multi-valued at a given (x, y, θ) location.
This has never occurred in any of the isentropic models discussed here and the reason is laid
out clearly by Hsu and Arakawa (1990). By choosing θ as a vertical coordinate its variation
Introduction 14
with height is assumed to be monotonic. An isentropic model predicts the mass between, or
pressure difference across, two θ surfaces, and it is implicitly assumed by the monotonicity of
θ that this mass is distributed without folding of coordinate surfaces. Therefore an isentropic
model automatically adjusts any unstable layers with ∂θ∂z< 0 in the same way as the explicit
adjustment applied in any conventional model. Furthermore, as mentioned by Eliassen and
Raustein (1970), a dramatic folding of the isentropes is very unlikely as this ‘would normally
correspond to a significant increase in the available potential energy of the system’.
1.2.4 Hybrid Models
Significant success over the years has been achieved in the field of hybrid isentropic coordinate
modelling. The intention here is to avoid both the technical difficulties at the ground, and also
the restriction imposed by demanding that ∂θ∂z
> 0 near the surface, by using a generalised
vertical coordinate, which is typically terrain following near the surface and isentropic higher
up. Arakawa (2000) notes that in the presence of surface heating the movement of air parcels
near the ground is in fact closer to terrain-following surfaces than θ surfaces, and so a hybrid
coordinate may be even closer to a true material coordinate than an isentropic coordinate is.
Furthermore, the lack of resolution in the tropical troposphere is a fundamental drawback of the
pure isentropic model which has so far only been overcome by adopting a hybrid coordinate.
Bleck (1978) reviewed a selection of proposed methods of coupling a σ coordinate domain
to an isentropic one, concluding that surface noise was indeed significantly reduced in com-
parison with pure isentropic models. Note that in this section σ represents the conventional
terrain-following vertical coordinate p/ps rather than the isentropic density.
An early concern was that it would be hard to guarantee satisfaction of the various conser-
vation laws generally demanded of atmospheric models and yet still maintain full interaction
between the two domains. This concern, however, was dispelled by Uccellini et al. (1979) with
their careful formulation of a hybrid model in which isentropic levels in the free atmosphere in-
tersected a fixed number of σ levels near the surface. The University of Wisconsin hybrid θ−σmodel described by Zapotocny et al. (1994) is descended from that of Uccellini et al. (1979).
As hoped the hybrid model performs significantly better than a pure σ version in representing
the transport of trace constituents. This is an obvious benefit for the modelling of the important
stratosphere-troposphere exchanges of water vapour and chemical species, and it is for this
reason that Mahowald et al. (2002) chose to use a hybrid model for their study of stratospheric
Introduction 15
transport. Several chemical transport models (CTMs) employ isentropic or hybrid-isentropic
coordinates for this reason (e.g. Chipperfield (1999), McKenna et al. (2002)).
Zhu et al. (1992) formulated the first hybrid model (the HIGCM) to feature a smooth
transition between the σ and θ regions, thus eliminating the problem of noise generation due to
imbalances at the points where isentropic surfaces intersected the σ domain. As many levels as
possible were at constant θ, with at least one level at the surface of constant σ and some levels in
between on which θ and σ both varied. Early tests of the model were successful, although extra
vertical diffusion had to be included to avoid noisy temperature profiles and irregularly spaced
model levels. Thuburn (1993) performed further experiments, concluding that the anticipated
benefits of the isentropic coordinate ‘appeared to have been realised only in a small way’. The
vertical scheme was improved in Webster et al. (1999) so that the ‘ad-hoc’ vertical diffusion
was no longer necessary, and results improved. The problems with the vertical scheme were
attributed partly to the use of the Lorenz vertical grid which is now known to have drawbacks
(see Chapter 2). The same hybrid coordinate scheme was also implemented in a different GCM
by Zhu and Schneider (1997) who found significant improvements in the model’s simulation
of the stratosphere.
A fundamental disadvantage of the HIGCM scheme is that a large proportion of the tro-
posphere is not modelled isentropically. No isentropic surface which intersects the ground can
be used as a model level, and yet surfaces in the vicinity of 300 K which touch the surface in
the tropics also graze the tropopause near the poles. An alternative was proposed by Bleck and
Benjamin (1993) in which an individual coordinate surface is able to be isentropic in one re-
gion of (x, y) space and to follow a constant σ surface in another. Hence, a coordinate surface
can follow an intersecting isentropic surface until it is near the ground, and then is ‘nudged’ to
follow a σ surface instead. In this way a greater portion of the domain is modelled using the
advantageous isentropic coordinate. This scheme performed well enough to be the basis for
the RUC model of Benjamin et al. (2004) which is currently used operationally by NCEP.
Konor and Arakawa (1997) achieved a similar effect with their generalised vertical coor-
dinate η = F (θ, σ). An expression for F was chosen that was a function of σ only near the
surface, or where ∂θ∂z. 0, but quickly and smoothly became only a function of θ away from
these regions. The Charney-Phillips vertical staggering was used to avoid problems caused by
the Lorenz staggering as encountered by the HIGCM.
There are therefore three hybrid isentropic coordinate formulations in use in the research
community at this time. Current developments in isentropic modelling, of which there are
Introduction 16
many, are based on these hybrid methods. The Wisconsin model has evolved into the hybrid
θ−η model described by Schaack et al. (2004), which no longer has a discrete interface between
σ and θ domains. This model is currently undergoing testing in both weather forecasting
and climate scenarios with standard NCEP and NCAR physics packages incorporated, and
forms the dynamical core of the NASA/Wisconsin Regional Air Quality Modelling System
(RAQMS, Pierce et al. (2003)). A project is also under way to couple the Wisconsin θ −η model with the OMEGA model of Bacon et al. (2000) to produce a model with a finite-
difference formulation on isentropic surfaces above about 336 K and an adaptive, unstructured
grid formulation below. Version 3 of the Colorado State University model is under construction
as the Coupled Colorado State Model or CCoSM. The atmospheric component will feature the
hybrid coordinate of Konor and Arakawa (1997), a new explicit time-stepping scheme stable
for Courant numbers above one (Arakawa and Konor (2004), personal communication) and a
new ‘embedded’ planetary boundary layer comprising, by definition, the lowest nmodel levels.
Finally, there is the RUC model (Benjamin et al. (2004) based on Bleck and Benjamin (1993))
which is, at this time, the only hybrid-isentropic model in the world to be used operationally
for numerical weather prediction.
1.2.5 Isopycnal Ocean Models
While presenting his balanced atmospheric model, Bleck (1973) argued that skeptics of the
isentropic coordinate ‘should literally attempt to change their perspective and look at this prob-
lem in terms of an arbitrary boundary in a regularly spaced (x, y, θ) mesh’. This viewpoint is
even more natural in the ocean context where all models have to include lateral boundaries.
Furthermore, the interior of the ocean is dominated by adiabatic processes to a greater extent
than the atmosphere, making a material coordinate surface model even more attractive. The
presence of variable salinity S in the ocean results in a nonlinear equation of state, which gives
density as a function of T , p and S. The potential density is given by replacing T and p with θ
and the reference pressure p0 in this function, and it is this which is conserved by a fluid parcel
under adiabatic motion in the ocean, in the same way as θ is in the atmosphere.
Models with potential density as a vertical coordinate are termed isopycnal models and
show much similarity to isentropic atmosphere models. Development in the two disciplines
has, in fact, been closely related. For example, the isopycnal ocean model of Bleck and Boudra
(1986) and Bleck et al. (1992) is based on the massless layer technique introduced in the at-
Introduction 17
mosphere model of Bleck (1984), and the hybrid atmosphere model of Bleck and Benjamin
(1993) is based on the scheme introduced in the ocean model of Bleck and Boudra (1981).
It is interesting that, due to the extremely adiabatic nature of motion in the ocean interior,
and the small scale of the important eddies relative to typical resolutions, much work has been
done to ensure that the diabatic motion is not exaggerated in ocean models with non-isopycnal
coordinates. In particular, the conventional Laplacian-based diffusion operators acting on non-
isopycnal model surfaces clearly mix fluid across isopycnal surfaces, constituting spurious
diabatic motion. Common solutions are to rotate the diffusion operator to act parallel to isopy-
cnal surfaces (e.g. Redi (1982)), to diffuse isopycnal level thickness (Gent and McWilliams
(1990)), or some combination of the two (Griffies (1998)). The majority of modern ocean
models employ one or other of these techniques. In contrast, however, there have been few
attempts to stray from the standard∇q diffusion employed in most atmospheric models.
1.2.6 Summary
• There are both advantages and disadvantages to using an isentropic vertical coordinate.
However, it should be noted that in general the advantages spring from the fact that the
isentropic coordinate framework is fundamentally a natural one in which to view the
motion of the atmosphere; the disadvantages, on the other hand, are largely technical.
• Much success has been achieved with hybrid models, with only the region closest to the
surface not being modelled isentropically in modern schemes. It has been argued that
motion here is not isentropic in any case. Some benefit, however, could still be obtained
from the adaptive nature of the isentropic coordinate. This, after all, is the region where
the strongest atmospheric fronts are to be found.
• Pure isentropic models have also reached a certain maturity, with only two outstanding
problems described in the literature:
1. Noise in the Montgomery potential throughout the domain.
2. Inability to simulate surface fronts.
Both are attributed to the lack of degrees of freedom available to represent the surface
potential temperature.
Introduction 18
• Considerable effort has been made in the ocean modelling community to ensure that
parameterizations of the sub-gridscale flow do not result in spurious diabatic motions
in non-isopycnal models. Little or no effort however has been made to ensure this in
non-isentropic atmosphere models.
1.3 PV as a Prognostic Variable in Models
QGPV has frequently been used as a prognostic variable in quasi-geostrophic numerical models
of the atmosphere (e.g. Sadourny and Basdevant (1985)) but its counterpart, the Ertel PV of
equation (1.3), has rarely been used in non-geostrophic models and examples are few and far
between.
One such example is in the field of contour dynamics or contour advection. The evolution
of PV distributions, including the cascade to small scales, has been well simulated by these
techniques (e.g. Waugh and Plumb (1994)), in which imaginary particles are placed along a
contour of constant PV. These particles are then advected by a wind field, either prescribed from
observations or GCM runs or determined from the PV distribution, and their final positions de-
fine the new location of the contour. Recently, Mohebalhojeh and Dritschel (2004) performed
a trial study of the use of the contour-advective semi-Lagrangian algorithm of Dritschel and
Ambaum (1997) in a three-dimensional primitive equation model, and achieved encouraging
results.
Numerical shallow-water models have been developed which explicitly predict PV, for ex-
ample Temperton and Staniforth (1987) and Thuburn (1997) (from which the model described
here is ultimately descended). A different approach is taken by Lin and Rood (1997); while PV
is not a predicted variable in their model, a ‘reverse engineering’ procedure is used to ensure
that the PV is simulated better. The prognostic equations in u, v and thickness h are discretized
in such a way that the mass and absolute vorticity are transported consistently. The correlation
between the two fields is better preserved during transport, and so the shallow-water PV (ζa/h)
is transported more accurately.
Another shallow-water example is the model of Bates et al. (1995), which has recently
been developed into a full primitive equation model by Li et al. (2000). A variable termed
pseudo-PV is defined, and then used as a prognostic variable. Results are shown only for a
σ vertical coordinate. Future development is planned for a version built on the hybrid σ − θ
coordinate of Konor and Arakawa (1997), and the pseudo-PV has been constructed so as to
Introduction 19
reduce to the real PV in the isentropic part of the domain. The model developed in this thesis
can be viewed as an attempt to perform in a pure isentropic model what Li et al. (2000) are
attempting in a hybrid model.
The PV evolution is known to be central to the behaviour of the dominant, balanced pro-
cesses acting in the atmosphere, so it is perhaps surprising that so few attempts have been made
to improve model accuracy by directly predicting the PV. Two potential difficulties are, at least
partly, to blame for this. Firstly, if PV is predicted then the divergence must also be predicted in
any unbalanced model. The divergence equation is complicated, containing several terms other
than pure advection, and so it is difficult, and non-intuitive, to formulate numerical schemes for
it. Secondly, the PV field has to be ‘inverted’ in some way to find the winds at every timestep.
This adds to the computational expense of a model, and introduces potential for instabilities.
Robertson (1984) adapted the model of Hoskins and Simmons (1975) to predict PV on σ co-
ordinate levels. The inversion step was found to be very sensitive to the static stability, and the
model became unstable when ∂∂σ
ln θ became small. Robertson recommended changing to an
isentropic vertical coordinate to overcome this. As described above, this has its own technical
difficulties, which is certainly a further reason why attempts to formulate models predicting
PV have been limited.
1.4 Outline of the Thesis
The overall aim is to construct a global numerical model of the atmosphere solving the primi-
tive equations with PV as a prognostic variable, and with θ as a vertical coordinate throughout
the entire domain, and to compare this in detail with a conventional model. Is this approach
useful, and are the errors in entropy and PV in the conventional model significant?
Model development will be described in the first two chapters. Chapter 2 presents results
of an idealised study to determine which vertical scheme, out of many potential contenders,
best represents the linearised normal mode solutions of the continuous equations. Both hy-
drostatic and non-hydrostatic cases are considered. This work is part of a larger study across
three different vertical coordinates, published as Thuburn and Woollings (2005). The hydro-
static scheme recommended by this study is already used in the model of Gregory (1999) from
which the global model used here is constructed. This construction is described in Chapter 3,
which also includes some simple tests of the model and descriptions of known problems.
The remainder of the thesis is given over to a detailed comparison of this model with
Introduction 20
a conventional normalised pressure coordinate model (the Intermediate General Circulation
Model or IGCM). Note that from this point onwards the term ‘normalised pressure’ is used
for the quantity p/ps to avoid confusion with the isentropic density σ. Chapter 4 compares
an idealised baroclinic wave life cycle simulation performed by each model. Chapter 5 then
focusses on the IGCM life cycle and its representation of the entropy. This is motivated by two
controversial hypotheses which attribute large features of both real and modelled atmospheric
states to the effects of small terms in the global entropy budget. The extent to which filaments
of PV and gradients of θ are dissipated at small scales, as described in Section 1.1, is critical
to these theories.
Chapter 2
Vertical Grid Staggering in Isentropic
Coordinates
2.1 Introduction
When constructing a numerical model of the atmosphere there is freedom in the choice of
predicted variables used, and also how the variable set is arranged in space. Often it is desirable
to stagger one variable with respect to another, so that the two variables are not stored at exactly
the same points but at interleaving levels. In order to find the optimal scheme one has to
consider various combinations of variables and all the possible ways of staggering them, since
the best grid for one set of variables will not necessarily be the best for a different set.
There are many criteria which have to be considered when deciding which is the optimal
configuration. For example it is important that the scheme respects the conservation properties
of the continuous equation set, and that it couples well with the physical parameterisations used
in the model. Here we attempt to judge schemes by their ability to simulate normal mode so-
lutions of the continuous equations, and in particular to accurately represent their frequencies.
It is important to represent these modes accurately; the acoustic modes for correct simulation
of adjustment to hydrostatic balance, the gravity modes for correct simulation of adjustment to
geostrophic balance and the Rossby modes because of their meteorological significance.
This approach has been used previously to assess grids. For example, Randall (1994)
tested horizontal grids on their ability to simulate the dispersion relation of inertia-gravity
waves in the shallow-water equations. Fox-Rabinovitz (1994, 1996) judged vertically stag-
gered grids according to their representation of dispersion relations and group velocities. This
Vertical Grid Staggering in Isentropic Coordinates 22
work, however, is based on equations in a log(pressure) coordinate system and restricts atten-
tion to one particular choice of variables. Finally, Konor and Arakawa (2000) used a similar
technique to judge vertical discretizations of the moisture continuity equation in an isentropic
coordinate model.
The work presented here parallels studies by John Thuburn on vertical grids in a height
coordinate and a mass-based terrain-following coordinate. Results for all three studies are
given in Thuburn and Woollings (2005). Here we examine the basic atmospheric equations in
isentropic coordinates, which are linearised so that they support simple normal mode solutions
that can be found analytically. Each of the various cases of variables and grid staggering is then
solved numerically for the mode frequencies and these are compared with the analytic values.
Here we consider various choices of thermodynamic variables and vertical grid staggering.
There is also freedom of choice in the wind velocities and the horizontal grid staggering, but
this is assumed to be a separate problem independent of the vertical scheme.
2.2 The Equation Set
The normal mode analysis was performed on the non-hydrostatic compressible Euler equations
in isentropic coordinates. The usual shallow atmosphere approximation is made, resulting in
the following equation set, stated here with the friction and heating terms for completeness:
Dv
Dt= −fk× v−∇M + F, (2.1)
Dw
Dt= − 1
σ
∂p
∂θ− g, (2.2)
∂σ
∂t+∇ · (σv) +
∂
∂θ(σθ) = 0, (2.3)
Dθ
Dt=Q
π. (2.4)
Here DXDt
= X = ∂X∂t
+ u∂X∂x
+ v ∂X∂y
+ θ ∂X∂θ
, v and∇ are the horizontal components of the
velocity and the gradient operator, F is friction, Q is heating and π(p) is the Exner function.
These equations were generated following the procedure laid out by Kasahara (1974), where
the basic atmospheric equations are converted into various vertical coordinate systems. The
system here differs from the isentropic example in Kasahara’s work in that here we do not
make the hydrostatic assumption, and so we maintain the vertical velocity w (= DzDt
) as a
Vertical Grid Staggering in Isentropic Coordinates 23
predicted variable. The small 2Ω cosφ Coriolis terms representing the horizontal component of
the Earth’s rotation Ω at latitude φ have been neglected, as usual, in order to retain conservation
of energy and angular momentum under the shallow atmosphere approximation. Initially we
will also neglect the latitudinal variation of the Coriolis parameter f = 2Ω sinφ, although
we include this later. The thermodynamic variables pressure p, density σ(= ρ ∂z∂θ
), height z,
Montgomery function M(= cpT + gz), total energy E(= cvT + gz) and the velocity variables
u and v are then small perturbations from this reference state, which is denoted by a subscript s
(e.g. ps, σs, zs, Ts). The equations are then linearised about an isothermal, hydrostatic reference
state, and now we neglect the diabatic heating and the effects of friction. With other subscripts
denoting partial derivatives, the simplest forms of the momentum equations are
∂u
∂t= fv − 1
σs(zsθpx − psθzx) = fv −Mx, (2.5)
∂v
∂t= −fu− 1
σs(zsθpy − psθzy) = −fu−My, (2.6)
∂w
∂t= − 1
σs(pθ + gσ) =
1
zsθ
(R
(psp0
)κp
ps−Mθ
), (2.7)
where p0 = 105 Nm−2. Note that two forms of the pressure gradient terms are given. These
will be referred to as the p-form and the M -form respectively. Each case studied has been
tested using both forms and that which gives the best results is shown here. It is anticipated
that if M is a predicted variable then the M -form will perform best. After all, one of the main
advantages of the isentropic coordinate is the fact that the horizontal pressure gradient force
can be expressed as the gradient of just one term, rather than two opposing ones. The five
possibilities considered here for the thermodynamic variables are predicted by the equations
∂p
∂t=
psκ− 1
(ux + vy +
wθzsθ
), (2.8)
∂σ
∂t= −σs(ux + vy), (2.9)
∂z
∂t= w, (2.10)
∂M
∂t=
R
κ− 1Ts
(ux + vy +
wθzsθ
)+ gw, (2.11)
∂E
∂t=
cvκ
κ− 1Ts
(ux + vy +
wθzsθ
)+ gw. (2.12)
Vertical Grid Staggering in Isentropic Coordinates 24
The full equation set is formed by adding two of these thermodynamic equations to the three
momentum equations above. Here we have considered the combinations
σ, z p, z p, σ p,M p,E σ,M z,M
so there are seven thermodynamic cases. Any thermodynamic variables needed in the tendency
equations but not explicitly stored are calculated from the others using the linearised diagnostic
equations:pps
= σσs
+ TTs− zθ
zsθ, T
Ts= κ p
ps,
M = cpT + gz, E = cvT + gz.
(2.13)
Note that the temperature perturbation T on an isentrope is directly proportional to the pressure
perturbation p. These two variables are therefore considered equivalent and only the pressure is
used as a prognostic variable in this work. Note also that there are other possible combinations
of these thermodynamic variables, namely the pairing ofE with σ , z andM . In general M and
E were found to be interchangeable with negligible effect on the results, so only oneE scheme
is given as an example. The rigid boundary condition w = 0 was used in these experiments.
The non-hydrostatic isentropic equations have never been studied in depth and it is inter-
esting that z, the height of the isentrope, appears as a thermodynamic variable either explicitly
or contained in the Montgomery function M or the total energy E. In order to eliminate z in
the (p, σ) thermodynamic case the height z had to be found at each level by integrating up over
the levels beneath:
z(θ) =
∫ θ
θB
zsθ
(σ
σs+ (κ− 1)
p
ps
)dθ, (2.14)
where θB is the bottom isentrope of the model. Here z = 0 has been applied on the boundary
as a consequence of the rigid boundary condition. Similarly for the (σ,M) case, z had to be
found by integratingzθzsθ− g(1− κ)
RTsz =
σ
σs− (1− κ)
RTsM, (2.15)
which is obtained by eliminating p from (2.13).
All the coefficients in the equations (2.5) to (2.12) are independent of the variables x, y
and t and so the system has wave-like solutions of the form exp i(kx + ly − ωt) and these
are the normal modes that will be examined. Assuming solutions of this form the equations
become:
−iωu = fv − ik
σs(zsθp− psθz) = fv − ikM, (2.16)
Vertical Grid Staggering in Isentropic Coordinates 25
−iωv = −fu− il
σs(zsθp− psθz) = −fu− ilM, (2.17)
−iωw = − 1
σs(pθ + gσ) =
1
zsθ
(R
(psp0
)κp
ps−Mθ
), (2.18)
−iωp =ps
κ− 1
(iku+ ilv +
wθzsθ
), (2.19)
−iωσ = −σs(iku+ ilv), (2.20)
−iωz = w, (2.21)
−iωM =cpκ
κ− 1Ts
(iku+ ilv +
wθzsθ
)+ gw, (2.22)
−iωE =cvκ
κ− 1Ts
(iku+ ilv +
wθzsθ
)+ gw. (2.23)
These equations assume a constant Coriolis parameter f so there is no β-effect in the
system, and therefore no Rossby restoring mechanism or time-dependent Rossby modes. These
modes, however, are of meteorological interest and so an attempt is now made to include a β-
effect in the equations in order to assess the ability of the various schemes to represent them.
If we simply replace the constant parameter f in equations (2.5) and (2.6) with the standard
β-plane approximation f = f0 + βy we no longer have constant coefficients, and so there
are no longer solutions of the form exp i(kx + ly − ωt). We begin by using this β-plane
approximation and taking the horizontal divergence and vertical component of the curl of the
horizontal equations (2.5) and (2.6) to give
∂δ
∂t= fξ − βu−∇2P, (2.24)
∂ξ
∂t= −fδ − βv, (2.25)
where δ = ux + vy, ξ = vx − uy and P is used to represent the pressure term 1σs
(zsθp− psθz)or M . With f and β now held constant, wave solutions proportional to exp i(kx+ ly−ωt) are
possible. Following White (2002), βu and βv need to be replaced by βudiv and βvrot (i.e. the
rotational and divergent components only) in order to retain energy conservation under this
approximation. The neglect of βurot and βvdiv are justifiable on the grounds of scale analysis.
For example, since β is of the same size as f/a (where a is the radius of the Earth), the βurot
term in (2.24) is much smaller than the fξ term because the length scale on which the velocity
varies is much smaller than the radius of the Earth. For solutions of this form equations (2.24)
Vertical Grid Staggering in Isentropic Coordinates 26
and (2.25) become
−iω(iku+ ilv) = f(ikv − ilu)− βudiv − (k2 + l2)P, (2.26)
−iω(ikv − ilu) = −f(iku+ ilv)− βvrot. (2.27)
These equations are then rearranged to give the u and v tendencies:
∂u
∂t= −iωu = fv +
ikβ
k2 + l2u− ikP, (2.28)
∂v
∂t= −iωv = −fu+
ikβ
k2 + l2v − ilP. (2.29)
Here the identities ikurot+ ilvrot = 0 and ikvdiv− iludiv = 0 have been used; i.e. the rotational
component is non-divergent and the divergent component is non-rotational. Equations (2.28)
and (2.29) are the same as (2.16) and (2.17) but with the two linear β terms added and we use
these as the prognostic equations in the horizontal. We have maintained the linear normal mode
structure but also included a β-effect. Rotating the wave-vector (k, l) effectively rescales the
magnitude of the β-effect. With l = 0 this is a maximum, so here attention is restricted to this
case only.
Although it is not the full β-plane approximation, this device modifies the equations in
the same way as the full approximation by adjusting the horizontal velocity tendencies, so is
equivalent for the purposes of judging vertical grid staggerings. With these β terms included
the equations can now be used to examine realistic normal mode behaviour for Rossby modes
as well as for acoustic and gravity modes.
For each case the complete equation set consists of five prognostic equations in five pre-
dicted variables: the horizontal momentum equations with the β-effect (2.28, 2.29), the vertical
momentum equation (2.18) and then the appropriate pair of thermodynamic equations (2.19-
2.23). The rigid boundary condition w = 0 is imposed on both upper and lower boundaries.
2.3 Analytical Solution
If we restrict ourselves to an isothermal reference state (i.e. Ts = constant) then the dispersion
relation can be found analytically. The method used here for judging discrete vertical schemes
is to compare the normal mode frequencies they predict with those given by this analytical
Vertical Grid Staggering in Isentropic Coordinates 27
solution. Since the mode frequencies are the same in isentropic coordinates as in height co-
ordinates, we can use the dispersion relation as derived in height coordinates in Thuburn and
Woollings (2005). An outline of this derivation is given here as this provides an insight into
the nature of the normal modes.
This derivation starts from the same basic equations but expressed in z coordinates with
predicted variables u, v, w, θ and p. In these coordinates the reference static stability is N 2s =
gθsz/θs, and with the assumption of an isothermal reference state both this quantity and the
reference sound speed squared, c2s = RTs/(1− κ), are constants. The variables u, v, w, and θ
are eliminated from the five equations to leave one equation in p:
(∂
∂z+ A
)(∂
∂z+B
)p+ Cp = 0, (2.30)
where the coefficients are
A = N2s
g, B = g
c2s,
C =
(1c2s
+(K2 + kβ
ω
) [(f − ilβ
K2
)2 −(ω + kβ
K2
)2]−1)
(ω2 −N2s ) .
(2.31)
The boundary condition w = 0 at z = 0 and z = D (the depth of the domain) becomes, after
elimination of the other variables,
(∂
∂z+B
)p = 0. (2.32)
We now look for solutions of the form p = p exp(− z2H
) where 1H
= (A+B) =(N2s
g+ g
c2s
).
Equations (2.30) and (2.32) become
(∂2
∂z2+(C − Γ2
))p = 0 (2.33)
with(∂
∂z+ Γ
)p = 0 on z = 0, D, (2.34)
where Γ = B−A2
= 12
(gc2s− N2
s
g
). These have solutions of the form
p = a exp(ikzz) + b exp(−ikzz), (2.35)
Vertical Grid Staggering in Isentropic Coordinates 28
where
k2z = C − Γ2. (2.36)
Applying the boundary conditions (2.34) imposes the following restrictions on the values of
kz:
a(ikz + Γ) + b(−ikz + Γ) = 0, (2.37)
a(ikz + Γ) exp(ikzD) + b(−ikz + Γ) exp(−ikzD) = 0, (2.38)
and this pair of equations in a and b can only be solved if
(Γ2 + k2z)(exp(ikzD)− exp(−ikzD)) = 0. (2.39)
This condition has to be satisfied by any solutions of the form considered here, and any value of
the frequency ω that satisfies this condition for some vertical wave number kz is the frequency
of a normal mode of the system with that vertical wave number.
There are two possibilities. Firstly, we could have (Γ2 + k2z) = 0, and by (2.36) this is
equivalent to C = 0, i.e.
(1
c2s
[(f − ilβ
K2
)2
−(ω +
kβ
K2
)2]
+
(K2 +
kβ
ω
))(ω2 −N2
s
)= 0. (2.40)
Solutions with ω2 = N2s correspond to p ≡ 0 and are not physically interesting solutions, but
the other roots given by
ω
[(f − ilβ
K2
)2
−(ω +
kβ
K2
)2]
+ c2sK
2
(ω +
kβ
K2
)= 0 (2.41)
are physically significant and this is the dispersion relation for the external modes. This is a
cubic equation in ω, the three roots corresponding to two acoustic modes and one Rossby mode.
In the dispersion plots which follow, the external modes are those with zero wavenumber.
Secondly, equation (2.39) is also satisfied if (exp(ikzD) − exp(−ikzD)) = 0, which in
turn is satisfied if kz = mπ/D for some positive integer m. Substituting this expression for kz
and the definition of C (2.31) into equation (2.36) gives a quintic equation for ω. The roots of
this quintic are the internal mode frequencies, and for every positive integer m there are five
solutions which correspond to two acoustic modes, two inertia-gravity modes and one Rossby
mode.
Vertical Grid Staggering in Isentropic Coordinates 29
The roots of these equations are calculated numerically using a standard iterative solver to
give the analytic values of the normal mode frequencies which the different discretizations are
tested against.
2.4 Numerical Solution
The values of the predicted variables are taken at either the full-levels of the model or at the
interleaving half-levels. For each case the equations are discretized by replacing any vertical
derivatives with a centred difference over the adjacent levels. For example, if the vertical
derivative of pressure pθ is needed on half-levels the difference is taken over the pressure values
from the full-levels immediately above and below the half-level, i.e. (pθ)k+1/2 =pk+1−pkθk+1−θk ,
where the full-levels k and k + 1 are separated by the half-level k + 1/2. A standard linear
interpolation is used to transfer variables between half-levels and full-levels when necessary, by
taking the average of its values on the adjacent levels (e.g. pk+1/2 = (pk +pk+1)/2). Intuitively
we would expect a good grid to be one which minimises the amount of averaging necessary, or
more specifically the averaging necessary in the largest and therefore most important terms in
the equations.
The boundary condition used is simply that the vertical velocity w is zero at both the top
and bottom boundaries of the model. In order to apply this condition it is necessary to store w
values at the boundaries, and for this reason both the boundaries are half-levels and w is always
stored on half-levels. The results shown here were generated on a grid of 21 half-levels with
20 interleaving full-levels. Each of the other four variables can either be stored on half-levels
(i.e. unstaggered with respect to w) or on full-levels (staggered with respect to w), so that
for every choice of thermodynamic variable pairing there are 16 possible choices of vertical
staggering. However, it is clear from the equations that any attempt to store the velocities
u and v on different levels will only add to the amount of averaging involved with no extra
advantage. With the additional constraint that u and v are stored on the same levels there are
eight staggering cases for each of the seven thermodynamic cases, making 56 different cases
to examine in total.
When the thermodynamic variables are p and σ it is necessary to evaluate the integral
given in equation (2.14) for the value of z. This is done by summing the value of the integrand
in the levels below, and a switch is built in to the program code to do this by summing either
half-level values or full-level values. Often it is necessary to add on a contribution from half
Vertical Grid Staggering in Isentropic Coordinates 30
a level either at the top or bottom of the sum, or at both. For example, summing full-level
values up to full-level i will give the value of the integral at the next half-level up, so if z is
required on full-levels an extra contribution will have to be added to get the value of the integral
at full-level i + 1, and this contribution represents the final half a level of the domain of the
integral. These contributions were approximated by taking half of the relevant complete level
contribution, following ideas in Simmons and Burridge (1981).
For the (σ,M) cases z has to be obtained from equation (2.15). This equation is discretized
at full level k with (zθ)k =zk+1/2−zk−1/2
θk+1/2−θk−1/2and zk =
zk−1/2+zk+1/2
2, and then re-arranged to give
zk+1/2 from zk−1/2. It can then be integrated up from z = 0 on the lower boundary to give z at
all half-levels.
A further switch is built in to the program to prescribe whether the model layers are equally
spaced in θ or in height. In the latter case the coordinate levels are still isentropic surfaces but
their θ values are chosen so that the levels are approximately evenly spaced in z. This configu-
ration did give better results, with most of the cases capturing a few more modes correctly than
with the layers evenly spaced in θ, and all the results shown here use this coordinate system. It
is perhaps to be expected that this system gives better results as the solutions we are trying to
capture have p proportional to exp(ikzz), i.e. there is a wave-like variation with z rather than
θ.
Once the equations are discretized they can be solved to give the normal mode structures
and frequencies. A state vector s is formed of all the predicted variables at all levels, and then
the full system of equations can be written as
−iωs = As, (2.42)
where the elements on the left-hand side are the variable tendencies (since ∂s∂t
= −iωs for
normal modes) and those on the right-hand side are the forcings. The elements of the matrix A
are simply the coefficients of the linear prognostic equations.
This is then an eigenvalue problem, where the eigenvectors sj are the state vectors of the
normal modes and the eigenvalues are −iωj , where ωj are the corresponding normal mode
frequencies. The problem is solved for both mode structures and frequencies using a standard
NAG library routine. The frequencies can then be compared with the analytic values and the
structure of any mode with an unexpected frequency can be examined.
The following parameter values were chosen: domain depth = 10 km, β = 1.619 ×
Vertical Grid Staggering in Isentropic Coordinates 31
10−11 s−1m−1 (corresponding to 45N), reference state temperature Ts = 250 K and horizontal
wavelength k = 1000 km.
2.5 Results
In general the accuracy of the vertical schemes was very varied. Many of the configurations
represented the dispersion relation extremely badly, with most of the modes either missing,
misrepresented or unstable. Others, however, gave very good results, with the acoustic and
gravity modes captured almost perfectly and just a slight deviation in the frequencies of the
Rossby modes.
As anticipated the schemes performed better using the M -form of the pressure gradient
terms if M was a prognostic variable. The results shown for all other cases used the p-form.
A numerical mode is classed as unstable if its frequency has a significant imaginary com-
ponent, specifically if |Im(ω)| ≥ 0.1 × |Re(ω)|. As the modal solutions are proportional to
exp(iωt) any significantly large imaginary part of ω will cause the solution to grow or decay
exponentially. Any configuration that has unstable modes cannot be recommended, simply be-
cause they are not physically realistic and are also likely to damage the conservation properties
of the model.
It is also unacceptable for a scheme not to capture one or more of the modes, as this shows
that it is unable to handle the equations accurately enough to represent these structures. As
an example, Figure 2.1 shows the dispersion plots for a staggered grid that has been unable to
correctly capture most of the gravity and Rossby modes, and those that are shown are mostly
unstable. The scheme shown in Figure 2.2 has performed slightly better but still has both
missing and unstable modes and therefore is not considered viable. Also note that for the
higher acoustic modes the variation of frequency with mode number has the wrong sign, which
will result in a simulated vertical group velocity in the wrong direction (e.g. Acheson (1990)).
Of the 56 configurations examined here there were only 12 with no missing or unstable
modes. These 12 cases are considered viable options for a vertical scheme, though there is still
a considerable hierarchy within this group. There is at least one viable configuration for all
choices of thermodynamic variable pair. These viable cases are laid out in Table 2.1 using the
notation (w..., ...) to indicate which variables are stored on half-levels with w and which are
staggered with respect to w.
Cases 42, 55, 43, 51 and 52 (e.g. Figures 2.3 and 2.4) simulate both the acoustic and grav-
Vertical Grid Staggering in Isentropic Coordinates 32
Figure 2.1: Frequency ( s−1) plotted against vertical mode number for normal modes of aresting, isothermal, non-hydrostatic atmosphere. The three external modes are the points withvertical mode number 0. The other five sets of modes are, in order of ascending frequency,the internal Rossby, gravity and acoustic modes. Diamonds indicate analytical values. Crossesand asterisks indicate numerical values for the grid and variable configuration shown in theschematic below the dispersion plots. Asterisks indicate that the numerical mode is unstable.This particular vertical scheme has represented the normal modes extremely badly.
Vertical Grid Staggering in Isentropic Coordinates 33
Figure 2.2: Dispersion curves, as in Figure 2.1, but for the configuration shown. Note thatthere are several modes missing from the numerical solution and that several of the Rossby andgravity modes that have been simulated are unstable.
Vertical Grid Staggering in Isentropic Coordinates 34
Category Case ID Grid
1 42 (wz, uvM)M
55 (w, uvσM)Ma
2 44 (wz, uvp)39 (wp, uvM)Mb
43 (wz, uvσ)p
3 51 (w, uvpz)52 (w, uvpσ)53 (w, uvpM)54 (w, uvpE)56 (w, uvzM)
5 23 (wuv, pz)p
26 (wuv, pE)
Table 2.1: The configurations of the 12 viable cases arranged in the categories defined in thetext. Superscripts M and p indicate that the grid only qualifies for this category if the M - orp-form of the equations is used, respectively. Superscript a indicates that this scheme qualifiesfor category 1 only if one degree of freedom in σ is suppressed. Similarly, b indicates that theboundary values of the thermodynamic variables must be suppressed.
ity modes extremely well and slightly overestimate the last half of the Rossby modes. The
remaining seven viable cases each represent almost all the modes very well, only underesti-
mating the higher Rossby modes. Examples of these are given in Figures 2.6 and 2.5.
There are no obvious criteria which guarantee accurate representation of the modes, though
schemes which minimise the averaging in the largest terms of the equations generally perform
best. As in the companion coordinate studies, it appears that the Rossby modes are not well
represented unless the gravity modes are, and that these are not well represented unless the
acoustic modes are. This reflects the greater complexity of physics involved in Rossby waves
compared to gravity waves, and gravity waves compared to acoustic waves.
The dispersion plots enable us to eliminate configurations which have missing or unstable
modes, but not to identify those which have computational modes. These are modes of the
discrete system which do not correspond to any mode of the continuous system, and so have
no physical meaning. Computational modes are often two-grid oscillations in one or more of
the variables of the form ηk = (−1)k∆η for the general variable η, where ∆η is a constant
in the simplest case, but not in general. The interpolated half-level values ηk+1/2 etc. then
all vanish, and so modes like this can be invisible to the rest of the system. In this way they
are allowed to exist by the averagings involved, but additionally they can only occur if there
are extra degrees of freedom in the discrete system. Every discrete variable (e.g. ui, vi, pi)
corresponds to one row of the state vector and therefore one degree of freedom. For every
Vertical Grid Staggering in Isentropic Coordinates 35
degree of freedom there is a solution to the matrix equation, so if there are more degrees of
freedom than there are resolvable modes of the continuous system it is possible that one of the
extra solutions is a computational mode. In order to find these modes it is necessary to compare
the number of eigenmodes with the number of degrees of freedom of the scheme, and then to
examine the structure of any modes which are not accounted for.
Modes such as these, although not directly interfering with the dynamics of a model, can
affect conservation properties and physical parameterisations. A well-known example of a
computational mode of this form is that which occurs when using the Lorenz grid in pressure
coordinates (see e.g.Arakawa and Moorthi (1988), who also discuss how the same extra degree
of freedom is indirectly responsible for the spurious amplification of short waves).
Some of the configurations contain computational modes which have zero frequency and
are just z, p and σ curves, resembling exponentials, with all other variables and tendencies
practically zero. This simply represents a time-independent shift in height of the isentropic
levels. For simplicity we have applied only the boundary condition w ≡ 0 on the boundaries,
whereas in operational use the additional condition that the height perturbation z is also zero
would be applied (as z must be constant to have w = 0). For each extra condition a degree
of freedom is removed and modes such as these are impossible. They are therefore ignored
in the results given here, although this can only be done in cases where z is stored on half-
levels so that this extra condition can be applied on the boundaries. Some configurations have
computational modes consisting of oscillations in z, p and σ which also have z 6= 0 on the
boundaries, and these are similarly ignored.
All of the other computational modes consist of a two-grid oscillation in two or more
of the variables. As an example of these modes consider case 23 (shown in Figure 2.6). On
examining the dispersion plots this configuration appears to give good results, with only the last
three Rossby mode frequencies significantly underestimated. However, this configuration has
three computational modes, and therefore cannot be recommended. One of the computational
modes is a two-grid oscillation in u, v, p and z which cancels in the discrete equations so that
all the tendencies are zero. The frequency of this mode is also zero to within numerical error
and so it does not appear on the dispersion plot. The other two computational modes are two-
grid oscillations in u, v and their tendencies, with all other variables and tendencies effectively
vanishing. They agree closely in both frequency and mode number with the mode number 18
gravity waves and so overlay these modes and appear as single points on the dispersion plots.
In the table the schemes are arranged in categories as in Thuburn and Woollings (2005)
Vertical Grid Staggering in Isentropic Coordinates 36
which are outlined here:
• Category 1: Optimal. No computational modes and distinctly better dispersion rela-
tions than all other schemes.
• Category 2: Near-optimal. No computational modes and still reasonably good disper-
sion relations.
• Category 3: Single zero-frequency computational mode. Generally a two-grid oscil-
lation in one or both of the thermodynamic variables.
• Category 4: Inertial frequency decoupled modes. The only isentropic grid in this
category is (wuvz,M)M and since this does not qualify as a viable scheme this category
is not described here.
• Category 5: The rest.
Out of all the viable cases there are only five which can qualify for categories 1 and 2
(although they do all have one or two of the easily removable z modes described above). Ex-
amples are shown in Figures 2.3 and 2.5. These therefore are the configurations that can be
recommended as best representing the dispersion relations of the normal modes. Cases 42 and
55 clearly have the best numerical dispersion relations. These are practically identical so only
one is shown here (case 55 in Figure 2.4).
The recommendation of two out of these five grids is given under certain conditions. Case
55 supports a computational mode in which all prognostic variables are zero except for the
value of σ nearest the upper boundary. This is the boundary at which z is not defined as zero in
the integration of equation (2.15) and the computational mode corresponds to a non-zero value
of z there. This σ value has no effect on any of the other prognostic variables and is a redundant
degree of freedom. The computational mode is then trivially removed by dropping this degree
of freedom. Similarly, case 39 supports two trivial computational modes since the p values
stored on the two boundaries are never used in the equations; dropping these two degrees of
freedom removes the computational modes.
As mentioned in Section 2.4, when the variable pair is p and σ there is an option as to
whether the integral in equation 2.14 is evaluated by summing half-level values or full level
values. Of the two terms in the integrand it is the σ term which is dominant and the results
show that, in general, if σ is stored on half-levels it is best to sum over half-levels, and if it
Vertical Grid Staggering in Isentropic Coordinates 37
Figure 2.3: Dispersion curves, as in Figure 2.1, but for the configuration shown. This is one ofthe five configurations recommended as best representing the normal mode frequencies. Thereare no unstable, missing or computational modes.
Vertical Grid Staggering in Isentropic Coordinates 38
Figure 2.4: Dispersion curves, as in Figure 2.1, but for the configuration shown. This scheme ispotentially the best of all the configurations recommended here if a trivial computational modeis removed. Case 42 has an identical dispersion relation; these two schemes best represent thenormal mode frequencies out of all 56 schemes tested.
Vertical Grid Staggering in Isentropic Coordinates 39
Figure 2.5: Dispersion curves, as in Figure 2.1, but for the configuration shown. This is anotherof the configurations recommended as best representing the normal mode frequencies. Thepattern of staggering in this case is identical to that in Figure 2.3, the only difference is that thiscase has p as a thermodynamic variable while the other has σ.
Vertical Grid Staggering in Isentropic Coordinates 40
Figure 2.6: Dispersion curves, as in Figure 2.1, but for the configuration shown. From theseplots this would appear to be a good scheme, as only the last three Rossby modes are signifi-cantly wrong. However, this case has three computational modes which are invisible here andso this configuration cannot be recommended.
Vertical Grid Staggering in Isentropic Coordinates 41
is stored on full-levels to sum over these. The only exception to this rule is the case with p
and σ on full-levels and u on half-levels. In this case it is best to sum over the half-levels
as the discrete integral then consists only of complete layer terms of thickness ∆θ, and it is
not necessary to add any approximated half-layer terms of thickness ∆θ/2 at either end of the
domain.
In fact, case 52 is the only viable configuration with p and σ as the thermodynamic vari-
ables. In this configuration σ is stored on full-levels and so the integral is evaluated by sum-
ming the full-level values. There is one computational mode, with p, σ, and therefore also the
implied z, resembling exponentials. On the upper boundary θT we have, by equation (2.14),
zt(θT ) =
∫ θT
θB
zsθ
(σtσs
+ (κ− 1)ptps
)dθ
=
∫ θT
θB
wθdθ using equations 2.8 and 2.9
= w(θT )− w(θB) = 0, (2.43)
i.e. z is constant on the upper boundary. This is satisfied for the computational mode because
its frequency is zero, so there is no time dependence. This mode, however, has a constant value
of z which is not zero, due to the exponential vertical structure. It could, then, theoretically be
removed by imposing boundary conditions such that z = 0. In practise this may be non-trivial
so the scheme is placed in category 3.
All the results shown here are for modes with a horizontal wavelength of 1000 km in a
10 km deep atmosphere. To demonstrate that our conclusions are robust and can be used in a
range of applications, the same test was run with these parameter values altered. In order to
demonstrate the application to features of different horizontal scales the wavelength was set at
100 km and at 4000 km, and for stratospheric applications the vertical domain was extended
to 50 km and then to 80 km. In all these cases the results proved to be robust, in that the five
recommended configurations in categories 1 and 2 remained the best and no other configura-
tions became viable. The dispersion plots for all cases were simply stretched or compressed
versions of their originals.
Vertical Grid Staggering in Isentropic Coordinates 42
2.6 A Hydrostatic System
In the sections above, the non-hydrostatic equations in isentropic coordinates were exam-
ined for direct comparison with the studies in other coordinate systems. However, the non-
hydrostatic isentropic equations are currently not widely used as most applications assume
hydrostatic balance. For the linearised system in isentropic coordinates this is simply the as-
sumption that the w tendency ∂w∂t
can be neglected in the vertical momentum equation (2.7).
This results in a smaller system with only three predicted variables, since the vertical momen-
tum equation becomes a diagnostic linking p and σ. To compare vertical grid staggerings for
hydrostatic models the experiment was repeated using the following equation set in the three
variables u, v and σ:
∂u
∂t=∂M
∂x, (2.44)
∂v
∂t=∂M
∂y, (2.45)
∂σ
∂t= −σs
(∂u
∂x+∂v
∂y
), (2.46)
whereM(= cpT+gz) is the Montgomery potential as before. The diagnostic equation obtained
from the vertical momentum equation is
σ∂ps∂θ
= σs∂p
∂θ(2.47)
so that, since it is the vertical derivative of p that is equated to σ, a grid with p on half levels,
for example, is equivalent to a similar grid but with σ on full levels.
Two different vertical schemes were examined, one with the horizontal velocities u and v
staggered with respect to the density σ, and one without. The unstaggered grid is equivalent
to a grid predicting the pressure p staggered with respect to the horizontal velocities u and v.
This scheme is used in the models of UCLA (Konor and Arakawa (1997), Hsu and Arakawa
(1990)), Colorado State University (Randall et al. (2000)), The University of Wisconsin (Pierce
et al. (1991), Uccellini et al. (1979)) and in the stratosphere model of Gregory (1999), and is of-
ten referred to as the Charney-Phillips grid in isentropic coordinates. This grid has recognised
advantages when using pressure coordinates. In both Fox-Rabinovitz (1996) and the compan-
ion study in height coordinates (Thuburn and Woollings (2005)), the Charney-Phillips θ-p grid
Vertical Grid Staggering in Isentropic Coordinates 43
is found to have the best simulated dispersion relation, and the grid is also recommended by
Arakawa and Moorthi (1988) for its lack of computational modes. At least one of the hybrid
coordinate models above (Konor and Arakawa (1997)) uses this grid for its advantages in the
region of normalised pressure coordinates (p/ps) near the ground.
An alternative, used by Zhu et al. (1992) following Simmons and Burridge (1981), is the
Lorenz grid in isentropic coordinates. This model predicts u, v and temperature T on the same
levels. Due to the equivalence of T and p mentioned in Section 2.2 this can be compared with
grids predicting p on the same levels as the velocities, i.e. with σ staggered from them. This
therefore corresponds to the staggered grid in our hydrostatic test.
In the original experiment a rigid boundary condition (i.e. w = 0) was applied at both
the top and bottom boundaries. This is harder to apply in the current system where w does not
appear in the prognostic equations. In order to rewrite this boundary condition as a condition on
σ and p, the non-hydrostatic equations for σ and p (equations (2.19) and (2.20)) were combined
to eliminate the horizontal velocities, and then this equation was integrated over the entire
domain. The boundary conditions w = 0 at the limits of integration were then applied, and the
result is an integral constraint on the values of p and σ throughout the entire domain:
(1− κ)
∫
D
∂zs∂θ
p
psdθ =
∫
D
∂zs∂θ
σ
σsdθ. (2.48)
The procedure used in this study to evaluate the tendencies ∂u∂t, ∂v∂t
and ∂σ∂t
is as follows.
Define a relative pressure p′ by p = pT + p′, where pT is the pressure at the top of the domain.
At first pT is set to zero. Then the relative pressure p′ at all other levels is calculated from the
σ values by integrating equation (2.47) down from the top of the model. In both of the vertical
schemes σ is stored on full levels, so p is then naturally obtained on half-levels. Equation (2.48)
becomes an expression for pT :
(1− κ)
∫
D
1
ps
∂zs∂θ
dθ pT =
∫
D
∂zs∂θ
(σ
σs− p′
ps
)dθ (2.49)
in which both integrals can be evaluated. This gives pT , and the full pressures are given by
p = pT + p′.
In order to calculate the Montgomery potential M from the pressure we use the diagnostic
relation∂M
∂θ= Π = cp
(p
p0
)κ(2.50)
Vertical Grid Staggering in Isentropic Coordinates 44
which, after linearising about the reference state, becomes
∂M
∂θ= κ
∂Ms
∂θ
p
ps. (2.51)
M is defined on the bottom boundary (denoted by a subscript B) by MB = RTspBps
and then
can be found at all levels by integrating up following equation (2.51). We have p stored on
half-levels and so we evaluate this integral by summing these half-level values, naturally giv-
ing M on full-levels. Up to this point the two schemes tested are identical. For the unstaggered
(Charney-Phillips) grid the calculation is now complete as M is needed on full-levels to evalu-
ate the horizontal tendencies. In the staggered (Lorenz) grid however the horizontal velocities
are stored on half-levels so the M values need to be interpolated from the full-levels first.
The Lorenz grid appears, therefore, to have two extra sources of error that do not arise
when using the Charney-Phillips grid: the M field has to be interpolated in order to evaluate
the horizontal equations, and the u and v fields will have to be interpolated to evaluate the
σ equation. We would therefore expect the Charney-Phillips grid to perform better than the
Lorenz grid in our experiment.
Dispersion plots for the two schemes are shown in Figures 2.7 and 2.8. The Charney-
Phillips grid (Figure 2.8) does indeed simulate the normal mode frequencies better than the
Lorenz grid (Figure 2.7), which significantly underestimates the frequency of the higher Rossby
modes. Both of the schemes are viable, however, in that all the modes are captured and are
stable. (Note that as there are only three degrees of freedom in the continuous equation set
there are only three sets of modes; the eastward and westward gravity modes and the Rossby
mode.)
Furthermore, as the Lorenz grid has the velocities u and v on half-levels there are two
extra degrees of freedom in the model (there are 62 in the discrete equation set but only 60
resolvable modes). These permit the existence of two computational modes, both of which are
two-grid waves in u and v. On the dispersion plot these overlay the gravity waves with mode
numbers 14 eastward and 18 westward. The Charney-Phillips grid, in contrast, introduces no
extra degrees of freedom into the system, and there are no computational modes. On these
grounds we conclude that, for a hydrostatic system, the Charney-Phillips grid is preferable to
the Lorenz grid for the simulation of vertical normal modes in isentropic coordinates.
Vertical Grid Staggering in Isentropic Coordinates 45
Figure 2.7: Dispersion curves, as in Figure 2.1, but for the configuration shown, which isreferred to here as the Lorenz grid for a hydrostatic system in isentropic coordinates. There aretwo computational modes whose frequencies appear exactly overlaying the eastward gravitymode number 14 and westward gravity mode number 18.
Vertical Grid Staggering in Isentropic Coordinates 46
Figure 2.8: Dispersion curves, as in Figure 2.1, but for the configuration shown, which isreferred to here as the Charney-Phillips grid for a hydrostatic system in isentropic coordinates.There are no computational modes and all the frequencies are simulated well, so this grid isrecommended over the Lorenz grid.
Vertical Grid Staggering in Isentropic Coordinates 47
2.7 Discussion
In this study we have identified two optimal schemes for the accurate simulation of the disper-
sion properties of the continuous non-hydrostatic system, though one of these (case 55) comes
with a proviso. Further to this, the three schemes in category 2 can be recommended for hav-
ing relatively good dispersion properties and supporting no damaging computational modes.
The remaining seven schemes listed in the table all support one or more potentially damaging
computational modes.
All five of the category 1 and 2 configurations represent the acoustic and gravity modes
extremely well. The only deviations from the analytic values are in the higher Rossby modes.
It could be argued that, if a background horizontal wind is added to the system, the observed
frequencies would be dominated by the Doppler shift, and so these errors would not be sig-
nificant. However, it could also be argued that the vertical group velocity of these higher
Rossby modes would be misrepresented, and this could affect the simulation of features such
as the Quasi-Biennial Oscillation, which is thought to be partly driven by equatorially trapped
Rossby-gravity modes. This issue deserves further consideration. Cases 42 and 55 (Figure 2.4)
represent all the modes very well, even these higher Rossby modes, and are therefore recom-
mended as the optimal non-hydrostatic schemes for normal mode representation in isentropic
coordinates. The mode frequencies predicted by these schemes are as accurate as those pre-
dicted by the best configuration in other coordinate systems tested in Thuburn and Woollings
(2005).
On examination of the underlying equations it is apparent that the category 1 and 2 schemes
are indeed sensible choices of grid staggering for the variables given. When z is one of the ther-
modynamic variables it seems sensible to store it on the same levels as w because of the close
relationship between the two. Both the viable (p, z) and (σ, z) grids have this layout. When
either p or σ are variables they appear staggered with respect to w, which is convenient because
often their derivatives appear in the tendency equation for w, or vice-versa. Finally p and σ
appear on the same levels as the horizontal velocities, and in the tendency equations for u and
v it is always p and σ that are required rather than their vertical derivatives.
The non-hydrostatic equations studied here are interesting in that they naturally predict z as
a prognostic variable. Several ways were found to eliminate z from the equations. With p and σ
as prognostic variables the height z is calculated at every timestep from the diagnostic integral
in equation (2.14), and one of the resulting schemes qualifies for category 3 as described in
Vertical Grid Staggering in Isentropic Coordinates 48
section 2.5. Another method is to combine z with the temperature T in either the Montgomery
potential M or the total energy E, but none of the resulting schemes are recommended. In
both cases very promising dispersion plots (similar to that in Figure 2.5) are obtained if all
variables are staggered with respect to w. However, these configurations each contain one
computational mode of zero frequency consisting of two-grid waves in the thermodynamic
variables. If these computational modes could be removed somehow then the two schemes
would be valid options.
Elimination of z using equation (2.15) gives the optimal scheme (w, uvσM)M (case 55)
with its proviso that the redundant degree of freedom in σ be removed. This would be easy
in the linear normal mode calculations presented here but might not be in a fully nonlinear
model. Furthermore, the need to solve the nonlinear, nonlocal equation (2.15) at each timestep
would add to the cost of such a scheme. In spite of these potential drawbacks this scheme holds
significant promise, as follows.
It should be stressed that the ability of a vertical scheme to simulate the dispersion relation
is only one of several criteria which should be considered when choosing such a scheme. Other
features are desirable, such as the ability to model advection, to support any physical parame-
terisations and to satisfy conservation laws. One potential benefit of isentropic coordinates is
that, in the absence of diabatic processes, air parcels move along coordinate surfaces and so it
is easier to achieve certain conservation properties in models, such as conservation of mass and
of potential vorticity. The potential offered by case 55 is that it might, in addition, enable other
conservation properties to be satisfied easily, since its mass variable σ is stored at the same lo-
cations as its wind velocities. This feature makes it significantly easier to conserve properties
such as energy and angular momentum which involve a product of density and velocity. When
using height or pressure coordinates it has been difficult to find a scheme which gives both
good dispersion properties and good conservation properties; this scheme has the potential to
do both.
Of the two hydrostatic schemes assessed in Section 2.6 the Charney-Phillips grid (Fig-
ure 2.8) clearly simulates the normal modes best, a reassuring result as this scheme is widely
used in existing isentropic models. Again, the mass and wind variables are stored at the same
levels, with the potential to easily satisfy conservation laws.
Vertical Grid Staggering in Isentropic Coordinates 49
2.8 Summary
In this chapter a wide range of vertical grid schemes in isentropic coordinates have been as-
sessed on their ability to represent the normal modes of the linearised system, in particular the
dispersion relation. The principal findings are outlined below.
• Some general guidelines for constructing vertical grids have emerged. In general, schemes
which minimise the averaging required in the dominant terms of the equations will per-
form well. The number of degrees of freedom in the discretized system is an accurate
guide to the existence of computational modes.
• The non-hydrostatic equations in isentropic coordinates are most readily formulated with
the height z as a prognostic thermodynamic variable. All successful schemes without z
as a prognostic variable have to derive it using a non-local diagnostic equation.
• For the non-hydrostatic system the scheme (wz, uvM)M is optimal for representation of
the mode frequencies and supports no computational modes. The scheme (w, uvσM)Ma
is also optimal under a minor proviso, and has the potential to offer both good dispersion
and conservation properties. The isentropic coordinate is the only one in which such a
scheme has been found. Both of these schemes have dispersion properties as good as
any scheme examined in other coordinate systems.
• Three further non-hydrostatic schemes are identified which support no computational
modes and have dispersion relations only slightly worse than the two optimal schemes.
• For the hydrostatic system the Charney-Phillips grid (−, uvσ) has the best dispersion
properties and also supports no computational modes. This scheme is used in the isen-
tropic model developed in the following chapter.
Chapter 3
The Isentropic Model
3.1 Introduction
The isentropic model used here is an extension to the ground of the stratosphere model of
Gregory (1999), itself a multi-level extension of the shallow-water model of Thuburn (1997).
The model solves the hydrostatic primitive equations on a full global domain and, for the sim-
ulations presented in this thesis, is adiabatic so that DθDt
= 0. The vertical levels are labelled
with the generalised vertical coordinate η which is identically equal to the potential tempera-
ture θ = T (p/p0)−κ at all points above the ground. Each level retains its η coordinate label
after it has intersected the ground, though this should no longer be thought of as the potential
temperature.
The three prognostic variables are the potential vorticity Q, the divergence δ and the isen-
tropic density σ, defined by
Q = f+ξσ, σ dθ = ρ dz, δ = ux + vy (3.1)
where f is the Coriolis parameter 2Ω sinφ, ξ is the relative vorticity vx − uy and ρ is the
density in height (z) coordinates. In this framework the primitive equations are particularly
simple, with PV and σ evolving purely by advection:
σt = −∇.(σv), (3.2)
(σQ)t = −∇. [σQv + k×X] , (3.3)
The Isentropic Model 51
δt = −∇.[σQk× v +∇
(M +
1
2v2
)−X
], (3.4)
where v is the horizontal velocity, k is the unit vertical vector, M = cpT + gz is the Mont-
gomery potential and X represents Rayleigh friction at upper levels. The friction is used to
reduce reflection of gravity waves from the upper boundary; it is optional and is not needed
with the relatively low model top used in the simulations in this thesis.
By formulating the model with (3.3) as a prognostic equation it is hoped that the La-
grangian conservation of PV will be improved. Note, however, that it is σQ rather than Q
which is predicted by this equation, so that the model does not attempt to formally conserve
the PV of an air parcel. The correlation between σQ and Q can change during advection, be-
cause in each case the advective flux is dependent on an upstream interpolation of the respective
field (see Section 3.4). In fact, the PV field is smoothed because of this upstream interpolation,
but it should be remembered that, in any case, perfect Lagrangian conservation is not possible
in any discrete Eulerian model.
Finite difference versions of these equations are solved on a hexagonal-icosahedral hori-
zontal grid, an example of which is shown in Figure 3.1. The grid is nested so that a dense grid
is formed by sub-dividing the elements of a sparse grid, and then projecting the new elements
onto the sphere. For example, a regular hexagon is comprised of six equilateral triangles. Each
of these triangles is sub-divided into four by connecting the central points of its edges, and
new hexagons, each a quarter of the area of the original, can be formed. The first grid in the
hierarchy is the icosahedron; sub-dividing this results in a grid with a pentagon at the centre
of each of the original pentagons, surrounded by five hexagons. Subsequent grids all have four
times as many hexagons as the previous, but still exactly 12 pentagons.
As a result of the projection onto the sphere, the grid cells are not regular hexagons or
pentagons. This introduces error in the horizontal differential operators when the straight line
between the centres of two adjacent cells is not perpendicular to the edge between them. To
overcome this the grid is ‘tweaked’ using the method of Heikes and Randall (1995) to ensure
that all operators converge uniformly to their continuous counterparts in the limit of infinite
resolution. The model can still suffer from small amplitude noise because truncation errors
invariably reflect the structure of the grid. This is most noticeable at the pentagons, whose
surface area is slightly smaller than any of the hexagons. This gives a wavenumber 5 shape to
the noise due to the pentagonal shape of the original icosahedron. Recently an optimisation has
been proposed by Tomita et al. (2002) to ensure a more homogeneous grid; this optimisation
The Isentropic Model 52
has not been implemented here. The magnitude of the noise is small, and the only reason for
concern is the possible triggering of spurious Rossby and gravity waves of wavenumber 5 (see
Appendix A).
In the horizontal, the three prognostic variables are stored at box centres and the winds
at box edges in components normal and tangential to the edge. This is equivalent to the Z
grid recommended by Randall (1994) for simulating the dispersion relation in a geostrophic
adjustment problem better than any of the staggered A-E grids of Arakawa and Lamb (1977).
The vertical grid is of the Charney-Phillips type recommended in Chapter 2, with just the
pressure staggered with respect to the prognostic variables.
At every timestep the wind fields need to be obtained from Q and δ. The streamfunction
ψ and velocity potential φ are found by solving the Poisson equations:
∇2ψ = σQ− f, ∇2φ = δ. (3.5)
These are solved efficiently by a multigrid method. This first solves the equations on the
coarsest grid in the hierarchy, and then uses this solution as a first guess for the solution on the
second grid, and so on. The rotational and divergent wind fields are given by:
v = vrot + vdiv = k×∇ψ +∇φ. (3.6)
The divergence field is stepped using a predictor-corrector method whereby a dummy diver-
gence step gives a new vdiv to use in the final divergence update (for more details see Gregory
(1999)). A ∇4 hyper-diffusion is applied to the divergence field with a damping timescale of
12 hours on grid scale features. No explicit diffusion is applied to either PV or density as these
are updated with a slightly diffusive advection scheme.
3.2 Boundary Formulation
The greatest problem in isentropic modelling is in handling the intersection of model levels
with the ground. One technique for dealing with this is to extend an intersecting level along
the ground as a so-called ‘massless layer’. This approach has proved successful, for example
in the model of Hsu and Arakawa (1990) (see Chapter 1). The isentropic density is given by
(3.1) and the hydrostatic law as σ = − 1g∂p∂θ
. In an isentropic model this represents both the
The Isentropic Model 53
Figure 3.1: The fourth buckyball grid in the hierarchy. The simulations presented here wereproduced on the fifth grid, which has roughly four times the number of cells.
mass contained in a grid cell and the static stability. In a massless layer there is, by definition,
no mass, and yet the infinite PV values implied by this present difficulties in a discrete model
predicting PV.
To avoid such difficulties the model developed here uses two different density variables,
which are identically equal in all cells which are entirely above the ground, but behave differ-
ently when a level intersects the surface. The aim is to separate the two quantities represented
by the isentropic density; one of the densities ensures mass is treated correctly at the surface,
while the other gives a finite value to the static stability in a massless layer so that Q is well
defined. These densities can be defined using the generalised vertical coordinate η by
σ = −1
g
∂p
∂η, σ = −1
g
∂p
∂θ, (3.7)
where the derivative in the definition of σ is assumed to exist in the massless region where
δp, δθ → 0.
The first of the two, denoted σ , is used to keep track of the mass in the model. The value
of σ goes to zero when a model level hits the ground, as in the massless layer approach, since
δp vanishes but δη does not. The mass in a cell of area A and depth δη is therefore given by
σAδη.
The second density, denoted σ, is used to relate the PV to the absolute vorticity ζ (= f + ξ)
in (3.1). In order that Q is finite for non-zero ζ , σ has a non-zero value after a level has
intersected the surface. This is intended to represent the limit of the static stability as both δp
and δθ tend to zero, and is in line with the ideas of Eliassen and Raustein (1968) who extended
their model levels underground by choosing values such that the static stability just below
The Isentropic Model 54
U U
Figure 3.2: The application of Kelvin’s Theorem to the massless region. The circulation arounda closed region U on a surface is given by the area integral over the region of the component ofvorticity normal to the surface, and is independent of time for certain flows. On the right, theregion U is the massless region of an isentropic level that intersects the ground in the tropics;vorticity values in this region define the circulation along a θ contour on the Earth’s surface.See also Hoskins (1991).
the surface was equal to the static stability just above it. These σ values in the ‘massless’ or
‘boundary’ region are used practically for several purposes:
1. To aid in accurate advection of mass (σ ) at the ground (see Section 3.4.3).
2. To evaluate horizontal finite differences near the ground, for example the vorticity term
−∇. (σQk× v) in the divergence equation (3.4) and the horizontal advection of Q in
the PV equation (3.3).
3. To define sensible PV values in the massless region which then yield the correct surface
flow under solution of the Poisson equation (3.5). This can be thought of as an applica-
tion of Kelvin’s Circulation Theorem (see e.g. Acheson (1990)), whereby defining the
vorticity at every point in a massless region U of a given model level defines the circula-
tion around the boundary ∂U , in this case the contour where the level meets the ground.
See Figure 3.2.
For an arbitrary flow, initial σ values in the massless region are created by extrapolating
horizontally along the intersecting level from the above ground values. Both densities then
evolve according to the mass continuity equation (3.2), with the distribution of σ defining the
location of the ground relative to the isentropic levels.
The representation of the ground is critical in the calculation of the Montgomery poten-
tial M , the Laplacian of which gives the horizontal pressure gradient force in the divergence
The Isentropic Model 55
i
^
ss iΜ := Φ + π η
ii−1M i−1/2
i+1/2
^
η = θ
i+1M
η
sΦ = Φ
i+1η
i−1/2η
ηi
i−1η
i+1/2
i
σ = σ
σ = 0^
+ π ∆ η= M
= M − π ∆ η
pi−1/2
i+1/2
i+1/2p
i−1/2
iσ
Figure 3.3: A schematic of the vertical scheme in the model showing the intersection with theground. The equation on the right definingMi applied at the lowest massy cell of each column;the arrow indicates which cell this is for the furthest right column shown.
equation (3.4). This is given by integrating the hydrostatic equation
∂M
∂θ= π(p) := cp
(p
p0
)κ(3.8)
from the surface where
Ms = Φs + πsθs. (3.9)
Here κ = R/cp, Φ = gz is the geopotential, a subscript s represents a value at the Earth’s
surface and π is termed the Exner function. The procedure for calculating M from the mass
field is now described and is sketched in Figure 3.3.
The pressure p is calculated on half-levels by integrating ∂p∂η
= −gσ down from the model
top where p = 0; similarly, p is calculated by integrating ∂p∂η
= −gσ. The surface pressure ps
is given by p on the lowest model half-level, and this in turn gives πs. In any given column the
lowest massy cell is defined as the lowest cell containing a non-zero σ. Labelling this by its
level i (counting down from the top of the domain), the Montgomery potential in this cell Mi
is defined using a finite difference approximation to the hydrostatic equation (3.8):
Mi = Ms + πs(ηi − θs)
= Φs + πsηi with substitution from (3.9). (3.10)
This definition of Mi is identical to that used in the models of Hsu and Arakawa (1990) and
Randall et al. (2000), and has the advantageous property that θs is not needed to calculate
Mi. From here M is calculated at all levels j in the column above using the finite difference
The Isentropic Model 56
0 10 20 30 40 50 60 70 80 90
0
1
2
3
4
5
6
7
8
9
10
x 104
Latitude
Pre
ssur
e (P
a)
Location of model levels
Figure 3.4: Isentropic level locations on a pressure axis for the initial conditions of the baro-clinic wave life cycle experiments of Chapter 4.
approximation to the hydrostatic equation (3.8),
Mj−1 −Mj
ηj−1 − ηj= πj− 1
2, (3.11)
and at all levels j in the column below using the pressure derived from the second density σ:
Mj−1 −Mj
ηj−1 − ηj= πj− 1
2. (3.12)
The vertical levels have been chosen to ensure, as far as possible, a similar vertical resolu-
tion to the 15 level version of the Reading IGCM used in later comparisons. There are 21 full
levels of which typically 19 are fully above ground at the poles and 11 at the equator. The level
values were calculated by demanding an exponential change in spacing ∆η with roughly a 6 K
spacing for most of the troposphere and a top half-level height of 800 K, giving half-levels
every 6 K from 244 K through to 334 K and then at 341 K, 350 K, 366 K, 402 K, 501 K and
800 K. Typical locations of the levels in pressure coordinates are shown in Figure 3.4.
In summary, when an isentropic level intersects the ground it continues along the surface
as a massless layer, with σ = 0. A second density variable σ represents the limit of the static
stability as the ground is approached, and does not vanish. In the massless region σ , Q and δ
The Isentropic Model 57
evolve according to (3.2–3.4), and σ according to
σt = −∇.(σv). (3.13)
3.3 Controlling the Massless Region
The vertical scheme has been designed to avoid contamination of the real atmospheric flow by
the artificial massless region. As shown in Figure 3.3, information only propagates downward
below ground so that the massless values of p and M in a column do not affect the above
ground region of the column. This, however, has an unwanted side-effect since the lack of
vertical feedback means that the massless region suffers from an unusual kind of instability.
Suppose a small positive density perturbation is applied in a grid cell above ground, for
example by the action of a small convergence. This results in a positive perturbation to the pres-
sure field at all levels below, and so to the surface pressure ps. By equation (3.10) this causes
a positive perturbation to Mi, and so to M at all levels above ground through the integration
upward of the hydrostatic equation (3.11). The grid cell originally perturbed is then forced by
a positive divergence tendency −∇2M which acts to reduce the original density perturbation.
This feedback loop is simply the buoyancy effect which comprises the gravity wave feedback
mechanism. In the unphysical massless region, however, this feedback loop does not exist and
so any perturbations applied here will simply accumulate. Perturbations grow polynomially
rather than exponentially, but nevertheless the region is unstable.
In order to control this instability, σ in the massless region is continually relaxed back
towards its initial value σin, and the winds u and v are relaxed towards zero, without changing
the surface fields. The relaxation uses a forward timestep for stability and is given by
σ → σ+τFσin1+τF
, where F (σ, σ) =(σ−σσ
)2
l−1. (3.14)
Here τ is the ratio of the model timestep to the relaxation timescale, and F ensures a smooth
transition at the edge of the massless region, since the strength of the relaxation increases if the
fraction of the cell above (i.e. at level l − 1) that is filled with mass decreases. This relaxation
needs to be strong to control the instability; for the simulations presented in this thesis τ = 12.
Given that such a strong relaxation has to be taken to control the instability, and that the
massless region is, by definition, unphysical in nature, one should ask whether these measures
The Isentropic Model 58
are justified.
The σ values at the edge of the massless region are used in various ways, for example,
to evaluate horizontal finite differences at the ground such as the vorticity term ∇. (σQk× v)
in the divergence equation (3.4). They are also used to improve the accuracy of the advection
scheme near the ground by providing a means to quantify the proportion of a grid box which
lies above ground, and so define exactly where, within the grid box, the ground lies. This is
explained in Section 3.4.3.
The fields at ‘lower’ massless levels, however, are used solely in the solution of the Poisson
equations (3.5). To obtain the winds u and v above ground in an intersecting level these need
to be solved in that level. This is done either by imposing boundary conditions at the ground
∂U or by creating values in the empty region U to reproduce the correct surface winds under
solution of (3.5). Only with this second approach can an efficient multigrid method be used.
The relaxation can then be justified because, unphysical though it is, by choosing to relax the
winds rather than the PV we ensure that the surface fields are unchanged. This, coupled with
the fact that Q and δ are unchanged above ground, means that the relaxation does not affect the
above ground region through the solution of (3.5).
3.4 The Advection Scheme
In the absence of friction the PV and densities evolve solely by advection. This alone means
that the advection scheme is a key part of the model, but in addition it is the advection scheme
that handles many of the technical difficulties introduced by the surface intersection. The
underlying scheme is that of the original shallow-water model of Thuburn (1997) with the two-
dimensional flux-limiter of Thuburn (1996); the only modifications aim to represent the flow
near the ground more accurately.
Note the importance of the surface θ distribution as a part of the PV-θ view; dynamical
features are often represented by perturbations of both internal PV and surface θ. By using this
scheme, with the same winds, to advect both PV and isentropic density (which defines the sur-
face θ) we improve consistency in the evolution of the internal PV and surface θ distributions,
which should improve the representation of dynamical features.
The Isentropic Model 59
3.4.1 PV Advection
The PV is advected using the original scheme, which is both shape-preserving and conserva-
tive. In order to calculate the flow across a cell edge an interfacial value of the PV at the edge
needs to be defined. A two dimensional quadratic function is fitted through the PV values in
an upstream-biased neighbourhood of the edge, and this quadratic is integrated to calculate the
amount of PV which would be swept across the edge during the timestep. The constant value
which reproduces this PV flux over the timestep forms a first guess at the interfacial PV value.
This first guess is then adjusted by the flux-limiter by applying bounds to the edge value to
ensure that no new maxima or minima are created in the PV distribution; the scheme is thus
shape-preserving. For more details see Thuburn (1996).
It is worth mentioning here that it is the quadratic upstream interpolation which makes
the scheme slightly diffusive; the PV distribution is smoothed slightly by using a quadratic fit
to calculate the flux across the cell edge. This implicit diffusion is also present in the density
advection.
3.4.2 Flux Limiter for Density Advection
Both densities σ and σ are advected by the model winds. When a level hits the ground, σ drops
suddenly to zero so the horizontal gradient of σ is discontinuous, and this sharp edge should
be preserved by the advection scheme. The first concern is that no negative mass is produced:
σ should drop to zero (actually an infinitesimally small positive value) but not below. This is
ensured by a modification of the flux-limiter as follows.
Convergence and divergence during the flow evolution will act to create new maxima and
minima in the density distribution so, with this in mind, the outflow bounds imposed by the
flux-limiter to ensure shape-preservation are removed. Instead, an outflow bound is applied
to simply limit the flow of mass from a grid box so that the scheme does not remove more
mass from the box than it contains. In the rest of this subsection a suitable value for this
outflow bound is derived using notation from Thuburn (1996) and following the suggestions of
Eckhnel Kleine (personal communication).
Let qk denote the density (either σ or σ ) in grid cell k and qj the interfacial density at edge
j of the cell. (Note that when used with q,ˆdenotes an edge value rather than any connection
with σ.) The aim is to define a maximum outflow bound for cell k, denoted(q
(out)k
)max
, and
The Isentropic Model 60
to apply this bound to the interfacial values so that
qj ≤(q
(out)k
)max
(3.15)
for all edges j of cell k. Then the mass flow out of cell k during the timestep qout is given by
qout =∑
j
cj qj, (3.16)
where cj is the outflow courant number for edge j given by
cj = max
(0,vnj γj∆t
Ak
)(3.17)
and vnj is the velocity in the outwards normal direction, γj is the length of edge j, ∆t is the
time increment and Ak is the area of cell k. On applying the new outflow bound to each qj
equation (3.16) becomes
qout ≤∑
j
cj
(q
(out)k
)max
=(q
(out)k
)max
∑
j
cj. (3.18)
The scheme will not remove more mass from cell k than it contains so long as
qout ≤ qk (3.19)
and so, by comparison with (3.18), the new outflow bound can be defined as:
(q
(out)k
)max
=qk∑j cj
. (3.20)
This formally prevents the production of negative mass; both densities are indeed strictly posi-
tive in all the simulations in this thesis.
3.4.3 Sub-gridscale Fit Near the Ground
As mentioned earlier it is important to preserve the shape of the sharp change in σ as a level
intersects the ground, yet this feature is bound to be smoothed by the diffusive effect of the
The Isentropic Model 61
x
x0
B C D E
1
Gradient m
u
A
Figure 3.5: A schematic of the σ profile (shaded) in a level intersecting the ground. The methodused to construct the sub-grid fit for the sharp feature in box B is given in the text.
quadratic fit described in Section 3.4.1. In order to prevent this the advection scheme has
been modified to detect the intersection with the ground and to replace the quadratic fit with a
piecewise linear fit if necessary.
This process takes advantage of one of the benefits of the two-density formulation. The
ratio σ/σ gives the fraction of the relevant grid cell which lies above ground, and so it is easy
to clearly define when the cell is fully above ground, i.e. full with respect to σ . Currently the
tolerance is set at 0.1% so that
σ/σ < 0.001 ⇒ the cell is empty,
0.001 ≤ σ/σ ≤ 0.999 ⇒ the cell is partially filled,
0.999 < σ/σ ⇒ the cell is full.
(3.21)
Consider the advection of the shaded σ profile across the edge AB in the one dimensional
example in Figure 3.5. Box A is empty but box B is partially filled so that when the sub-
gridscale distribution of the σ field is constructed as shown the sharp feature lies within box
B. The original scheme would fit a quadratic through the values of σ in boxes A, B and C
to represent this distribution, and the area under this quadratic, integrated between appropriate
limits, would be the mass transferred to box A during the timestep. Clearly the quadratic fit is
not an accurate representation of the sub-gridscale distribution of the mass in box B. Instead
the advection scheme uses a piecewise linear fit with the σ values in boxes B and C defining
the gradient m of the linear fit and the intercept x0, which is then the horizontal extent of mass
within box B. If the wind advects by a distance less than x0 then there is no flux into box A
and so the interfacial value at edge AB will be set to zero. However, if it advects by a distance
x1 as shown, the heavily shaded region of mass should flow into box A. To ensure this the
interfacial σ value at edge AB is set to Muγ∆t
where M is the mass in the shaded region, γ is the
length of edge AB and ∆t is the timestep.
The Isentropic Model 62
0 10 20
0
0.2
0.4
0.6
0.8
1
No Flux−Limiter and No Subgrid FIT
0 10 20
0
0.2
0.4
0.6
0.8
1
Flux−Limiter but No Subgrid FIT
0 10 20
0
0.2
0.4
0.6
0.8
1
Flux−Limiter and Subgrid FIT
Figure 3.6: Results of a 1D test of the sub-grid fit. In each case the dotted line is the initialcondition of a sharp density profile and the solid line is the profile after 400 advection steps.Not all of the 40 cell domain is shown.
The same sub-gridscale fit would also be used to calculate the interfacial value at edge
BC if the wind were reversed to flow from left to right. There is another sharp feature in the
sub-gridscale σ distribution in box E and this is handled in exactly the same way by applying
the above procedure to the variable σ − σ.
Figure 3.6 shows the results of a test of this sub-grid fit using a one-dimensional version
of the advection scheme. A prescribed density profile was advected by a constant wind U = 1
and courant number C = U ∆t∆X
= 12
on a periodic domain of 40 cells of width ∆X = 140
,
where all values are non-dimensional. The sharp initial density profile is dotted, and the solid
lines show the density after 400 advection timesteps when the profile has been advected five
times around the domain. The flux limiter succeeds in preventing the generation of negative
mass, but the sharp edges are diffused by the advection scheme. With the piecewise linear fit,
however, these edges are maintained. This scheme has been applied on the two dimensional
hexagonal grid in an approximate manner as follows. To calculate the interfacial value at a cell
edge, the first and second cells upwind of the edge are identified and used as boxes B and C in
the above routine.
3.4.4 Summary of the Mass Advection Step
The full advection scheme used to advance the two density fields σ and σ by one timestep is as
follows:
1. First guess interfacial σ values are calculated using quadratic upstream interpolation.
2. These first guess values are adjusted to lie within inflow bounds for the downwind box,
The Isentropic Model 63
which are the maximum and minimum values in an upstream biased neighbourhood of
this downwind box.
3. The outflow bound(q
(out)k
)max
is applied to the interfacial σ values as in Section 3.4.2
to ensure that no negative mass is generated.
4. First guess interfacial σ values are calculated for each edge as follows:
• If the first upstream box contains a sharp feature as in boxes B and E in Figure 3.5
then the piecewise linear fit is used.
• If, by (3.21), the first upstream box is full then the σ interfacial value is set to equal
the σ interfacial value.
• If, by (3.21), the first upstream box is empty then the σ interfacial value is set equal
to the (small) σ value in this first upstream box (see Section 3.7).
• If none of the previous three criteria are satisfied then the interfacial σ value is
found using the quadratic upstream interpolation.
5. Inflow bounds are applied to the interfacial σ values as for σ .
6. The new outflow bound is applied to the interfacial σ values as for σ .
7. The interfacial σ values are stored and used for the mass flux in the PV advection step.
8. Both σ and σ are advected using the interfacial values.
3.5 Summary of the Model Timestep
To advance the prognostic variables by one timestep (15 minutes for the horizontal resolution
used here) the following procedure is applied.
1. The densities σ and σ are advected as in Section 3.4.4.
2. The PV is advected as in Section 3.4.1.
3. The Poisson equations are solved using a multigrid method to obtain wind fields from ξ
and δ.
4. The Montgomery potential M is calculated from the mass distribution as in Section 3.2.
The Isentropic Model 64
5. The δ field is time-stepped using a predictor-corrector method and applying a ∇4 hyper-
diffusion.
6. The Poisson equation is solved once more to obtain the new divergent wind.
7. Relaxation is applied to σ and winds as in Section 3.3, only to grid cells which are defined
as empty by equation (3.21).
8. The PV is re-calculated from the new σ and wind distributions.
3.6 Basic Model Tests
In order to test the model, and in particular the surface formulation, some simulations of sim-
ple flow patterns were performed. The focus of these tests was to assess the accuracy of the
numerical surface formulation rather than of the global-scale flow produced.
3.6.1 Solid Body Rotation
The model was initialised in a state of isothermal solid body rotation, which is an exact steady-
state solution of the primitive equations (3.2)-(3.4) defined by
T = T0, u = U0 cosφ, v = 0, (3.22)
where T0 = 220 K is the constant temperature and U0 = 50 ms−1 is the zonal wind at the
equator. This is a steady state if the divergence tendency vanishes. Substitution of (3.22) into
(3.4) shows that this is satisfied providing
M(θ, φ) = f(θ) +1
2(2Ωa+ U0)U0 cos2 φ, (3.23)
where the function f(θ) is given by the hydrostatic law. The initial state is constructed as
follows:
1. Pressure on half levels and σ on full levels are defined as
pj+ 12
= p0
(T0
ηj+ 12
) 1κ
, σj = −1
g
δpjδηj
. (3.24)
The Isentropic Model 65
2. The wind fields are then defined by
Qj =2(Ω + U0
a
)sinφ
σj, δj = 0. (3.25)
3. To satisfy (3.23) the Montgomery potential at the bottom of the model domain is defined
as
M0 =1
2(2Ωa+ U0)U0 cos2 φ+ α, (3.26)
where α = 1.9× 105 m2 s−2 is a suitable constant which determines the mean η value of
the z = 0 surface.
4. The discrete hydrostatic equation (3.12) is then integrated upwards to give the Mont-
gomery potential Mj on all model levels.
5. In order to define the location of the ground in the η domain the σ distribution needs to
be defined so that, in the lowest massy level i of each column, the boundary condition
(3.10) is satisfied. For each column, level i is identified as the highest level for which the
half-level pressure pi+ 12
exceeds the value of the surface pressure ps given by (3.10) as
π(ps) =Mi − Φs
ηi, (3.27)
where Φs = gzs = 0 here for a flat surface.
6. This level i is then the lowest massy level in this column, and is partially filled by
σi = −1
g
pi− 12− ps
ηi− 12− ηi+ 1
2
, (3.28)
with σ = σ in all cells in the column above and σ = 0 in all cells below.
It should be noted that this steady-state solution is rather unphysical; the atmosphere is
isothermal and on the surface (z = 0) the potential temperature increases from equator to pole.
It is nonetheless a useful analytical solution with which to quantify model errors.
The zonal and meridional wind fields at 383 K after 20 days are shown in Figure 3.7 for
comparison with an identical test of the original stratosphere model (described in Section 2.5.3
of Gregory (1999)). The errors are larger in the full atmosphere model; the maximum merid-
ional wind speed is 1.5 ms−1 compared to 0.3 ms−1 in Gregory (1999), and large density errors
The Isentropic Model 66
0 100 200 300
−80
−60
−40
−20
0
20
40
60
80
Longitude
Latit
ude
Zonal wind
45
30
15
45
30
15
0 100 200 300
−80
−60
−40
−20
0
20
40
60
80
Longitude
Latit
ude
Meridional wind
Figure 3.7: Zonal and meridional wind on the 383 K surface on day 20 of the solid bodyrotation simulation. The contour interval is 5 ms−1 for zonal wind and 0.4 ms−1 for meridionalwind.
are present with a drift of up to 5% from the initial distribution. These density errors are prob-
ably the cause of the wind errors since they act to push the flow away from the steady-state
solution, and are discussed in Section 3.7.
3.6.2 Perturbing the Massless Region
In order to test that the real atmospheric flow is not contaminated by the massless region and
its relaxation, the model was initialised in solid body rotation but with a large perturbation
applied to the wind fields in the massless region. A wavenumber 5 perturbation centred on the
mid-latitudes was applied to the wind field in order to force the model at the same wavenumber
as the grid noise (see section 3.1). The perturbation was given by
ξ′ = −5v0
asin2 φ cos 5λ F (σ, σ),
δ′ = 2v0
asin 5λ sinφ cos 2φ F (σ, σ),
where F (σ, σ) =
(σ − σσ
)2
l−1
. (3.29)
Here λ is the longitude, a is the radius of the Earth and the resulting wind fields are shown
in Figure 3.8. This construction ensures a large meridional component as to force flow on
model levels normal to their intersection with the ground; a value of v0 = 75 ms−1 was chosen,
The Isentropic Model 67
0 100 200 300
−80
−60
−40
−20
0
20
40
60
80
Longitude
Latit
ude
Zonal Wind
50
35
15
50
35
20
0 100 200 300
−80
−60
−40
−20
0
20
40
60
80
Longitude
Latit
ude
Meridional Wind
Figure 3.8: Initial wind fields in the massless region from Section 3.6.2. The contour intervalis 5 ms−1 with the zero contour dotted and negative contours dashed.
resulting in a maximum perturbation meridional wind speed of 20 ms−1. As in the relaxation
equation (3.14), the factor F ensures a smooth growth of the perturbation just below the surface.
Provided σ and σ are zonally symmetric, which is certainly true during the relaxation period,
the integral over longitude of the equivalent vorticity perturbation vanishes so that there is no
net surface circulation implied by this perturbation.
In the resulting simulation the perturbation is rapidly removed by the model relaxation.
The global energy and angular momentum decrease significantly during this time as shown in
0 2 4 6 8 10 12 14 16 18 2097
97.5
98
98.5
99
99.5
100
100.5Global Energy and Angular Momentum
Time (days)
Drif
t (%
)
Figure 3.9: Global integrals of total energy and angular momentum for a 20 day solid body ro-tation run with perturbed flow in the massless region. Energy values are marked by circles andmomentum values by crosses, with dashed lines for an integral taken over the whole domainand solid lines for an integral taken over the above ground region only. All quantities are givenas a percentage of their initial value.
The Isentropic Model 68
Figure 3.9 but the above ground values remain constant. Errors in the flow field above ground
are of the same size as for unperturbed solid body rotation; there is no adverse effect on the
above ground flow due to the perturbation of the massless region.
3.6.3 Flow over a Hill
In more complicated atmospheric flows the surface θ distribution would be expected to vary in
both space and time. The aim of this test was to verify that the intersecting isentropic levels
are able to move freely across the ground in response to the evolving flow.
A hill of maximum height 2 km was introduced, centred at φ0 = 45N, λ0 = 90E, and
defined as a surface geopotential distribution:
Φb =ghw2
a2 [(λ− λ0)2 + (φ− φ0)2] + w2, (3.30)
where a is the radius of the Earth, h is the height of the hill and w = 1000 km is a width scale.
The model was initialised in solid body rotation with a flat surface, and the Φb perturbation
added gradually to allow the flow to evolve; i.e. the hill grows during the initial hours of the
simulation. If the growth is rapid, for example, occurring in the first hour of integration only,
then a large gravity wave is produced as would be expected (see Figure 3.10). To reach a steady
state the hill was instead grown slowly over the first 24 hours, and the model run for a further
5 days after this.
An air parcel is able to pass over an obstacle of height h provided the Richardson number
Ri = N2h2
u2 < 1, where u and N are the upstream velocity and Brunt-Vaisala frequency
respectively (see e.g. Gill (1982), p. 293). Using this, and that N 2 = gθdθdz
, we estimate the
maximum elevation obtainable by the parcel as
hmax ≈u√gθdθdz
≈ U0 cosφ√g
300 K6 K
500 m
≈ 1750 m. (3.31)
Only at the very centre of the hill does the height exceed this, so by this estimate most of the
air approaching the hill does have enough energy to flow over it. The final location of the
The Isentropic Model 69
0 100 200 300
−50
0
50
Longitude
Latit
ude
Divergence after 4 hours
0 100 200 300
−50
0
50
Longitude
Latit
ude
Divergence after 10 hours
0 100 200 300
−50
0
50
Longitude
Latit
ude
Divergence after 16 hours
0 100 200 300
−50
0
50
LongitudeLa
titud
e
Divergence after 24 hours
Figure 3.10: The divergence on the 301 K model level shows a gravity wave propagating awayfrom the rapidly-grown hill, and later converging on the opposite side of the globe. The contourinterval is 2× 10−6 s−1 with negative contours dashed.
0 10 20 30 40 50 60 70 80 900
1000
2000
3000
4000Location of isentropic levels after 120 hours
Latitude
Hei
ght (
m)
0 20 40 60 80 100 120 140 160 1800
500
1000
1500
2000
2500
3000
3500
Longitude
Hei
ght (
m)
Figure 3.11: The lower levels (spaced every 6 K) are shown as a function of height z after 5days of the flow over a hill simulation, with the initial level locations shown as dotted lines.The flow is from west to east i.e. from left to right in the lower panel and out of the page in theupper panel.
The Isentropic Model 70
40 60 80 100 120 140 160 18020
25
30
35
40
45
50
55
60
65
70
Longitude
Latit
ude
Flow and height at 265K after 5 days
500
1000
1500
Figure 3.12: Wind vectors on the 265 K model level and height of the level in solid contours.The dashed line marks the location where this model level intersects the ground.
isentropic levels over the hill is shown in Figure 3.11. The model levels rise up over the hill
indicating that the general flow is indeed over the hill as predicted.
Figure 3.12 shows the wind and height of the 265 K level at day 5. Isentropic levels are
compressed in the region above the hill resulting in an anticyclonic anomaly over the hill due
to conservation of Q, as given by (3.1). On the downslope side of the hill a parcel acquires a
southward velocity under the influence of the eastern edge of the anticyclone and accelerates
through a region of low pressure on the lee of the hill. The parcel’s planetary vorticity f
decreases and its momentum carries it south past the point where f = Qσ so that, to satisfy PV
conservation, its relative vorticity ξ increases, turning the flow northwards cyclonically. The
parcel oscillates northward and southward under this vorticity-induced restoration mechanism,
forming a Rossby wave train which is shown in Figure 3.13. The wave is similar to that
predicted by Grose and Hoskins (1979) for flow over an isolated mountain of similar size.
Although it is reassuring to see that the surface topography forces the atmosphere in a
reasonable manner, the aim here is to test the ability of the intersecting levels to move with the
flow. Figure 3.12 shows the location of the 265 K surface intersection as a dashed line which
lies just to the north of the hill under this solid body rotation background flow. It is clear that
the intersection has indeed adjusted to the flow as the meridional meanders of individual air
The Isentropic Model 71
0 50 100 150 200 250 300 350
−50
0
50
Meridional wind after 24 hours
0 50 100 150 200 250 300 350
−50
0
50
Meridional wind after 72 hours
0 50 100 150 200 250 300 350
−50
0
50
Meridional wind after 120 hours
Figure 3.13: The meridional wind field shows the propagation of a Rossby wave train behindthe hill. Contours are drawn every 5 ms−1, with the zero contour dotted and negative contoursdashed.
parcels moving on isentropes imply an oscillation of surface θ with longitude.
3.7 The Surface Divergence Effect
As mentioned in Section 3.3 there is no vertical contamination of the above ground flow by
the artificial massless region. However, there is evidence of a small horizontal effect and this
is the cause of the density errors in the solid body rotation experiment. Figure 3.14 shows the
location of the lowest model levels after 20 days of solid body rotation, and demonstrates that
there is a tendency for mass to collect over the equator, especially noticeable in the lowest level
but also present above. There is a σ error of up to 5% at 383 K, compared to less than 0.1%
at 1240 K in Gregory (1999), and it is likely that this density error is responsible for the larger
wind errors noted in section 3.6.1. The total global angular momentum of the above ground
region increases by almost 0.2% over 20 days due to a 5% increase in the angular momentum
in level 20, the lowest massy level in Figure 3.14.
The development of this distribution is shown in Figure 3.15. In the early stages a negative
σ anomaly develops only in the first empty grid cell encountered when the level intersects the
The Isentropic Model 72
−80 −60 −40 −20 0 20 40 60 800
500
1000
1500
2000
2500
Latitude
Hei
ght(
m)
Height of isentropic model levels
Figure 3.14: The intersecting isentropic levels on a height axis for the solid body rotationinitial conditions (dotted contours) and at day 20 (solid contours), showing the migration ofmass towards the equator.
ground, as defined by equation (3.21). This anomaly grows with time and increases in extent
so that partially filled cells equatorward of the empty cell are affected. Mass of both varieties
(σ and σ) migrates from the region of intersection towards the equator where it collects form-
ing the unphysical spike seen in Figure 3.14. The underlying cause of this clearly erroneous
feature is a persistent anomaly in the divergence field at the intersection, itself a product of the
relaxation.
The divergence tendency is given by equation (3.4) which is reprinted here with the friction
term omitted
δt = −∇.[σQk× v +∇
(M +
1
2v2
)]. (3.32)
The tendency is therefore the sum of three terms which are generally nearly in balance: the first
is the vorticity term, referred to here as the Q term, the second is the horizontal pressure gra-
dient force or the M term, and the third is the kinetic energy or the E term. Figure 3.16 shows
the balance between these three terms and the resulting tendency at the initial conditions, and
after one hour of integration (i.e. four timesteps). The large positive anomaly in the divergence
tendency at the surface intersection persists throughout the run and it is the resulting divergence
which is removing mass from this region.
From Figure 3.16 it is clear that it is the Q term in equation (3.32) that is causing this
divergence tendency. The grid cell in which the anomaly starts is just in the massless region.
According to equation (3.21) the cell is empty but its first equatorward neighbour is not. This
means that the wind at the poleward edge is relaxed towards zero, but that the wind on the
The Isentropic Model 73
−50 0 500
50
100
150
200
250ICs
−50 0 500
50
100
150
200
250After 2 hours
−50 0 500
50
100
150
200
250After 4 hours
−50 0 500
50
100
150
200
250After 6 hours
−50 0 500
50
100
150
200
250After 5 days
−50 0 500
50
100
150
200
250After 10 days
−50 0 500
50
100
150
200
250After 15 days
−50 0 500
50
100
150
200
250After 20 days
Figure 3.15: Snapshots of the latitudinal distributions of σ (solid lines) and σ (dotted lines) inthe lowest model level which contains any mass. This shows the migration of mass towards theequator in this solid body rotation run. The units are kg m−2 K−1.
The Isentropic Model 74
−100 −50 0 50 100
−2
−1
0
1
2
x 10−6 Sum of terms
−100 −50 0 50 100
−2
−1
0
1
2
x 10−6 E term
−100 −50 0 50 100
−2
−1
0
1
2
x 10−6 M term
−100 −50 0 50 100
−2
−1
0
1
2
x 10−6 Q term
Figure 3.16: Divergence tendency contributions from the various terms in equation (3.32) asfunctions of latitude for the lowest massy level, in s−2. The first panel is the sum of the otherthree and is therefore the divergence tendency δt. Values are shown for the initial state as dottedlines, and after 1 hour as solid lines.
equatorward edge is not. The wind is westerly everywhere so the relaxation induces a cyclonic
relative vorticity perturbation in the grid cell, and therefore a cyclonic PV perturbation. The
relaxation also results in a large wind shear ∂u∂y
which is negative in the northern hemisphere,
and it is this which causes the divergence anomaly. The Q term is given by
Q term = −∇. (σQk× v)
= −∇. (σQui) if v = ui
= − ∂
∂y(uσQ) . (3.33)
While there are Q and σ perturbations in the first massless cell, the wind u has a variation
which is both larger relative to its mean value and monotonic with latitude, and so it is this
factor which dominates the Q term, resulting in the large positive divergence tendency seen.
The cyclonic PV perturbation induced by the relaxation is enhanced by the negative σ anomaly
resulting from the divergence. These effects are most noticeable in the lower levels because
these levels intersect the surface closer to the equator where the wind, and therefore also the
The Isentropic Model 75
−100 −80 −60 −40 −20 0 20 40 60 80 100−3
−2
−1
0
1
2
3
Latitude
PV
(P
VU
)
−100 −80 −60 −40 −20 0 20 40 60 80 100−1
−0.5
0
0.5
1x 10
−6
Latitude
Mas
s−w
eigh
ted
PV
(s−1
)
Figure 3.17: Zonal mean PV (top) and above ground mass-weighted PV (σQ) (bottom) in thelowest massy level for the initial conditions (dotted line), after 2 days (dashed line) and after20 days (solid line). Note the large change in above ground mass-weighted PV between days2 and 20 due to the horizontal extension of the PV anomaly.
wind shear, is stronger.
Both the size and extent of the PV and σ anomalies increase steadily with time. As de-
scribed in Section 3.3, there is no buoyancy feedback mechanism in the massless region to
restore the σ perturbation. The horizontal extent of the anomalies increases so that σ and σ de-
crease in the partially filled region as mass is advected away in response to the large divergence
just below the ground. The PV anomaly also spreads into the partially filled region, as shown in
Figure 3.17, and runtime diagnostics show that it is the advection step which causes this. The
flux-limiter in the advection scheme prevents the generation of new PV maxima, but once the
maximum just below the surface is created by the relaxation step this PV anomaly is spread into
the partially filled region by the advection. The divergence anomaly forces a PV flux normal to
the surface intersection, and the quadratic upstream interpolation in the advection scheme acts
to diffuse the below ground PV anomaly into the partially filled region.
The PV anomaly induced in the partially filled region is a significant feature of the above
ground flow. The PV anomalies are large, as can be seen in Figure 3.17, and of opposing sign
in the two hemispheres. In any level the integral of the mass-weighted PV over all cells j
The Isentropic Model 76
(∑j σQ δA δθ
)should be zero since the flow is symmetric about the equator. In general this
is observed in the model with
∑σQ∑ |σQ| = O(10−6) = O(round-off error) (3.34)
in levels which do not intersect the ground. However in the lowest massy level this figure is
O(10−3); this error is larger than pure round-off error.
There is one final manifestation of the divergence anomaly at the surface. Suppose an
amount of σ mass leaks into the massless region that is too small for the cells to be counted as
partially filled, i.e.
0 <σ
σ< 0.001. (3.35)
This mass will then be advected by the wind, the divergent part of which is forced as in the first
panel of Figure 3.16 to give divergence at the surface intersection and convergence towards
the poles. This convergence is also due to the relaxation: at the initial conditions the σ field
is in balance with the wind in solid body rotation, but with the winds relaxed to zero the
Q term in (3.32) is too small to balance the M term. Acting under this divergent field the
mass migrates away from the surface and collects at the poles. This development is shown
in Figure 3.18, and it is clear that the σ values are small. After 20 days these values are still
small but they are larger than 0.001 σ, so that the lowest of these ‘outcropped’ levels at the
pole does count as the lowest massy level for some columns in the calculation of M described
in Section 3.2. This is concerning in its own right but to date no problems have arisen as a
result. The original σ leak could perhaps be prevented in future by altering the algorithm in
step 4 of the advection summary in Section 3.4.4 so that, if the first upstream box is empty
(σ < 0.001σ), the interfacial value is set to zero rather than to this small σ value. This small
modification was not made for the simulations in this thesis.
To summarise, there is a contamination of the above ground flow by the massless values
because the relaxation causes a strong horizontal wind shear which in turn induces a divergence
anomaly at the surface. The contamination is stronger in the solid body rotation flow studied
here than might be expected in more realistic flows due to the strong and persistent surface
wind field. Nevertheless, this process is monitored closely in the simulations presented later.
The Isentropic Model 77
0 20 40 60 800
1
2
3
4
Latitude
Hei
ght(
m)
Day 5
0 20 40 60 800
1
2
3
4
LatitudeH
eigh
t(m
)
Day 15
0 20 40 60 800
0.01
0.02
0.03
0.04
0.05
0.06
Latitude
Hei
ght(
m)
6 hours
0 20 40 60 800
0.01
0.02
0.03
0.04
0.05
0.06
Latitude
Hei
ght(
m)
Day 1
Figure 3.18: Meridional sections of the height of intersecting level surfaces to show the devel-opment of the outcropping at the pole. Even at day 5 the σ mass in the outcropped region isso small that the cells are considered empty. By day 15, however, these cells are consideredpartially filled and the lowest is counted as the lowest massy cell for columns over the pole,and so used to define the surface Montgomery potential by (3.10).
3.8 Summary
The isentropic stratosphere model of Gregory (1999) has been extended to include a surface
formulation. Two densities are used: σ goes to zero at the ground as in the massless layer
approach, while σ has non-zero values used to compute ξ from Q and also to aid in smooth
evaluation of terms near the ground. This massless region is unstable and a relaxation term is
used to control this. The advection scheme has been modified in two ways: the flux-limiter
simply prevents the production of negative mass, and a piecewise linear sub-grid fit more ac-
curately advects the sharp edge in the σ field at the ground.
The model is able to simulate a 20 day period of solid body rotation, albeit with larger
errors than the original stratosphere model. Large perturbations to the massless region do not
affect the above ground flow, and the model is able to realistically simulate the flow over a
large hill. The larger errors seen in the solid body rotation simulation are due to a horizontal
contamination of the above ground flow by surface divergence anomalies induced by the relax-
The Isentropic Model 78
ation term. This effect is considerably smaller in the simulations in following chapters, where
the surface winds are smaller and less persistent. The contamination needs to be monitored
closely and should be reduced in future development of the model.
Chapter 4
The LC1 Baroclinic Life Cycle
4.1 Baroclinic Instability and a History of the LC1 Cycle
The atmosphere of the Earth is a baroclinic fluid, defined as a fluid for which the density at a
given location is dependent on the temperature as well as the pressure. The temperature is then
able to vary on a surface of constant pressure, and the atmosphere can develop a temperature
gradient on the planetary surface in response to the differential solar heating. This implies
a distribution of potential temperature θ in which isentropes high above the ground at the
poles slope down to intersect the surface at warmer regions towards the equator. Air of a high
potential temperature towards the equator is then lower, when viewed in height or pressure
coordinates, than relatively colder air towards the poles, and so there is potential energy in the
basic state which is available for conversion to kinetic energy.
It is useful to define here this notion of available potential energy. Air of a high potential
temperature is essentially more buoyant than air of a lower potential temperature. In an at-
mosphere at its state of lowest possible potential energy, more buoyant air must lie above less
buoyant air, and therefore in this state surfaces of constant potential temperature and pressure
are both uniformly horizontal, i.e. they coincide with surfaces of constant height. There is still
a large amount of potential energy in this state but there is no way to adiabatically extract it
from the system. This is therefore referred to as the unavailable part of the potential energy and
that part gained by adiabatically re-arranging the mass within isentropic layers is the available
part. For a more rigorous and detailed description see e.g. Holton (1992), Section 8.3.
This basic state is unstable to wave-like perturbations which grow into the patterns of
cyclonic and anticyclonic weather systems commonly seen in the mid-latitudes. These are
The LC1 Baroclinic Life Cycle 80
essentially synoptic-scale eddies which act to advect warm air upwards and towards the poles
and cold air downwards and towards the equator. In this way heat is transported poleward
and upwards, contributing significantly to the work of the atmosphere as a heat engine. From
this viewpoint the atmosphere is seen as an engine transporting heat from the tropics, where
incoming solar radiation dominates the outgoing radiation and so the Earth is heated, to the
extratropics, where the outgoing radiation dominates and the Earth is cooled (see e.g. Curry
and Webster (1999)).
The wave acts to flatten the isentropes in the mid-latitudes, so they lie closer to surfaces of
constant pressure, as available potential energy is extracted from the mean flow and converted
into eddy kinetic energy. The wave therefore modifies the original basic state into a more
stable one. Surface temperature contours deform during the wave growth, and the mean merid-
ional surface temperature gradient in the mid-latitudes temporarily decreases. The background
state is therefore more stable and the wave decays. The atmosphere, however, is continually
forced by relative heating in the tropics and cooling towards the poles, enforcing a permanent
temperature gradient, and new systems frequently grow during the decay of their predecessors.
Instabilities of this kind which can only exist in a baroclinic fluid are referred to as baro-
clinic instabilities. They can only be realised with a temperature gradient on pressure surfaces,
and a corresponding vertical structure in the wind field in accordance with thermal wind bal-
ance. Many, more detailed, introductions are given in the standard textbooks, for example
Holton (1992).
A key aim of atmospheric science over the past century has been to improve our under-
standing of this type of atmospheric flow by modelling the growth of wave-like perturbations in
baroclinically unstable flows. Early models were analytic, starting with the pioneering work of
Charney (1947) and Eady (1949). These models represented very idealised situations in which
the zonal mean is independent of latitude and the flow is assumed to be quasi-geostrophic and
takes place on an f or β plane. Such models greatly improved our understanding of baroclinic
instability in the atmosphere. However, to study the more realistic case of solutions to the
primitive equations in spherical geometry demands the use of numerical models. The life cy-
cles studied in this thesis are just some out of perhaps hundreds of numerical wave simulations
which have been presented in the literature; they have, however, evolved from some of the
earliest examples of such simulations.
Hoskins and Simmons (1975) presented a multi-layer model of the atmosphere which
solved the primitive equations on the sphere by representing the prognostic variables as trun-
The LC1 Baroclinic Life Cycle 81
cated spherical harmonics. They demonstrated the use of their model by simulating the growth
of a baroclinic wave on a zonal flow of differential solid body rotation with velocity increas-
ing with height. Simmons and Hoskins (1976) went on to study the waves which developed
on background states consisting of the same solid body rotation but with the addition of two
different mid-latitude westerly jets. Using an initial value technique they obtained the growth
rates, phase speeds and structures of the most unstable linear modes at various zonal wave-
lengths by perturbing the zonal flow and taking the perturbation at the time when its growth
became exponential to be the normal mode. The waves were in general considered too shallow,
and to remedy this the vertical resolution was increased, and a more realistic representation of
the zonal mean structure in the tropopause region was added in Simmons and Hoskins (1977).
Though improved, the linear solutions still failed to represent the observed patterns and
eddy strengths at upper levels, which prompted a study of the nonlinear development of waves
from the same initial conditions. Simmons and Hoskins (1978) left behind the linear initial
value technique and used their fully nonlinear spectral model to generate the first complete
baroclinic wave life cycles. The initial zonal flow was perturbed by a small amplitude distur-
bance in the shape of their previously calculated normal modes and the model was integrated
in time. It was found that the growth rate at lower levels decreased more rapidly than at upper
levels, and so upper level amplitudes did indeed become larger relative to those at the surface
than they had in the linear modes. Barotropic processes were found to bring about a decay of
the wave at later stages and so a well defined life cycle existed.
Waves of zonal wavenumber 6 were found to give deeper disturbances than the waves
of maximum growth rate (wavenumbers 7 and 8), which tended to be shallow in nature and
therefore did not extract as much energy from the basic state. Simmons and Hoskins (1980)
studied various wavenumber 6 life cycles in which the initial conditions differed only in a
barotropic component. One of these featured an added cyclonic horizontal shear, originally
added to enhance warm frontogenesis as suggested by Hoskins and West (1979). The life
cycle, however, displayed strikingly different energetics and was referred to as an ‘anomalous’
case to distinguish it from the ‘basic’ cycles.
A version of the ‘basic’ cycle was studied for evidence of frontal cyclogenesis in Thorn-
croft and Hoskins (1990), where a detailed description of the low level evolution of the wave
is given. Later Thorncroft et al. (1993) re-christened the basic and anomalous cycles as LC1
and LC2 respectively, and extensively analysed the differences between the two. They identi-
fied two extreme types of behaviour which they termed cyclonic and anticyclonic according to
The LC1 Baroclinic Life Cycle 82
whether air turns in the same direction as the Earth’s rotation or in the opposite, respectively. A
key difference between the cycles is that, while the LC1 cycle exhibits both types of behaviour,
the LC2 cycle is dominated by the cyclonic type. The two cycles are characterised by very
different patterns of eddy kinetic energy activity in the later stages and also by very different
synoptic structures and evolutions.
By this stage a picture of the general evolution of such life cycles was developing which
comprised four, more or less distinct phases as described by Thorncroft et al. (1993):
1. Exponential growth of a normal mode perturbation as in the linear theory of baroclinic
instability.
2. Saturation at low levels as nonlinear processes such as the formation of fronts and the
occlusion process slow and eventually stop the growth of the lower wave.
3. Vertical propagation of a Rossby wave, triggered by the low level saturation, which car-
ries wave activity upwards into the jet region in the upper troposphere / lower strato-
sphere. This replaces the baroclinically unstable growth as the dominant cause of eddy
kinetic energy increase.
4. A similar saturation event now occurs at upper levels where nonlinear processes again
act to slow and eventually stop the growth of the wave, which then decays mostly under
the action of barotropic processes.
This is widely referred to as the ‘saturation-propagation-saturation’, or SPS, picture of a
general baroclinic disturbance.
Both life cycles have been used in several important studies over the years, of which the
most relevant to this thesis is Methven (1996). The LC2 cycle is described in Appendix A,
while the rest of this chapter compares the evolution of the LC1 cycle in the isentropic model
described in Chapter 3 with its evolution in a model with a normalised pressure coordinate.
This second model, the Intermediate General Circulation Model or IGCM, has developed from
the original spectral model described by Hoskins and Simmons (1975).
4.2 The IGCM
The model referred to here as the IGCM is more formally named IGCM1 to distinguish it
from later versions which incorporate some representation of the various physical processes at
The LC1 Baroclinic Life Cycle 83
work. Excepting some technical alterations this version of the model has changed little from
the original version of Hoskins and Simmons (1975), which therefore remains the principal
source for a more detailed description of the model.
The IGCM is a spectral model in which the variables are represented in terms of truncated
series of spherical harmonics on a set of discrete levels of normalised pressure p/ps. Here ps
is the surface pressure, which is itself predicted by an integration of the continuity equation.
The quantity p/ps would normally be referred to as the σ coordinate, a term which is avoided
here purely to avoid confusion with the isentropic density σ. For the basic life cycles presented
here the resolution is given by a triangular truncation of T42 in the horizontal and 15 levels in
the vertical, at normalised pressure values of 0.0185, 0.0596, 0.1057, 0.1520, 0.1968, 0.2407,
0.2865, 0.3382, 0.4002, 0.4765, 0.5687, 0.6740, 0.7844, 0.8871 and 0.9673.
The hydrostatic primitive equations in vorticity, divergence, temperature and surface pres-
sure are integrated using a semi-implicit timestepping scheme. The IGCM is a spectral trans-
form model, meaning that the nonlinear terms are evaluated in grid space and then transformed
back to spectral space in order to compute the spectral tendencies. The most significant alter-
ation from the original model is in the vertical scheme. The original mass and energy conserv-
ing scheme has been replaced by one based on that of Simmons and Burridge (1981) so that the
angular momentum is also conserved by the vertical differencing. By restricting the spherical
harmonics used to only every sixth zonal harmonic a six-fold symmetry in longitude is implied
which, when coupled with an assumed symmetry across the equator, means that calculations
need only be performed for one twelfth of the global domain.
This version of the model is dry and the only diabatic process is a horizontal ∇6 hyper-
diffusion term acting on all three prognostic variables. For the basic T42 simulation this dif-
fusion acts to remove features of the shortest retained scale with an e-folding timescale of 4
hours. This diffusion is necessary to avoid the accumulation of energy, and thus noise, at the
grid scale during the cascade of flow features to small scales, which is characteristic of fluid
motion. By acting along normalised pressure surfaces this diffusion mixes air across isentropic
surfaces. This is in direct contrast to the isentropic model in which the mass in each isentropic
level is exactly conserved, and is one of the key processes which gives rise to differences in the
life cycles predicted by the two models.
The LC1 Baroclinic Life Cycle 84
Figure 4.1: A reproduction of Figure 3 of Thorncroft et al. (1993) showing the zonal meanstate of the LC1 and LC2 cycles (left and right, respectively) at days 0 and 7 (top and bottom).Contours show the potential temperature every 5 K and the zonal wind every 5ms−1. Theheavy contour is the 2PV U surface, and the shading indicates the eddy kinetic energy maxima.
4.3 An Overview of the LC1 Cycle in the T42 IGCM
The zonal mean initial conditions have been likened to a western ocean basin wintertime sit-
uation. The initial distributions of potential temperature and zonal wind for the LC1 cycle
are shown in the top left panel of Figure 4.1. The meridional surface temperature gradient at
mid-latitudes is clear, indicating that this region is strongly baroclinic. The isentropes here
slope relative to the normalised pressure surfaces so there is a large amount of potential energy
available for conversion to eddy kinetic energy. The thermal wind balance relation equates the
meridional temperature gradient to the vertical shear of the zonal wind. In agreement with this
the initial conditions display vertical wind shear in the region of sloping isentropes, so that the
zonal mean wind distribution takes the form of a jet which reaches a maximum of 47 ms−1 at
roughly 45N in the vicinity of the tropopause, and drops to almost zero at the surface. To seed
the wave, the surface pressure is perturbed by up to 1 mb in the shape of the most unstable
The LC1 Baroclinic Life Cycle 85
zonal wavenumber six normal mode.
In the initial period of exponential growth the wave more or less retains the shape of
the normal mode perturbation. Fields of all flow variables vary approximately sinusoidally
in longitude and the wave grows much as predicted by the linear normal mode theory, with
the eddy activity concentrated at low levels. The growth in EKE during this stage is due
to baroclinic conversion from available potential energy as in the standard baroclinic growth
theory. Thorncroft et al. (1993), however, showed that this period of exponential growth lasts
only for the first four days of the cycle, and that at the end of this first stage the EKE is less than
a fifth of its maximum value. The EKE evolution in both models is shown later in Figure 4.6.
By day 6 the flow at low levels has departed significantly from the sinusoidal shape held
during the initial growth period, as it undergoes the nonlinear saturation event which comprises
the second phase of the SPS picture of the evolution. The low level temperature distribution
is shown in Figure 4.2 for days 6 to 9. The flow features associated with the nonlinear satura-
tion processes are clearly visible by day 6, and are reminiscent of features observed in mature
cyclone systems. A sector of warm air has moved northward between two regions of colder,
polar air, and strong surface fronts have developed between the air masses. The temperature
gradients are strongest in the cold front which trails out from a deep low pressure region. The
warm sector and associated warm front display a NE-SW tilt, comprising what Thorncroft
et al. (1993) refer to as a bent-back baroclinic zone of inverse meridional temperature gradient.
The warm sector decays during the following days in a manner resembling observed occlu-
sion events. The observed events however are associated with a lifting of the warm sector air
whereas in the simulated life cycle the dominant processes acting are horizontal advection and
diffusion. By day 6 the tip of the warm sector has turned cyclonically to the west, forming a
patch of warm air that is later pinched off into the centre of the low in a feature that has been
termed a seclusion. Both of these features are of a relatively small scale and are thus greatly
affected by the model’s diffusion. Thorncroft and Hoskins (1990) studied the low level evolu-
tion in great detail, concentrating on the possibility of a frontal cyclogenesis event occurring
on the cold front.
By day 8 the low level temperature gradients are weak in the mid-latitudes, indicating
that the baroclinicity in the region has largely been destroyed. The low level saturation event
has stopped the growth of the wave at low levels where it is now in barotropic decay. The
upwards propagating Rossby wave of the third phase of the SPS picture is triggered by this first
saturation event, and Thorncroft et al. (1993) presented the two phases as occurring more or
The LC1 Baroclinic Life Cycle 86
Figure 4.2: Temperature on the level p/ps = 0.9673 in the LC1 cycle simulated by the IGCMat T42 resolution. Contours are drawn every 5 K with the warmest (i.e. southernmost) contourline at 295 K .
The LC1 Baroclinic Life Cycle 87
less concurrently. Eliassen-Palm cross-sections as developed by Edmon et al. (1980) showed
that, above the surface, the dynamics of the life cycle are dominated by the vertical Rossby
wave propagation from days 5 to 7. The EKE growth after day 4 greatly exceeds the growth
up to that point, and the diagnosis of Thorncroft et al. (1993) demonstrated that this growth is
associated, not with baroclinic instability, but with the propagation of Rossby wave activity up
into the strong winds of the upper tropospheric jet.
The evolution of the upper level wave is illustrated in Figure 4.3 by the PV distribution
on the 330 K isentropic surface. The upper wave is clearly still growing on day 7 when the
lower wave has saturated and is in decay. Thorncroft et al. (1993) identified two contrasting
types of behaviour which they termed ‘cyclonic’ and ‘anticyclonic’, and stressed that a key
characteristic of the LC1 cycle is that it exhibits both of these, as is frequently observed in real
systems. Cyclonic wrap-up of air occurs on the poleward side of the westerly jet where there
is strong cyclonic shear of the zonal wind; air on the equatorward side of the jet, in contrast,
wraps up anticyclonically under the influence of the anticyclonic shear there. This anticyclonic
feature is clearly visible from day 7 in the PV field shown in Figure 4.3, but on the poleward
side of the jet the field is dominated by large positive PV anomalies which, while indicative
of cyclonic vorticity, do not exhibit the same process of wrap-up as is seen to the south of
the jet. Thorncroft et al. (1993) chose to illustrate the upper flow by plotting the distribution
of potential temperature on the surface where PV = 2 PVU (= 2 × 10−6 m2Ks−1kg−1), an
approximation to the extratropical tropopause. These fields, shown in their Figure 7, do indeed
show a cyclonic wrapping-up of air polewards of the jet in the location of the positive PV
anomalies in Figure 4.3. This feature is discussed further in Section 4.4. Figure 4.5 of that
section shows a higher resolution version of the life cycle in which the cyclonic wrap-up is
clearly visible from day 6.
The tongues of high PV air in Figure 4.3 which extend south across the jet into the region
of anticyclonic shear are generally referred to as troughs, or as tropopause folds. They mark
regions where stratospheric air with high PV has intruded down into the troposphere. During
days 8 and 9 these troughs fold westward in the anticyclonic shear and are thinned under the
influence of the large scale PV field as described in Thorncroft et al. (1993), Section 6(c). PV
contours are irreversibly deformed as a region of high PV air is cut off from the southern end of
the trough, which weakens as it thins. This ‘equatorward Rossby wave breaking’ event heralds
the decay of the upper level wave in the final phase of the SPS picture. The SW-NE tilt of the
trough means that air moving north towards the jet along the eastern edge of the trough has
The LC1 Baroclinic Life Cycle 88
Figure 4.3: Distribution of PV interpolated to the 330 K isentropic surface during the IGCMsimulation of the LC1 cycle at T42 resolution. Contours are drawn every 0.5 PVU. Note theincrease of the PV maximum; the highest contour is at 7 PVU on day 6 and 8 PVU on day 9.
The LC1 Baroclinic Life Cycle 89
a large westerly velocity component. Momentum is therefore transported from the wave into
the jet signalling the barotropic conversion of EKE into zonal basic state kinetic energy. The
period from day 7 to day 9 is the main period of this barotropic conversion during which EKE
is lost at roughly the same rate as it was generated in days 4 to 7.
The reduction of EKE has stopped by day 10 and the wave has left the atmosphere in a
much more stable state. The horizontal temperature gradient at mid-latitudes has been greatly
reduced, and the zonal mean wind has a much larger barotropic component than the initial
conditions so that the strong westerly winds of the jet extend low down towards the surface.
A strong surface front is still in existence equatorward of the jet, indicating that significant
baroclinicity is still present. The late stages of the cycle show some evidence of a wave-like
disturbance forming in this baroclinic zone but this does not develop into a significant feature.
4.4 A High Resolution Version of the IGCM LC1 Cycle
The life cycles predicted by both the T42 IGCM and the isentropic model (resolution roughly
equivalent to T50) will be compared with a much higher resolution version of the IGCM cy-
cle. This is the high resolution simulation performed by Methven (1996) with 30 levels in
the vertical and a horizontal resolution of T341. Much fine detail is observed in this simula-
tion, with many features reminiscent of those seen in the similar high-resolution life cycles of
Polavarapu and Peltier (1990). Despite differences in the flow at the smallest scales the large
scale evolution of the life cycle is as seen in the T42 cycle.
The low level temperature distribution in the T341 cycle is shown in Figure 4.4. On
comparison with the T42 fields in Figure 4.2, the most striking difference is in the strength
of the surface fronts; clearly in these cycles the horizontal temperature gradient is limited by
the model resolution. The warm sector pushing poleward during days 6 and 7 is of smaller
area in the high resolution cycle, while the cold regions on either side are wider and generally
colder. By day 8 these cold regions have been cut off from the polar reservoir where the air
originated, but in contrast to the T42 cycle they remain large in area and extend equatorward
up to the very edge of the frontal region. This demonstrates the extent to which air masses of
differing temperatures are mixed by the diffusion acting on temperature in the T42 life cycle.
Figure 4.5 shows the PV field on the 330 K isentropic surface in the T341 simulation for
comparison with Figure 4.3. The initial conditions for the life cycle have a maximum PV value
attained not at the pole but at roughly 60N. In this figure, the cyclonic wrap-up poleward of the
The LC1 Baroclinic Life Cycle 90
Figure 4.4: Temperature on the lowest model level from the T341 LC1 cycle of the IGCM.Colours are as in Figure 4.2 with contours again every 5 K from 295 K.
The LC1 Baroclinic Life Cycle 91
Figure 4.5: PV on the 330 K isentropic surface from the T341 LC1 cycle of the IGCM. Con-tours are drawn every 0.5 PVU and colours are identical to Figure 4.3.
The LC1 Baroclinic Life Cycle 92
jet is clearly seen as filaments of this maximum PV air wrap up with filaments of lower PV air
from the surrounding regions. In the T42 cycle this is inaccurately represented as an increase
in the value of the PV maximum. The troughs of high PV air which extend equatorward
demonstrate the same qualitative behaviour as at low resolution but in an exaggerated manner.
They have thinned more and are sheared further to the west by the mean flow. Air of a higher
PV value is cut off from the end of the trough; this air has clearly been advected from the
region of the PV maximum at 60N. Lying south of the zonal jet this high PV anomaly remains
roughly stationary as a small, intense cyclone event which, by day 9, can be seen to induce a
cyclonic turning in the remnants of the trough.
4.5 Initial Conditions for the Isentropic Model Simulation
In order to perform a simulation of the LC1 life cycle using the isentropic model developed
in Chapter 3, the initial conditions from the IGCM run were interpolated onto the isentropic
model grid. This initial state comprises both the zonal mean background state and the normal
mode surface pressure perturbation. The following algorithm was used to transform the data
from an initial output of the T42 IGCM:
1. The IGCM output is first processed using the Tradv program written by John Methven to
give the meridional wind, zonal wind and temperature on normalised pressure surfaces,
and the surface pressure, all on the model’s latitude-longitude spectral transform grid.
The IGCM predicts vorticity and divergence on normalised pressure surfaces, so it is
important to interpolate the winds onto the isentropic model levels and then re-calculate
the vorticity and divergence, since these derivatives should be taken along the isentropic
surfaces.
2. Due to the symmetry assumptions of the IGCM cycle (see Section 4.2), this data only
covers one twelfth of the globe. This patch is simply copied to cover the sphere, assuming
symmetry about the equator.
3. The pressure on normalised pressure levels is readily obtained using the surface pressure,
and is then interpolated onto the isentropic model half levels ηh by assuming that ln p is
a linear function of ln η. This gives the pressure ph associated with the isentropic model
density σ .
The LC1 Baroclinic Life Cycle 93
4. A second pressure ph is defined by ph = min(ph, ps), which will be associated with the
second isentropic model density σ . Values of the two pressures in the first η level below
the IGCM domain are chosen by extrapolating from the IGCM data assuming again that
ln p is a linear function of ln η. These pressure values are used to give density values
in grid boxes which intersect the ground. At this stage the pressure ph is given dummy
values in the massless region; σ values here are chosen directly later.
5. The two densities σ and σ are then calculated from the pressures according to
σ = −1
g
∆ph∆ηh
σ = −1
g
∆ph∆ηh
. (4.1)
The definition of ph then gives σ = 0 in the massless region. Massless cells are given
a σ value by simply copying σ horizontally at constant longitude along a level from
the intersection with the ground. This definition of massless σ is unique because of the
imposed symmetry about the equator, and is designed so that σ represents the limit of
−1g∂p∂θ
within the level as it becomes massless.
6. Zonal and meridional velocities u and v are interpolated onto η levels in the same manner
as the pressure. In the region outside the IGCM domain, i.e. the surface intersection and
below, the winds are calculated from the mass field by assuming thermal wind balance.
This is given in η coordinates by eliminating the Montgomery potential M from the
hydrostatic equation∂M
∂η= π(p) (4.2)
and the equation of geostrophic balance
−∇ηM = fk× v (4.3)
to give
f∂v
∂η=cpη
∂T
∂x, −f ∂u
∂η=cpη
∂T
∂y. (4.4)
7. The PV and divergence are then calculated on η levels using centred differences.
8. At this point the prognostic variables Q, δ, σ and σ have been obtained on η levels but
still as fields on a regular horizontal grid. The final step then is to interpolate the fields
horizontally onto the hexagonal grid.
The LC1 Baroclinic Life Cycle 94
The initial conditions generated by this technique proved to be smooth and balanced
enough for a successful simulation of the LC1 cycle. This was not the case for the LC2 cycle,
however, for which the intermediate latitude-longitude grid had to be refined in order to reduce
imbalances in the interpolated fields. This is described in Appendix A.
4.6 Overview of the LC1 Cycle in the Isentropic Model
The isentropic model of Chapter 3 was run from the initial conditions as constructed above.
The model parameters were identical to those used in the simulations shown in Chapter 3, and
are summarised briefly here. Horizontal grid number 5 was used, which is roughly equivalent
to a spectral truncation of T50. There were 21 full levels in the vertical, of which on average
15 were above the surface. The relaxation in the massless region was exactly as described
in Section 3.3, with a timescale of 30 minutes, and the only explicit diffusion was a hyper-
diffusion acting on the divergence field, with a timescale of 12 hours on grid-scale features.
Despite the strong relaxation employed in the massless region, an instability developed,
and the model crashed during the tenth day of the life cycle. The instability is limited to the
massless region and appeared only a few hours before the end of the integration; there was no
effect on the above ground flow up to and including day 9. This is examined in more detail in
Section 4.8.
Figure 4.6 shows the evolution of the global average EKE for the LC1 cycle in the isen-
tropic model and for the T42 IGCM. The isentropic model has completed much of the decay
phase of the life cycle by the time it crashes, indicating that the terminal instability is not likely
to occur in response to forcing by any above ground feature of the wave evolution.
In general, the EKE growth is slower in the isentropic model than in the IGCM, and the
maximum attained is lower. The growth stage, however, has a longer duration and the de-
cay rate is also slower than in the IGCM. This could perhaps indicate that over the cycle the
two waves exert similar amounts of work in terms of transporting heat polewards, though this
cannot be decisively inferred from this data.
The lower panel of Figure 4.7 shows the zonal mean structure after 7 days of integration
of the isentropic model. Isentropes at mid-latitudes are closer to horizontal as available po-
tential energy has been removed from the basic state by the wave, and baroclinicity has been
destroyed. In line with thermal wind balance the zonal mean jet extends down towards the
surface with a reduction of vertical wind shear in the mid-latitudes. In general the jet by day 7
The LC1 Baroclinic Life Cycle 95
0 5 10 150
2
4
6
8
10
12x 10
5
Day
Eddy Kinetic Energy (Jm−2)
Figure 4.6: Eddy Kinetic Energy (EKE) averaged over the Earth’s surface during the evolutionof the LC1 cycle. The isentropic model is shown as a solid line, and the IGCM dashed.
is slightly weaker and broader than that predicted by the T42 IGCM.
Figure 4.1 shows the same picture for the IGCM life cycle. One of the clearest differences
is in the distribution of isentropes intersecting the surface. The IGCM zonal mean shows a
surface cold region which is isolated at mid-latitudes by warm air on its poleward side, while
the isentropic model does not show this warm region, at least in the zonal mean. Most of
the isentropes below 300 K in the isentropic model cycle intersect the surface close to 20N,
indicating that the surface front is stronger than in the IGCM cycle.
Further understanding of these differences is obtained by examining the low level temper-
ature distribution. Figure 4.8 shows the evolution of the temperature predicted by the isentropic
model interpolated onto the p/ps = 0.967 surface for comparison with the IGCM equivalents
shown in Figures 4.2 and 4.4. Note that to plot all the fields from the isentropic model which
are shown in this thesis the data was first interpolated from the hexagonal grid onto a regular
latitude-longitude grid. The general behaviour is the same as in the IGCM cycle; warm sectors
extending poleward turn cyclonically while the cold air moving equatorward turns anticycloni-
cally. There are, however, some differences.
The warm sectors in the isentropic model life cycle appear wider than their counterparts
in the IGCM. In the IGCM at both low and high resolution these warm sectors decay to leave
The LC1 Baroclinic Life Cycle 96
0 10 20 30 40 50 60 70 80 90
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
Initial Conditions
p/p s
0 10 20 30 40 50 60 70 80 90
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
Day 7
Latitude
p/p s
Figure 4.7: Zonal mean sections on a normalised pressure axis for the LC1 cycle simulatedby the isentropic model. The black lines are isentropes corresponding to the isentropic modelhalf levels, which are spaced every 6 K from 298 K (dotted) to 341 K (dashed). The slopingred line is the 2 PVU surface, and the other coloured contours show the zonal wind field at5 ms−1 intervals, with the zero contour dotted and negative contours dashed. All fields arezonal means.
The LC1 Baroclinic Life Cycle 97
Figure 4.8: Temperature from the isentropic model LC1 cycle interpolated onto the surfacep/ps = 0.967 for comparison with Figures 4.2 and 4.4. Contours are drawn every 5 K from255 K to 290 K.
The LC1 Baroclinic Life Cycle 98
cut-off regions of warm air near 30N. The cold regions equatorward are cut-off in the same
way. In the isentropic cycle, however, while warm cut-offs are visible by day 9, these fea-
tures are significantly weaker than those in the IGCM cycles. It is supposed that, in the T42
simulation at least, this cutting-off process occurring in the IGCM is largely attributable to the
action of the model diffusion on the thin filament of warm air linking the high-latitude warm
sector to the equatorward warm reservoir from which it originated (see Methven (1996)). At
T341 resolution the warm filaments visible on day 6 are very thin and are bounded by large
temperature gradients on either side, so it is probable that the diffusion has the same effect
here. The diffusion essentially transports air across isentropes, a process which cannot occur
in the isentropic model.
By day 9 the temperature field in Figure 4.8 appears disorganised in that the sixfold sym-
metry in longitude has been lost. This is perhaps to be expected given that the wave at this time
is well into the decay phase and the basic state is significantly more stable than at earlier times.
The flow is not, therefore, dominated by the strong, wavenumber 6 normal mode to the extent
that it was before. When comparing the output of the two models it should also be remembered
that the IGCM cycle is, by definition, symmetric in longitude (see Section 4.2).
In Figure 4.8 the low-level front separating the warm and cold regions does not appear to
be significantly stronger than in the T42 IGCM cycle. However, this is not the case, and this
misleading impression serves to illustrate that care must be taken when interpreting the output
from the isentropic model. Figure 4.8 was created by interpolating the temperature from the
isentropic model onto a normalised pressure surface assuming that ln η is a linear function of
ln p. In the isentropic model, however, the strength of a front is represented by the distribution
of model levels which can become infinitesimally close to each other. By simply interpolating
onto a regular grid on a normalised pressure surface the temperature gradient is limited by the
horizontal grid scale and the advantage of the isentropic coordinate is lost. The details shown
in Figure 4.8 in regions of strong temperature gradients should not, therefore, be trusted.
An alternative is to view the surface potential temperature distribution by plotting the inter-
section of each model half-level with the surface as a contour line drawn where p = (1− δ)psfor some small constant δ. Results are shown in Figure 4.9 for days 7 and 9. The temperature
gradient at the front is indeed increased when compared with that in Figure 4.8. The distribu-
tion to the north of the front is much the same, however, indicating that the interpolation used
to produce Figure 4.8 is valid where isentropes are closer to horizontal. The surface fronts are
studied in more detail in Section 4.9.
The LC1 Baroclinic Life Cycle 99
Figure 4.9: Surface θ from the isentropic model, obtained by plotting the line where p =0.995ps for all intersecting model half-levels.
The upper level wave is illustrated in Figure 4.10 by the distribution of PV on the 331 K
isentropic level for comparison with Figures 4.3 and 4.5. The cyclonic wrap-up poleward of the
jet is visible here, which is an improvement over the T42 cycle. As discussed in Section 4.3, the
T42 cycle in this region suffers from a spurious increase of the PV maximum near 30N. The
opposite effect occurs in the isentropic life cycle where the maximum is spuriously reduced. In
the isentropic model the only process affecting the PV above the surface is the advection. It is
therefore certain that this spurious reduction of the PV maximum is due to the diffusion which
is implicit within the advection scheme, arising because a quadratic upstream interpolation
is used to determine the interfacial PV value. Thuburn (1995) tested the advection scheme
used in the isentropic model, estimating that the diffusive effect is roughly equivalent to a
∇4 diffusion operator. Furthermore, an effective resolution was calculated as the length scale
at which ‘spreading by advection balances the generation of small scales by straining’, and
the fact that this is similar to the grid scale was taken to imply that the cascade of potential
enstrophy to small scales, and its subsequent dissipation, is well represented by the scheme.
The troughs of high PV air which turn anticyclonically on the equatorward side of the jet
are broader and weaker in the isentropic model, though they do extend to the same latitude as
in the IGCM cycles. On day 9 the anticyclonic shear of the troughs is weaker than in either of
the IGCM cycles. It is most likely that the implicit diffusion is also responsible for the broad
nature of the troughs in the isentropic model. The high PV air extends south from the polar
reservoir through the region of strong westerly wind in the jet centre. The jet advects the high
The LC1 Baroclinic Life Cycle 100
Figure 4.10: PV on the 331 K level of the isentropic model during the LC1 cycle. Contoursare drawn every 0.5 PVU from 0.5 PVU to 6 PVU, and the colours are identical to Figures 4.3and 4.5.
The LC1 Baroclinic Life Cycle 101
PV trough in the direction perpendicular to its orientation; the trough, therefore, is bound to be
broadened by the implicit diffusion. The reduction in the PV maximum discussed above is due
to the implicit diffusion and is a significant feature of the PV evolution. It must, therefore, be
concluded that the trough is also significantly affected by the implicit diffusion.
As both the troughs and the ridges in between are eroded into each other, the edge of the
polar reservoir of high PV air is essentially broadened. The PV gradient is weakened which
agrees with the weaker and broader zonal jet seen in the isentropic life cycle. A weaker, broader
jet has less cyclonic shear to the north and less anticyclonic shear to the south, corresponding
to a reduction in the cross-jet vorticity gradient. It is within the anticyclonic shear south of
the jet that the troughs turn dramatically in the IGCM life cycles. The effect of the weakened
shear in the isentropic model compared to the IGCM is to reduce the anticyclonic turning of
the troughs, as seen.
A picture thus emerges of an underlying difference between the life cycles predicted by
the two models. In the isentropic model the PV is diffused on isentropes resulting in the
weaker, broader nature of features such as the jet and the troughs. In the IGCM, however,
PV maxima are increased, resulting in stronger PV gradients and correspondingly stronger,
narrower features. Further evidence to support this theory is given in the following section.
This explains more of the differences between the zonal mean states of the life cycles, as
predicted by the two models (Figures 4.1 and 4.7). Both zonal means show a front of strong θ
gradients near 60N in the lower troposphere, with low-level easterlies to the north. However,
in the isentropic model these features are again weaker and broader than in the IGCM. This is
consistent with the erosion of the strip of high PV at this latitude in the isentropic model life
cycle (seen clearly at 331 K in Figure 4.10), which results in weaker wind shear. In the T341
IGCM life cycle (Figure 4.5) the strip of high PV values has clearly been well maintained.
Neither of the two models exactly reproduces the Lagrangian conservation property of the
PV. The truly adiabatic evolution of the life cycle must lie somewhere between the two. A
comparison of the relative error in the conservation of PV is given in Section 4.12.
4.7 The IGCM LC1 Cycle with∇4 Hyper-diffusion
Thuburn (1995) estimated the diffusive effect of the isentropic model’s advection scheme to
be roughly equivalent to a ∇4 hyper-diffusion. To support this, and to demonstrate that the
difference in dissipation explains most of the differences between the two models, the T42
The LC1 Baroclinic Life Cycle 102
0 5 10 150
2
4
6
8
10
12x 10
5
Day
Eddy Kinetic Energy (Jm−2)
IGCM ∇6
IGCM ∇4
Isentropic model
Figure 4.11: Eddy kinetic energy evolution, as in Figure 4.6, but now for all three simulations.
IGCM life cycle was repeated with the hyper-diffusion changed from∇6 to∇4 (while keeping
the diffusion timescale fixed at 4 hours).
Figure 4.11 shows the EKE evolution of the LC1 cycle, as in Figure 4.6, but with the
IGCM∇4 run added. The EKE evolution in the isentropic model is indeed much closer to that
of the ∇4 run, both in the timing, and magnitude, of the maximum. Figure 4.12 shows the PV
field at 330 K for days 8 and 9 of the IGCM ∇4 life cycle. Here the troughs of high PV air are
broader than in the ∇6 simulation (Figure 4.3), they do not shear to the west as dramatically,
and they show no sign of being cut-off from the polar reservoir. This behaviour is also closer to
that of the isentropic model than it is to the ∇6 run. Note that the spuriously large PV maxima
near 60N are still a major feature of the∇4 life cycle.
The life cycle predicted by the isentropic model is, therefore, very similar to that predicted
by the IGCM with ∇4 hyper-diffusion, which is in agreement with the estimate of Thuburn
(1995). However, this does not mean that the diffusive effect of the advection scheme is identi-
cal to a∇4 hyper-diffusion. For example, if the velocity is zero then there will be no advection,
and hence no diffusion. The ∇6 version of the IGCM life cycle is the standard, as studied by
Thorncroft et al. (1993), and so it is this one with which the isentropic model is compared in
the following sections.
The LC1 Baroclinic Life Cycle 103
Figure 4.12: PV on the 330 K isentropic surface, as in Figure 4.3, but for the IGCM ∇4 lifecycle. Only days 8 and 9 are shown.
4.8 The Terminal Instability
The isentropic model cycle crashed after just over 9.5 days of integration due to an instability
in a massless grid cell, where a large gridpoint PV spike rapidly developed. The location of
this cell is shown in Figure 4.13 a few timesteps before the model crashed. In the horizontal the
cell is near to the point where the warm and cold fronts meet. This is most likely a coincidence,
but the proximity of the front in general is relevant.
The vertical cross-section in the second panel of the figure shows that the cell lies entirely
within the massless region. The cell has, in fact, been empty with respect to σ for the entirety
of the run, and the cell directly above was also empty until near the end of the cycle. Between
days 8.5 and 9 some mass was advected into this upper cell from the north as the front moved
southward (signalling the spread of the surface cold region from the north). The smoothing
function F of equation (3.14) then decreased in the lower cell, resulting in a weaker relaxation
there which enabled the instability to grow in the manner described in Section 3.3.
It can therefore be concluded that the model crashed due to the unstable nature of the
massless region, and not in any direct response to a development of the interior flow. Further-
more, the interior flow cannot have been affected in any way at least until after day 9 when the
gridpoint PV spike began to grow. In fact, the surface θ and mass fields shown in Figure 4.13
appear unaffected by the instability even at this late stage. This is as might be expected since
The LC1 Baroclinic Life Cycle 104
0 20 40 60 80 100
0
10
20
30
40
50
60
Longitude
Latit
ude
0 9 18 27 36 45 54 63 72 81 90
250
256
262
268
274
280
Latitude
Isen
trop
ic M
odel
Hal
f−Le
vels
Figure 4.13: The location of the unstable grid cell in the isentropic LC1 cycle plotted on ahorizontal plane (left) and a meridional section (right). The flow is represented in the horizontalplane by the surface θ distribution. In each case the cell is marked by selected PV contourswhich are coloured. In the right-hand plot the solid black contours show the locations of theIGCM p/ps levels to indicate the mass field and the dashed black line indicates the location ofthe surface, plotted as the p = (1− 10−5)ps contour.
the model is, after all, formulated to prevent contamination of the above ground flow by the
artificial massless region.
Note that the precise timing and location of the instability is sensitive to the choice of
machine and of compiler, indicating that there is some sensitivity to the truncation errors. In
a run performed on a different machine, with a different compiler, the instability occurred at
roughly the same time but in a grid cell at the surface, rather than in an entirely massless cell
as in the run described in this chapter.
The instability could be addressed in future development by attempting a smoother, or
more gradual implementation of relaxation with depth. Preliminary attempts to do this have
been made by applying a gradual shift from relaxation at depth, to a slow evolution or a dif-
fusion near the surface. While the resulting schemes have been as successful as the standard
relaxation, there has not been a significant improvement in stability. This has not been pur-
sued further; the intention instead is to redesign the surface formulation so that relaxation is
unnecessary (see Chapter 6).
4.9 Surface Fronts
There has been significant interest within the atmospheric science community in the question
of whether the strength of atmospheric fronts has any limitation in the theory of inviscid fluid
dynamics. Hoskins and Bretherton (1972) recognised that the quasi-geostrophic equations are
The LC1 Baroclinic Life Cycle 105
invalid for flows with gradients in temperature and velocity comparable to those observed in
fronts. Accelerations tangential to the front were considered important, and they developed a
semi-geostrophic (SG) model in which geostrophic balance is assumed only in the cross-front
direction. In this model Hoskins and Bretherton showed that any true discontinuity in θ can-
not form first in the interior of the fluid, but must do so at the boundary. The model was then
solved analytically for a variety of forcing mechanisms, showing that there is indeed a tendency
to form discontinuities on the boundary in finite time. Cullen and Purser (1984) extended this
model past the formation of a frontal discontinuity by assuming that the Lagrangian conserva-
tion law form of the SG equations remains valid for the majority of fluid parcels. In these SG
solutions a discontinuity formed first on the boundary, but then extended into the interior of the
fluid.
Shapiro et al. (1985) reported on the observation of an atmospheric front over Boulder,
Colorado in which the majority of the temperature gradient occurred over a horizontal scale
of just 200 m. Motivated in part by this, Gall et al. (1987) simulated deformation frontoge-
nesis in a high-resolution, non-hydrostatic primitive equation model with no explicit diffu-
sion (although they do acknowledge the presence of implicit smoothing within their difference
schemes, and an explicit smoothing near the boundaries to reduce wave reflections). The min-
imum horizontal scale of their fronts was never less than a few kilometres, and vertical resolu-
tion was shown to be the limiting factor in the frontal gradient; no natural limiting process was
found. For further investigation, see also the study of Snyder et al. (1993).
Garner (1989) used a fully Lagrangian numerical model of the 2D hydrostatic primitive
equations to simulate a similar deformation front. The grid elements moved to follow the
flow while conserving both their volume and PV exactly, except for time-differencing errors.
A frontal singularity in finite time was demonstrated by the co-location of two model grid-
points on the boundary, after which the model becomes invalid. Garner’s simulations therefore
support the theory of Gall et al. (1987) that the strength of frontal gradients has no inviscid
limitation.
In a similar study, Fulton and Schubert (1991) took advantage of the Lagrangian na-
ture of the isentropic vertical coordinate to model surface frontogenesis, combining it with
a geostrophic coordinate in the horizontal in a SG model. This model also produced a true dis-
continuity in finite time as the transformation from geostrophic to physical space broke down
and θ became multi-valued at some point on the surface.
The difference in frontal strength shown by the IGCM in Figures 4.2 and 4.4 is in agree-
The LC1 Baroclinic Life Cycle 106
60 70 80 90 100
0.4
0.5
0.6
0.7
0.8
0.9
1
Isentropic Model
p/p s
Longitude60 70 80 90 100
T42 IGCM
Longitude
60 70 80 90 100
0.4
0.5
0.6
0.7
0.8
0.9
1
T213 IGCM
p/p s
Figure 4.14: Cross sections of the surface fronts at 35N on day 7.5 in the isentropic model, andin the IGCM at T42 and T213 resolution. Isentropic surfaces coincident with the half-levels ofthe isentropic model are shown.
ment with Methven (1996) that the frontal gradient in the LC1 cycle is limited by spectral trun-
cation and the model’s hyper-diffusion. It is therefore of interest to examine the frontal gradient
in the isentropic model in more detail. Figures 4.8 and 4.9 highlight that care must be taken in
interpreting the output of the isentropic model. In the rest of this section the fronts are shown
by plotting the pressure and height of the model half-levels. The only interpolation needed is
to translate pressure from the hexagonal horizontal grid onto a regular latitude-longitude grid.
Figure 4.14 compares the fronts predicted by the isentropic model with those predicted by the
IGCM at both high and low resolutions. The fronts in the isentropic model are indeed much
more similar to those in a T213 simulation than those in the T42 life cycle are. At the cold front
in particular a very strong surface θ gradient occurs, as several isentropic model levels intersect
the surface very close together. It is clear that one of the main potential benefits of an isentropic
approach has been realised; a sharp front has been well represented by extreme deformation
of the model levels, which are squeezed tightly together along the front. The question, then, is
how tightly?
Figure 4.15 shows close-ups of the surface front at days 6 and 8. All of the isentropic levels
become massless when they intersect the surface; a frontal discontinuity is predicted when two
The LC1 Baroclinic Life Cycle 107
0 20 40 60 80
0
0.2
0.4
0.6
0.8
1
p/p s
Latitude
Day 8
27 31.5 36 40.5 45 49.5 54 58.50
200
400
600
Hei
ght (
m)
Latitude
Day 8
0 20 40 60 80
0
0.2
0.4
0.6
0.8
1
p/p s
Day 6
27 31.5 36 40.5 45 49.5 54 58.50
200
400
600
Hei
ght (
m)
Day 6
Figure 4.15: Meridional sections showing the locations of the model half-levels at 22.5E. Theleft-hand panels show the whole northern hemisphere, plotting the normalised pressure on eachmodel half-level. The right-hand panels focus on the region in the vicinity of the front, this timeshowing the height above the surface of each half-level. In all panels the 298 K half-level isdotted, and the spacing is 6 K for all the half-levels below this.
or more levels become massless at exactly the same point. The latitude of each of the regular
grid cells is shown as an increment on the x axis of the right-hand panels. It is then clear from
these right-hand panels that by day 8 all of the levels which comprise the front are intersecting
the surface in the same horizontal grid cell. However, each cell is roughly 500 km wide, so to
look closer we need to examine the implied sub-gridscale mass distribution.
On each isentropic level the intersection with the ground is defined by the σ distribution.
The sharp σ feature at the surface is advected using a linear sub-grid fit as described in Sec-
tion 3.4.3. When viewing the output from the model the same sub-grid fit should be used to
identify the distribution of mass within the intersecting grid cell. This technique simply redis-
tributes the mass in the intersecting cell and its non-empty neighbour to lie under a straight
line, while conserving the mass within each cell. For more details see Section 3.4.3.
The results obtained by applying the sub-grid fit to the frontal region in the model output
are shown in Figure 4.16. It appears that, at this longitude, a true discontinuity in surface θ
has developed at the front by day 7.5. Not only do multiple levels intersect within the same
The LC1 Baroclinic Life Cycle 108
0
100
200 Day 4.5
Height in metres of model half−levels
0
100
200 Day 5.5
0
100
200 Day 6.5
0
100
200 Day 7.5
0
100
200 Day 8.5
20 25 30 35 40 45 50 550
100
200 Day 9.5
Latitude
Figure 4.16: The height of intersecting model half-levels at 22.5E given by using the linearsub-grid fit to reconstruct the distribution of mass within the intersecting grid cell and its non-empty neighbour. The location of each half-level is only plotted for the two grid cells in whichthe linear fit is valid.
The LC1 Baroclinic Life Cycle 109
0 2 4 6 8 100
500
1000
1500
2000
2500
3000
3500
4000
4500
Day
dz (
m)
Minimum Vertical Spacing
0 2 4 6 8 10−2
0
2
4
6
8
10
12
14
16
Day
dx (
Deg
rees
of L
atitu
de)
Minimum Horizontal Spacing
Figure 4.17: Two different measures of the strength of the surface front. The left-hand panelshows the global minimum vertical spacing between selected model half-levels, with the pro-viso that both are at least 10 m above the ground. The right-hand panel shows the minimumhorizontal spacing between the intersection points of the same levels using the sub-gridscalefit. In both panels the upper line is the spacing between the 268 K and 292 K half-levels andthe lower line is the spacing between the 280 K and 292 K half-levels. These ranges boundfour and two full model levels respectively.
horizontal grid cell at the front, but in the reconstruction of the sub-grid mass distribution the
levels intersect at almost exactly the same point on the surface.
Further evidence that a discontinuity is predicted, and an impression of its time evolution,
is given by Figure 4.17. The left-hand panel shows the minimum vertical spacing attained
between two half-levels while both are at least 10 m above the surface. The sub-grid fit is
not used to produce this plot. The spacing decreases rapidly towards zero as the front forms,
reaching minima of just 1.2 m and 7 cm for the two half-level pairs shown.
An alternative approach to quantifying the strength of the front is to calculate the horizon-
tal spacing in latitude of the intersection points of the sub-grid representations of various levels.
Results are shown in the second panel of Figure 4.17 for the same two pairs of half-levels. In
both cases the minimum horizontal spacing exhibited at some point along the front becomes
negative in finite time. This means that in the sub-grid reconstruction two isentropic surfaces
have intersected just above the surface. However, it should be remembered that the sub-grid fit
is only a linear approximation to the mass field near the front. Compared to a horizontal grid
scale of 4.5 the overlap indicated by the negative spacing is small, and certainly within the
margin of error that should be expected of the linear fit. This figure is strongly suggestive of
The LC1 Baroclinic Life Cycle 110
0 2 4 6 8 10−6
−4
−2
0
2
4x 10
−6
Day
Divergence (s−1)
0 2 4 6 8 10260
280
300
320
340
360
Day
Density σ (kg m−2 K−1)
Figure 4.18: Divergence and model density σ at the surface front for each of the four modellevels bounded by the 268 K and 292 K half-levels. At each longitude the fields are taken inthe intersecting grid cell for each level, and the zonal mean of these values is calculated.
an actual singularity in finite time but should not be taken as proof of the propagation of this
singularity into the interior of the fluid.
The first panel of Figure 4.18 shows the divergence at the front on the four appropriate
model levels. The intention here is to show that the development of the frontal discontinuity
is not simply the consequence of a spurious divergence caused by the problem described in
Section 3.7. The figure clearly demonstrates that this is not the case; there even appears to be a
slight convergence along model levels during the formation of the front. The frontal divergence
values are an order of magnitude smaller than the surface divergence observed in the solid body
rotation simulation, and are of the same size as features in the divergent field above the surface.
In general, although the surface divergence problem has affected the LC1 cycle the effect is
small. This is discussed in more detail in Section 4.13.1.
The strength of frontal gradients is often measured by the ratio of the absolute vorticity
to the planetary vorticity f . Cullen and Purser (1984) reported vorticities of up to 12f in
both their SG and primitive equation simulations, a value near the maximum allowed within
the validity of their SG equations. In their baroclinic life cycle simulations, Polavarapu and
Peltier (1990) noted maximum vorticities of 6f in a run at 50 km horizontal resolution, and
11f in one at 30 km. At the fronts, isentropic surfaces slope relatively steeply to the surface,
so that a velocity gradient taken on an isentropic surface will be different from one taken on a
constant height or pressure surface. It is this latter vorticity, rather than the isentropic vorticity
k · ∇θ × v, which should be compared with the values given above. To calculate this in the
isentropic model we would have to take the surface wind either side of the front and divide this
The LC1 Baroclinic Life Cycle 111
by the distance across the front. However, this distance vanishes because a frontal discontinuity
is predicted. The wind fields could be interpolated onto a normalised pressure surface, and the
vorticity calculated by finite differencing, but the result would depend strongly on the method
of interpolation and the resolution used on the pressure surface. This has not been attempted
here.
The isentropic model clearly demonstrates a frontal discontinuity as the mass σ between
two isentropic surfaces vanishes. However the zonal mean evolution of the second density
variable σ (shown in the final panel of Figure 4.18) decreases during frontogenesis, but does
not approach zero. To determine if this is realistic we have to consider what σ represents. The
surface fronts are formed by the intersection of model levels with the ground, as occurs in the
cell labelled B in Figure 3.5. To the right of the projected intersection point (i.e. for x > x0)
the static stability will be very large, as model half-levels squeeze close together. However, to
the left (x < x0) the level is massless, and the static stability just above the surface will be
small, as seen in Figure 4.15. In massless regions σ represents the limit of ∂p∂θ
just above the
surface, and will be large here. The σ value in the cell is an average over the whole cell, and
hence an average of these very different values, and as such, it should not vanish.
The isentropic model simulation therefore supports the hypothesis that there is no limita-
tion to the frontal gradient within the inviscid primitive equations. It should be noted however
that in the real atmosphere gradients will be limited by small-scale physical processes and var-
ious forms of instability. For example, Hoskins and Bretherton (1972) suggested that since the
Richardson number can be approximated by f/ζ it must become small in strong frontal regions,
thus satisfying the necessary condition for Kelvin-Helmholtz shear instabilities to occur.
4.10 Surface PV Anomaly
Zonal sections of the PV and θ distributions in the two models are shown in Figure 4.19, taken
after 7.5 days of integration when both waves are practically at the end of the growth phase.
One of the most noticeable differences is the large PV anomaly at the surface fronts predicted
by the IGCM. Real cyclone systems have been observed in which large PV anomalies were
apparent at low levels (e.g. Hoskins and Berrisford (1988) and Wernli et al. (2002)). These
anomalies, however, are considered to be products of latent heating and so are clearly distinct
from those in the IGCM cycle.
PV anomalies at surface fronts are a common feature of dry ‘adiabatic’ simulations, and
The LC1 Baroclinic Life Cycle 112
60 80 100 120 140 160
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
Longitude
p/p s
Isentropic Model at 40N
40 60 80 100 120 140
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
Longitude
IGCM at 40N
Figure 4.19: Zonal sections at 40N of PV and θ on day 7.5 for the isentropic model (left) andthe T42 IGCM (right). Solid contours are the PV every 0.5 PVU up to 5 PVU, and the dottedcontours are the θ surfaces coincident with the isentropic model half-levels.
have been studied by several authors. The anomalies are of particular interest in simulations
of two-dimensional Eady Waves. These two-dimensional waves are unable to develop the
meridional temperature structure which is fundamental in bringing about the equilibration or
halt in the growth of three-dimensional waves. Nakamura and Held (1989) attribute the wave
equilibration in this case to an effective increase in stability linked with the PV anomaly in the
frontal region.
The source of the anomaly was identified as follows, using the concept of boundary PV
first introduced by Bretherton (1966). Isentropic layers which intersect the boundary are con-
sidered to extend along the surface as layers of infinitesimal thickness in the same way as in
the isentropic model described here. The inhomogeneous boundary condition of a varying θ
on the surface is then replaced by a homogeneous condition of constant θ, with the variation
represented instead by variations in PV along the nearly massless layers. During the occlusion
of the fronts, fluid is diabatically transported into the massless layers by diffusive mixing. The
massless layers are thus lifted from the surface, intruding boundary PV into the interior to form
the anomaly in interior PV seen in the simulations. This is in line with the theorem of Haynes
and McIntyre (1987) which allows creation or destruction of PV by diabatic processes at the
intersection of an isentropic layer with the boundary. These diabatic processes, such as the
diffusive mixing, can then be thought of as exchanging interior and boundary PV along the
isentropic layer, while the sum of the two types of PV in the layer is conserved. For a recent
The LC1 Baroclinic Life Cycle 113
extension of the boundary PV concept see Schneider et al. (2003).
Later, Garner et al. (1992) modelled an Eady wave which equilibrated without explicit
diffusion by replacing the rigid boundaries with semi-infinite regions of high PV. The equi-
libration was then attributed to a vortex roll-up of this boundary PV as an intrusion into the
interior. In the limit that the value of the boundary PV is infinitely larger than that of the in-
terior PV the boundary represents a rigid boundary and the PV intrusion becomes infinitely
thin. The role of diffusion was then viewed as spreading this PV intrusion onto the grid scale,
effectively giving resolution to frontal discontinuities as PV structures. Cooper et al. (1992)
studied various aspects of the surface PV production, concluding that significant amounts of
interior PV are produced but also that the occlusion itself is not necessary for the production of
interior PV, which only requires a change in the mean surface θ.
In three dimensions the equilibration of the wave is largely due to barotropic processes
as described by Simmons and Hoskins (1980). Of particular importance is the reduction in
baroclinicity and increase of static stability associated with the reduction of the meridional
temperature gradient by the wave itself. The PV anomalies, however, are still common features
and are thought by some to have an important effect. Ziemianski and Thorpe (2000) used the
primitive equation baroclinic wave simulation of Rotunno et al. (1994) to perform an attribution
analysis of the flow induced by a frontal PV anomaly. They found that up to 10% of the
vertical velocity acting to lower the tropopause fold could be attributed to the PV anomaly,
hypothesising that this resulted in a stronger coupling of the upper and lower waves. This, they
claimed, would counter the direct stabilising effect of the anomaly described by Nakamura
and Held (1989) in the Eady wave. Ziemianski and Thorpe (2002) went on to demonstrate a
simulation of the wave by a nonlinear balance model in which the PV was exactly conserved.
Frontal temperature gradients were stronger in the balance model, and no surface PV anomalies
were produced. In contrast, frontal PV anomalies were produced in a primitive equation model
by day 7. After this time the wave growth rate in the nonlinear balance model decreased but
that in the primitive equation model did not, adding weight to their hypothesis that the PV
anomaly acts to phase-lock the upper and lower wave together, and thus to allow continued
growth.
Methven (1996) studied the surface PV anomalies in the IGCM LC1 cycle, attributing
their generation to scale limitation of the surface front, of which the dominant cause is the
diffusion on temperature. This effectively transports mass diabatically into the massless layers
and so extrudes boundary PV into the interior in the manner described above. The mass-
The LC1 Baroclinic Life Cycle 114
weighted domain-averaged PV increases as a result. In the isentropic model life cycle, however,
there is nothing which limits the horizontal scale of the surface front. The so-called occlusion
event in which the surface warm sector decays takes place solely due to horizontal advection
along isentropes, and there is no frontal PV anomaly produced. The total PV contained in the
intersecting isentropic model levels does change in a manner similar to that seen in the surface
divergence problem of Section 3.7, but the mechanism is entirely different and the effect is
small (see Section 4.13.1).
The zonal sections of Figure 4.19 show that the troughs, or tropopause folds, in the IGCM
cycle do indeed penetrate further into the troposphere than their equivalents in the isentropic
cycle, on first impressions supporting the hypothesis of Ziemianski and Thorpe (2002). How-
ever, the folds in the isentropic model are broader, a consequence of the diffusion on PV which
is implicit in the advection scheme (see Section 4.6).
It is therefore interesting to compare the growth rates of the waves, as implied by the EKE
evolutions of Figure 4.6, in search of evidence to support the hypothesis that the surface PV
anomalies act to prolong the growth stage. The growth rate in the isentropic wave does indeed
begin to decrease slightly before that in the IGCM wave, but the equilibration itself occurs
later. The EKE comparison is therefore inconclusive. In Figure 4.19 there is also no apparent
difference between the two models in terms of the simulated phase difference between the
upper and lower waves. Upper and lower level fields have also been compared in more detail
and there is no significant difference in the phase differences predicted by the two models.
It must be concluded, then, that there is no evidence here to support the theory of Ziemian-
ski and Thorpe (2002) that the PV anomalies are prolonging the growth stage of the life cycle.
This, however, does not disprove the theory. The waves predicted by the two models have dif-
ferent behaviour in fundamental quantities such as the growth rate of EKE; the two waves are
by no means identical, even in the absence of the frontal PV anomaly predicted by the IGCM.
A further possibility is suggested now. Following Garner et al. (1992), the diffusive mixing
of boundary and interior PV in the IGCM is viewed as acting ‘to give resolution to frontal
discontinuities as PV structures’. The surface PV anomaly in the IGCM and the apparent
surface θ discontinuity in the isentropic model can therefore be viewed as two different, but
equivalent, representations of the same dynamical feature. It is hypothesised, then, that the
effect of this feature on the surrounding wind fields, and thus the large scale flow, is similar in
both representations. At the ‘occlusion’ the IGCM predicts a positive PV anomaly, while the
isentropic model instead has a warm surface θ anomaly between the stronger cold and warm
The LC1 Baroclinic Life Cycle 115
fronts. These two anomalies are indeed expected to have similar dynamical influence according
to PV-θ theory (e.g. Hoskins et al. (1985)).
Both model simulations could be considered unrealistic representations of a truly adia-
batic front in the primitive equations for various reasons. However, the front predicted by the
isentropic model is much more similar to a high resolution simulation than that predicted by
the T42 IGCM is. Furthermore, since the IGCM representation involves the production of both
mass-weighted interior PV and entropy, it is surely a worse approximation to a truly adiabatic
frontal evolution than that predicted by the isentropic model, in which the entropy is exactly
conserved and the change in PV is significantly smaller. These, and other, conserved quantities
are now discussed in detail.
4.11 Global Conservation Properties of the Two Models
The real atmosphere has an infinite number of conserved quantities of which only a limited
number can be conserved in a numerical model. The two models compared here have been
formulated with very different conservation properties in mind. In addition, the IGCM uses
a vertical scheme derived from that of Simmons and Burridge (1981) which conserves both
energy and angular momentum, although neither are formally conserved by the horizontal
spectral scheme or by the time differencing. The isentropic model, on the other hand, exactly
conserves the mass, mi say, in each isentropic layer of constant θi, and therefore also the
entropy since this is given bymicp ln θi. The mass-weighted PV in levels which do not intersect
the ground is conserved, but in intersecting levels this quantity varies due to the mixing of
boundary and internal PV by the diffusion implicit in the advection scheme.
A comparison of conserved quantities over the two LC1 life cycles is shown in Figure 4.20.
Mass conservation is shown in the top two panels and is extremely good in both models; the
column average mass is 104 kg so the error is of order 10−4 % for both models, i.e. of the same
size as the expected round-off error.
The energy conservation properties in the second two panels are particularly interesting.
The difference in kinetic energy between the two models is mostly accounted for by the differ-
ence in EKE shown in Figure 4.6. The potential energy in the isentropic model decreases by
a larger amount than in the IGCM despite the fact that less kinetic energy has been generated
by conversion from available potential energy. This error in the isentropic model’s potential
energy must be due to an error in the available part of the potential energy since, in line with
The LC1 Baroclinic Life Cycle 116
0 5 10 15−2
−1
0
1
x 106
Ene
rgy
(Jm
−2)
0 5 10 15−2
−1
0
1
x 106
5 10 15−10
−5
0
x 10−3
Mas
s (K
g m
−2)
IGCM
5 10 15−10
−5
0
x 10−3 Isentropic Model
0 5 10 15−8
−6
−4
−2
0
2x 10
10
Ang
. Mom
entu
m (
Kg
s−2)
0 5 10 15−8
−6
−4
−2
0
2x 10
10
0 5 10 150
200
400
600
Ent
ropy
(JK
−1m
−2)
Day0 5 10 15
−40
−20
0
Day
Figure 4.20: Global conservation of quantities from the IGCM (left-hand panels) and the isen-tropic model (right-hand panels). All quantities are integrated over the finite difference gridof the respective model, then averaged over the Earth’s surface and the change from the initialvalue is plotted. In the energy plots the dashed line is the total kinetic energy, the dotted lineis the potential plus internal energy (given by cpT for a hydrostatic system) and the solid lineis the sum of these, i.e. the total energy. Note that the y axis scaling is different for the twolowermost panels.
The LC1 Baroclinic Life Cycle 117
the discussion of Section 4.1, the unavailable part is that part which cannot be extracted by an
adiabatic re-arrangement of mass within an isentropic layer. The unavailable part is therefore
defined by the mass within each isentropic layer, and this is exactly conserved. The error in the
total energy of the isentropic model is thus the imbalance between the EKE generated and the
available potential energy from which it is converted.
The error in the total energy in the IGCM is known to be simply due to the non-closure
of the energy cycle (see Blackburn (1983)); kinetic energy which is lost during the diffusion
process is not put back into the system as heat. With this in mind it is suggested that the
same process is occurring in the isentropic model. As has been shown, the PV is diffused
significantly which broadens and weakens features in the velocity distribution such as the jet.
This smoothing of the velocity field will lead to a reduction in the kinetic energy as the variance
in the velocity field is reduced, and the kinetic energy lost is again not put back into the system
as heat.
As shown later (Section 4.12) there is a general loss of interior PV to the massless region,
also attributable to the diffusive nature of the advection scheme. Velocities near the surface are
reduced by this effect. This can be seen by noting that the sum of mass-weighted interior PV
within an intersecting isentropic layer is equal to the circulation around the circuit bounding
the mass within the layer, i.e. the circuit marking the intersection of the level with the surface
(Hoskins (1991)). If the sum of mass-weighted interior PV is reduced then so is the circulation,
and thus the component of surface wind lying parallel to this circuit. The loss of PV could
therefore play a part in the loss of total energy by the isentropic model.
The conservation of total energy, and also of angular momentum (shown in the third set of
panels), is worse in the isentropic model than in the IGCM. This should probably be expected
given that the isentropic model makes no attempt to formally conserve these quantities, whereas
the IGCM does. The isentropic model errors, while larger than those in the IGCM, are certainly
of a similar scale. To put the errors in perspective the drift in angular momentum corresponds
to 0.3% of the total, and that in energy just 0.05%.
The potential temperature θ satisfies a Lagrangian conservation law(DθDt
= 0)
under adi-
abatic motion, so that not only is θ conserved for any given fluid parcel, but any function of θ
is as well. The entropy s = cp ln θ is just one of the infinite number of functions of θ which
are conserved by the continuous primitive equations, although it does have a particular phys-
ical significance. The entropy, and in particular one potential consequence of not conserving
it in models, is discussed at length in Chapter 5. Here the conservation of entropy is simply
The LC1 Baroclinic Life Cycle 118
Model Entropy error Energy error Imbalance
T42 IGCM 550 -2000 2550θ Model -50 -5000 4950
Table 4.1: Errors in the second law of thermodynamics by expressing energy errors as theimplied entropy error using equation (4.6). Changes in entropy and energy were taken over thelife cycles and values of T = 250 K and ∆t = 10 days were assumed. Units are Jm−2K−1.
compared for the two models.
The final two panels show the change in entropy over the course of the two life cycles.
While the IGCM shows a systematic increase in global average entropy the change in the en-
tropy of the isentropic model is simply a reflection of the error in global mass due to rounding
error. This time the errors in the two models do appear to have significantly different scales.
However it should be asked whether the reduction in entropy error obtained by using the isen-
tropic model is significant given the increase in the error in the energy.
An obvious way to relate the energy and entropy errors is through the second law of ther-
modynamics, given bydS
dt=Q
T(4.5)
in the absence of friction. Over a fixed time period ∆t this reduces to an equation
∆S∆E =∆E
T(4.6)
for the entropy error ∆S∆E equivalent to an energy error ∆E. Since both models are suppos-
edly adiabatic, both sides of equation (4.5) should be zero for each model. The relative errors
can therefore be compared if they are viewed as errors in the representation of the second law
in each of the models. Table 4.1 compares the errors in entropy and energy, both expressed as
entropies using equation (4.6), and also the imbalance in the second law which is implied. The
imbalance can be interpreted as the entropy error which would be implied if the energy had in
fact been conserved exactly. A temperature of T = 250 K was chosen as a suitable average
over the global domain. The error in the second law is seen to be greatest in the isentropic
model, which exhibits both the largest individual error and the largest imbalance. From this
point of view the energy error in the isentropic model is more significant than the entropy error
in the IGCM.
The LC1 Baroclinic Life Cycle 119
4.12 PV Conservation
In the continuous primitive equations the PV satisfies a Lagrangian conservation law in the
same way as the potential temperature. This conservation theorem was one of the motivations
for the construction of the isentropic model described here, along with the recognition that the
PV is a fundamental quantity that completely determines the balanced part of the flow (see
Section 1.1). It is therefore hoped that this model will conserve its PV better than the IGCM,
and hence that some improvement in the representation of the dynamics might be obtained.
It should be noted that there is a key difference in the nature of the conservation properties
of PV and θ. The mass-weighted integral of any function of θ over an isentropic layer is a
robust invariant, because the stratification implied by the θ distribution itself acts to inhibit
mixing across isentropes, i.e. to inhibit any non-conservative processes. No such robustness
exists for a similar integral of any function of PV over an isentropic layer. It is expected that
the potential enstrophy Q2 will cascade to small scales as thin filaments of PV are produced,
and will ultimately be dissipated in any realistic flow. In this way a given PV distribution
will evolve adiabatically in such a way that some violation of Lagrangian PV conservation is
inevitable.
The variation of PV in the two models is illustrated in Figure 4.21. The IGCM simulation
assumes symmetry about the equator and so the PV is positive everywhere. The isentropic
model, however, simulates the whole global domain, and so the total mass-weighted PV is zero.
While errors can be examined by studying the departure from zero, this gives no impression
of the size of errors relative to the absolute PV values. To overcome this it is the global mass-
weighted integral of |PV | which is plotted here. This is, after all, a function of the PV and so
should be conserved. As expected, the isentropic model conserves this better than the IGCM
does, but it does still show a slight drift.
The lower plots show the same change in |PV | but this time broken down into the con-
tributions from each of the isentropic layers corresponding to model levels in the isentropic
model. For the IGCM some interpolation is required in order to do this. For each column of
the transform grid, the mass-weighted PV in each isentropic layer is calculated as follows. The
pressure at the two θ values bounding the layer is found by interpolating from the IGCM levels
assuming a linear relation between ln θ and ln p. This gives the pressure corresponding to a
given θ value from the pressures at the model levels below (p1 say) and above (p2 < p1), and
The LC1 Baroclinic Life Cycle 120
0 5 10 15−0.01
−0.005
0
0.005
0.01
0.015
0.02|PV|
0 5 10 15−0.5
0
0.5
1
1.5PV levelwise (IGCM)
0 5 10 15−0.25
−0.2
−0.15
−0.1
−0.05
0
0.05
PV levelwise (θ Model)
Figure 4.21: PV conservation diagnostics from the LC1 cycle simulated by the two models.All quantities are plotted as the change from their initial value, normalised by the initial valueso that, e.g. a value of 0.5 indicates a 50% increase. The top panel shows global mass-weighted|PV | for the IGCM (dashed line) and isentropic model (solid line). The following panels showthe change in total PV within individual isentropic layers which correspond to the isentropicmodel levels. Levels which intersect the surface are shown dashed.
The LC1 Baroclinic Life Cycle 121
their respective θ values, as
p(θ) = p1−α1 pα2 where α =
ln θθ1
ln θ2θ1
. (4.7)
The mass-weighted PV in the layer is then given by integrating between the bounds of the
layer, labelled θb and θt say:
−1
g
∫ p(θt)
p(θb)
Q dp, (4.8)
where Q is diagnosed at the IGCM level, or levels, in the range (p(θb), p(θt)). Some error is
expected because of the need for interpolation; in Section 5.2.2 this is shown to be small for a
similar application.
The change in |PV | in the isentropic model is seen to be almost entirely due to the change
which occurs in the intersecting levels. There is, in fact, a small decrease in PV in the upper
levels due to implicit diffusive mixing of positive and negative PV across the equator; this,
however, is tiny compared to the change within the intersecting levels.
Larger variations are seen in the IGCM, and there is a clear difference between the errors
in the two models. While the isentropic model loses PV through the lower boundary, the IGCM
gains PV there through the limiting of the surface frontal gradients described in Section 4.10.
This time there is a significant drift in non-intersecting levels, of order 10% in the worst case.
The IGCM thus fails to respect the PV conservation theorem of Haynes and McIntyre (1987)
which holds for the primitive equations. This is a result not only of the non-conservation of the
PV of a fluid parcel but also of the non-conservation of its potential temperature.
The increase in maximum PV values in the IGCM cycle (noted in Section 4.4) is an exam-
ple of this lack of Lagrangian conservation. As described in Section 4.6, the implicit diffusion
in the isentropic model acts instead to erode these PV maxima. Both of these changes are
spurious; to estimate the effects on the life cycle, their magnitudes are compared. Figure 4.22
shows the evolution of the maximum PV value in a selection of isentropic levels. The IGCM
error is always larger than that in the isentropic model. The erosion of PV maxima in the isen-
tropic model is most obvious in the first panel, but the error is still significantly less than that
in the IGCM. Note that in all four cases the PV maximum corresponds to a significant feature.
For example, the increase in the IGCM PV maximum in the intersecting levels is due to the
frontal anomalies (Section 4.10). At 331 K the largest PV values occur at the local maxima
near 60N , which are seen to spuriously increase in the IGCM in Figure 4.3. It is uncertain
The LC1 Baroclinic Life Cycle 122
0 5 10 150.95
1
1.05
1.1
1.15383K Level
0 5 10 150.8
1
1.2
1.4
1.6331K Level
0 5 10 150.5
1
1.5
2
2.5
3277K Level
Day0 5 10 15
0
1
2
3
4271K Level
Day
Figure 4.22: Variation with time of the maximum interior PV value in a selection of isentropiclevels from the IGCM (dashed line) and isentropic model (solid line). For the isentropic modelinterior PV was defined as the PV in any cell which has σ/σ > 0.1. Values are normalised bythe initial value for that level. The lower panels show levels which intersect the surface.
however what the effect is of these latter increased maxima in the IGCM. Comparison with the
high resolution run (Figure 4.5) suggests that the flow in the vicinity of one of these anomalies
is largely unchanged, so the effect is small. The anticyclonic shear of the high-PV trough is
weaker at low resolution, which would be consistent with a spuriously large cyclonic influence
from the 60N anomaly. This, however, could also be due to the resolution limits on the width
of the trough, resulting in a spurious amplification of its influence as a dynamical feature, in
accordance with the scale effect.
In the intersecting levels the isentropic model can show spurious increases similar to that
seen in the IGCM, but they are significantly smaller and generally more transient. The sections
shown in Figure 4.19 show no large surface PV anomalies as seen in the IGCM. As in the IGCM
these increases are due to the mixing of boundary and interior PV due to diffusion. Here the
diffusion implicit in the advection scheme (as described in Section 3.4.1) acts to diffuse PV
across the interface along isentropic levels. The increase occurs at roughly the same time as
that in the IGCM, since this is the time when strong gradients are formed at the boundary
and the isentropic layers move most dramatically across the surface, indicating a significant
velocity normal to the intersection. Note that only the value of the PV maximum is shown in
The LC1 Baroclinic Life Cycle 123
Figure 4.22; on average interior PV is lost from these levels (Figure 4.21). Also note that these
spurious increases are limited to only those grid cells which actually intersect the boundary; if
only cells with σ/σ > 0.99 are considered then the PV maximum is practically unchanged.
The average loss of interior PV in almost all the intersecting levels in the isentropic model
run is due to the implicit diffusion in the advection step. This loss of PV is less than the error
in the PV contained within isentropic levels by the IGCM. The fact that PV is diffused into the
boundary rather than out of it is probably due to the relaxation which, by relaxing the winds to
zero, acts as a sink of vorticity.
The mixing of internal and boundary PV by diffusion is also responsible for the spreading
into the interior of the boundary anomalies caused by relaxation in the surface divergence prob-
lem described in Section 3.7. The magnitude of this problem, and the effect of the relaxation
in general, will now be assessed for the LC1 simulation.
4.13 Effect of the Relaxation Scheme on the LC1 Cycle
4.13.1 The Surface Divergence Effect
The fundamental signature of this problem is the large value of divergence at the intersection
of a model level with the ground, which can be up to an order of magnitude larger than other
features in the interior divergence field, as seen in Figure 3.16 for the solid body rotation
experiment. A careful study of the divergence field in the LC1 cycle (not shown) reveals no
such anomalies; any effect is therefore of a size no greater than the real divergent processes.
There is also no evidence of the outcropping seen as a by-product of the problem in the solid
body rotation (SBR) simulation.
The evolution of PV in the intersecting levels for the LC1 cycle compared to the SBR
simulation is summarised in Figure 4.23. The surface divergence problem in the SBR run is
mostly confined to the lowest massy level, and it is this level which shows by far the largest
error. In the LC1 cycle all the intersecting levels show a general reduction in both the total PV
and its variance, apart from the lowest massy level which shows a small increase. This increase
is almost certainly a sign of the surface divergence effect and its size shows that the effect is
much smaller than in SBR.
The effect of the surface divergence problem in LC1 is small because the surface wind
field is weaker, and less organised, than in SBR. From the lower panel of Figure 4.7 it is clear
The LC1 Baroclinic Life Cycle 124
0 5 10 15 20−0.2
0
0.2
0.4
0.6SBR
|PV
|
0 2 4 6 8 10−0.2
0
0.2
0.4
0.6LC1
0 5 10 15 200.5
1
1.5
2
2.5
Var
(PV
)
Day0 2 4 6 8 10
0.5
1
1.5
2
2.5
Day
Figure 4.23: Comparison of the PV change in the intersecting model levels between the solidbody rotation simulation (left) and the LC1 cycle (right). Upper panels show the total mass-weighted |PV | and lower panels show the mass-weighted variance of |PV |, i.e. the sum of thesquares of the deviation of |PV | in each cell from its mean value for that level. Quantities areagain normalised by their initial values, and the lowest massy level is shown dashed.
that, even when the jet has extended down to the surface, it does so to the north of the frontal
region where the levels are intersecting the ground. From this picture it looks possible that the
easterly wind maximum of flow along the front could have caused a spurious divergence which
would act to increase the frontal gradient. This, however, was disproved in Section 4.9.
The general loss of interior PV into the boundary in the LC1 cycle is not seen in the SBR
experiment. This is because in SBR the flow is largely parallel to the intersections, which
therefore move little over the surface. While this error is larger in LC1, it is significantly
smaller than the error in the SBR experiment which is caused by the surface divergence effect.
4.13.2 Sensitivity to the Relaxation
The relaxation of fields in the massless region back towards their initial values (see Section 3.3)
is necessary to improve stability, but is certainly one of the less attractive features of this isen-
tropic model. All possible care has been taken to ensure that the massless region and the
relaxation scheme do not adversely affect the flow in the interior. In the vertical scheme infor-
mation only propagates downwards in the massless region, and the surface divergence effect of
The LC1 Baroclinic Life Cycle 125
Figure 4.24: Eddy kinetic energy in Jm−2 for versions of the isentropic model LC1 cyclewhich differ only in the relaxation timescale as shown. The T42 IGCM value is shown forcomparison.
Section 3.7 has been shown to be small.
It is hard to imagine, however, that the above ground flow is entirely unaffected by the
relaxation process, especially given the implicit mixing of boundary and interior PV described
in Sections 4.12 and 4.13.1 above. To investigate the sensitivity of the LC1 simulation to the
relaxation the cycle was run several times with different relaxation timescales. The eddy kinetic
energy of each of these runs is shown in Figure 4.24. There is significant sensitivity to the
relaxation timescale, though the differences are smaller than that between the isentropic model
and IGCM runs, with less EKE is produced than in the IGCM for all relaxation timescales. The
effect of increasing the strength of the relaxation is very systematic; reducing the timescale
results in a simulation which is stable for longer, and in a wave with a larger EKE peak which
is attained slightly later. Note that all the runs are similar until day 7, i.e. until after the collapse
of the surface fronts.
The reason why there is such a systematic dependence on the timescale is unknown. As
mentioned above, one mechanism by which the relaxation could affect the interior flow is
through the PV advection. Changing the timescale will affect massless PV values near to cells
which are at least partially above ground. These ‘massy’ cells will feel the effect through the
implicitly diffusive PV advection step.
The LC1 Baroclinic Life Cycle 126
Note that the EKE in the IGCM cycle is very sensitive to the value of certain model param-
eters, so certainly doesn’t represent an absolute truth. MacVean (1983) showed that the EKE
maximum of a baroclinic wave in the IGCM could be increased by a factor of 2.5 by changing
the timescale, and scale-selectivity, of the hyper-diffusion terms. The sensitivity of the isen-
tropic model to the relaxation is an order of magnitude smaller than this. This sensitivity is
therefore interesting, but is not too alarming given the qualitative similarity between all four
runs. Apart from significant differences in the length of simulation achievable there is little to
choose between the various timescales; it is the 30 minute run which is described in the rest of
this chapter.
4.14 Smoothness of∇2M
Hsu and Arakawa (1990) and Arakawa et al. (1992) reported a problem in their isentropic
model as follows. The surface potential temperature distribution θs is determined by the lo-
cation of the model level intersections, and its resolution is given by the vertical spacing of
the isentropic levels. The horizontal pressure gradient force term is given by ∇2M , where the
Montgomery potential M is obtained by integrating the hydrostatic equation (3.8) up from the
surface as described in Section 3.2. With θs insufficiently smooth they found that ∇2M , and
thus also the geostrophic wind, became almost discontinuous at all levels above an intersection
point.
Randall et al. (2000) studied the effect further by prescribing an analytic pressure and θs
distribution (that of Held and Suarez (1994)), and then calculating the geostrophic vorticity
∇.(
1f∇θM
)using both the continuous equations, and also the discrete equations of Hsu and
Arakawa (1990). Significant computational noise was evident in the discrete vorticity and
attributed to the lack of smoothness in θs. The conclusion was that vertical resolution in an
isentropic model should be chosen so that the horizontal resolution of θs as defined by the
intersection of model levels matches the horizontal resolution of the model.
Figure 4.25 shows meridional sections of ∇2M during the LC1 cycle simulated by the
isentropic model described here. No gridscale noise is seen in ∇2M in connection with the
intersection of model levels with the surface. Figure 4.7 shows six isentropic levels intersect-
ing the surface in about 30 of latitude in the initial conditions. In this simulation the fifth
horizontal grid was used, which has roughly seven cells in this latitude band. The horizontal
and vertical resolutions are therefore fairly closely matched. If the horizontal resolution is in-
The LC1 Baroclinic Life Cycle 127
0 10 20 30 40 50 60 70 80 90
260
280
300
320
340
360
380
Pot
entia
l Tem
pera
ture
∇2M − Initial Conditions
0 10 20 30 40 50 60 70 80 90
260
280
300
320
340
360
380
Latitude
Pot
entia
l Tem
pera
ture
∇2M − After 5 days
Figure 4.25: Meridional sections at 0E of the Laplacian of the Montgomery potential. Con-tours are drawn every 0.5× 10−9 s−2, with negative contours dashed. The lowermost contourline marks the location of the surface.
0 10 20 30 40 50 60 70 80 90
260
280
300
320
340
360
380
Latitude
Pot
entia
l Tem
pera
ture
Laplacian of M − Initial Conditions
Figure 4.26: As Figure 4.25 but at higher horizontal resolution (grid 6). Here the noise in∇2Mis clearly visible.
The LC1 Baroclinic Life Cycle 128
creased then the two are no longer well matched; Figure 4.26 shows the initial conditions of
the LC1 cycle at grid number six, i.e. with approximately four times the number of cells in
the horizontal but with the vertical resolution unchanged. Here noise is clearly visible so it is
concluded that θs is not resolved smoothly enough, and therefore this problem does still exist
in the isentropic model described here. In the grid number five simulation discussed in the
rest of this chapter the resolutions are matched, indicating that this is indeed a solution to the
problem.
It is worth noting that the noise in the high resolution version is very damaging. The noise
seen in Figure 4.26 extends down into the unstable massless region in the initial conditions and
causes the model to crash almost immediately.
4.15 Summary
In this chapter the simulation of a baroclinic wave life cycle by the isentropic model of Chap-
ter 3 has been compared with its simulation by a standard normalised pressure coordinate
model, the IGCM. The principal findings are as follows:
• An underlying difference between the two models is that, while the PV distribution is
slightly diffused by the isentropic model, it is intensified by the IGCM. This contrasting
behaviour is largely responsible for the synoptic differences observed between the two
waves. The isentropic model life cycle is, in fact, much more similar to the IGCM version
with ∇4 hyper-diffusion than to the standard ∇6 version. The error in PV maxima is
significantly smaller in the isentropic model, but certain features of the T341 IGCM
wave, such as the thin, heavily sheared troughs, are better represented in the ∇6 T42
IGCM wave than by the isentropic model.
• The isentropic model predicts an actual frontal discontinuity in surface potential temper-
ature, while in the IGCM the frontal gradient is limited by hyper-diffusion on tempera-
ture and (to a lesser extent) the spectral truncation. In contrast, the surface front in the
IGCM is characterised by a weaker temperature gradient and a large PV anomaly. Both
model simulations can be considered as approximations to a truly adiabatic front in the
primitive equations. Since the IGCM simulation involves the generation of entropy, and
a much larger change in the total interior mass-weighted PV, it is a worse approximation
than the isentropic model simulation. It is the front predicted by the isentropic model
The LC1 Baroclinic Life Cycle 129
which shows most similarity to that in a high resolution version of the IGCM life cycle.
• The ability of the two models to satisfy the various conservation laws of the continuous
equations contrasts markedly. For example, the relative error in global mass-weighted
PV in the IGCM is roughly the same size as the relative error in angular momentum
in the isentropic model. The IGCM strikingly violates the impermeability theorem of
Haynes and McIntyre (1987), which is almost exactly satisfied in the isentropic model
in all non-intersecting levels. While the entropy conservation is better by an order of
magnitude in the isentropic model, the energy error it exhibits means that the total error
in representing the second law of thermodynamics is larger than in the IGCM. Improved
energy conservation is clearly desirable in the isentropic model.
• The problem of noise in∇2M found previously in pure isentropic models is successfully
overcome here by matching the horizontal and vertical resolutions.
• The instability which caused the isentropic model to crash is purely a feature of the
massless boundary region, and was not forced by any feature of the interior flow. The
EKE comparison shows that the life cycle is largely complete when the model crashes, so
a full comparison of the two cycles is possible. There is some sensitivity to the timescale
chosen in the relaxation scheme which is used to control the instability; this, however,
is much less than the sensitivity of the IGCM to parameters in its hyper-diffusion terms.
The instability, though, clearly needs to be addressed if the model is to be used routinely.
• The surface divergence problem of Section 3.7 has little effect on the life cycle predicted
by the isentropic model. The mixing of boundary and interior PV by the diffusive advec-
tion scheme is more significant, but is smaller than the error shown in mass-weighted PV
in the same levels by the IGCM. The same mechanism, however, is likely responsible for
part of the energy loss seen in the isentropic model.
Chapter 5
Entropy Production in the IGCM
5.1 Introduction
The most fundamental difference between the two models compared in the previous chapter
is in the representation of the entropy. The isentropic model is constructed on the principle of
Lagrangian conservation of entropy under adiabatic, frictionless flow. While the IGCM is also
constructed from a continuous equation set which satisfies this constraint, the discretized sys-
tem does not. As shown in Figure 4.20, the isentropic model exhibits changes in global entropy
which are attributable entirely to truncation errors in the conservation of mass; the IGCM, in
contrast, shows a systematic increase in global entropy which is an order of magnitude larger.
In Section 4.11 it was shown that for this particular isentropic model the drift in total en-
ergy constitutes a larger error in modelling the thermodynamics of the system than the entropy
drift in the IGCM does. However, it is important to understand these changes in the total en-
tropy in the IGCM simulation because the vast majority of weather and climate predictions are
currently made using similar, non-isentropic models. Relatively little is known about the en-
tropy budgets of such models and how well these represent the real atmosphere. This chapter
is dedicated, therefore, to studying the entropy change during the IGCM LC1 cycle, and its
relation to both observed and modelled entropy budgets of the atmosphere. We begin with an
introduction to entropy principles and some further, more specific, motivations.
Entropy Production in the IGCM 131
5.1.1 Entropy and the Second Law
Specific entropy s is a thermodynamic state variable defined by
ds =dE
T(5.1)
for the change in entropy from an initial state to one differing by energy dE. Note that strictly
the change between the two states must be reversible in this definition (see e.g. Curry and
Webster (1999) for more detail). Dividing by the temperature T has converted the inexact
differential dE into the exact differential ds, meaning that while the change in energy between
the two states depends on the history of the thermodynamic state throughout the transition, the
change in entropy depends only on the initial and final states (e.g. Dutton (1976)). Using the
first law of thermodynamics and the ideal gas law, (5.1) can be written
ds = cp d(lnT )− R d(ln p), (5.2)
leading to the expression
s− s0 = cp ln
(θ
θ0
)(5.3)
for the entropy in terms of the potential temperature θ, where a subscript 0 indicates the initial
state. An isentropic surface is therefore a surface of both constant s and constant θ.
Entropy can be formally related to the notion of order in a thermodynamic system using
probability theory. A state with a high degree of order or information is less probable, and
has lower entropy, than a more disordered state. For example, a room in which all the air
molecules are crowded into one corner has a lower entropy than an identical room with the
molecules spread evenly throughout. In the same way, a volume of air consisting of parcels of
differing θ is more ordered than if all parcels had identical θ. Any mixing of the air parcels
will increase the total entropy (e.g. Johnson (1997), Appendix B). The entropy also has an
interpretation as the availability of energy; an increase in entropy is associated with a change
of energy from a more to a less usable form.
The second law of thermodynamics has been stated in several equivalent ways. Here we
use
Tds
dt= q + f (5.4)
for an open system which can exchange heat with its surroundings. Here q is the flow of heat
Entropy Production in the IGCM 132
into the system and f is friction internal to the system. Since f ≥ 0, this implies that an isolated
system for which q = 0 can never move to a state of lower entropy. The entropy of an open
system such as the atmosphere can decrease through a flow of heat to its surroundings, but the
total entropy of the system and its environment, in this case the universe, can never decrease.
It is important, therefore, to differentiate between the entropy of the system (ssys) and the total
entropy (stot) of system and environment.
A process is termed reversible if the system is in a state of thermodynamic equilibrium
throughout the process; the process can then be reversed such that both the system and its en-
vironment return to their initial states. The second law demands that dstot = 0 for a reversible
process, otherwise the total entropy will be decreased during the reversal. A process is irre-
versible if dstot > 0 so that the system can still be restored to its original state but not both the
system and the environment can be restored. It is important that these definitions apply to the
total entropy; an open system such as the atmosphere is, for example, able to have reversible
sources and sinks of entropy which simply correspond to fluxes of heat across its boundary.
From (5.1) it follows that stot is unchanged when heat is transferred from one element of a
system to another only if the two are at identical temperatures; the transfer is then reversible.
Any exchange of heat between two elements of differing temperature is irreversible.
5.1.2 The Entropy Budget of the Atmosphere
Over timescales of a few days the behaviour of an air parcel is dominated by reversible trans-
port, leading to the beneficial quasi-Lagrangian nature of the isentropic coordinate. A plethora
of highly irreversible processes do, however, act within the atmosphere, for example the pre-
cipitation out of water vapour, and the dissipation of kinetic energy in the turbulent planetary
boundary layer. Despite this, the total entropy of the atmosphere is approximately constant on
timescales of the order of a few years or longer. The continual production of entropy within
the atmosphere is balanced by a net loss to space by radiation as follows. The motion of the
atmosphere is driven by net heating where it is already hot (in the tropics) and cooling where it
is already cool (the extratropics). The average global incoming and outgoing energy fluxes are
equal, but the entropy removed by long-wave radiation is roughly 20 times the amount brought
in by incoming short-wave radiation (Peixoto et al. (1991)). This is easily seen from (5.1)
since long-wave radiation is emitted from the Earth at temperatures of order 300 K, while the
emission of short-wave radiation from the sun occurs at temperatures close to 6000 K. A given
Entropy Production in the IGCM 133
amount of energy is carried by far fewer photons in solar than in terrestrial radiation and so the
energy is more ordered and the radiation is said to be of high quality, meaning low entropy. The
balance between internal generation of entropy and the net radiative loss to space is generally
referred to as the entropy balance of the atmosphere.
Peixoto et al. (1991) made the first detailed attempt at estimating the entropy budget of
the atmosphere. They estimated the entropy production by irreversible processes internal to
the atmosphere by considering each process individually. The dominant contributions were
from latent heating and absorption of solar radiation, with all other contributions one or two
orders of magnitude smaller. This internal generation was then compared to their estimate
of the net flux of entropy to space. The balance appeared to be represented reasonably well,
with a difference of 17 mWm−2K−1 between the two terms, each of roughly 600 mWm−2K−1.
The leading order terms in the entropy budget seemed to be well understood. This impression
however proved misleading.
Goody (2000) revisited the entropy budget and re-interpreted the data of Peixoto et al.
(1991) by applying two new principles. Firstly, it is the budget of the whole climate system
which is of interest, not that of the atmosphere in isolation. Large radiative fluxes between
the surface and atmosphere are unimportant for the entropy of the climate system and are thus
removed from the balance equation. The second principle is a consequence of a subtlety in
the interaction between the fluid and the radiation field, described earlier by Goody and Abdou
(1996). As noted above, the transfer of energy from one medium to another is reversible if
and only if the temperatures of the two media are identical. In the absorption of radiation by a
fluid the radiation temperature of the photons and the kinetic temperature of the molecules are
generally different, strikingly so, in fact, in the absorption of solar radiation by the atmosphere.
A large irreversible source of entropy would therefore be expected. The subtlety lies in the tra-
ditional fluid dynamical assumption that the fluid is in thermodynamic equilibrium. Absorbed
photons are thus instantaneously thermalized to the fluid temperature so that the transfer of heat
to the fluid occurs at the same temperature, and so constitutes a reversible source of entropy
for the fluid. There is a significant irreversible entropy source due to the thermalization but
this is a source in the entropy budget of the radiation field, not the fluid system. It is therefore
only the reversible component of the radiative entropy flux which is of relevance to the climate
system, and it is this term which is equal and opposite to the internal entropy sources of the
system. The irreversible component largely balances the net outward flux of radiative entropy;
both terms are components of the radiative entropy field rather than that of the climate system
Entropy Production in the IGCM 134
and so are removed in the budget of Goody (2000). This viewpoint is not intuitively obvious,
but could be considered equivalent to treating radiation purely as a diabatic heating term which
transfers energy to the atmosphere.
After Goody’s re-arrangement of the data the difference is striking; while Peixoto et al.
(1991) show am imbalance of -17 mWm−2K−1 between sources and sinks of 589 mWm−2K−1
and -606 mWm−2K−1, the re-arranged budget shows the same imbalance but this time between
sources and sinks of 50 mWm−2K−1 and -33 mWm−2K−1. The imbalance in the entropy
budget is now of the same size as individual source terms, and Goody’s conclusion is that
‘atmospheric entropy inventories are not yet established to a desirable degree of accuracy’.
5.1.3 Some Specific Motivations
Clearly there is much still to be understood concerning exchanges of entropy within the climate
system and its environment. This is of practical as well as academic interest. The real system
obeys strict physical laws and exhibits a delicate balance between a multitude of entropy fluxes.
There is obvious potential for the entropy budget to be used as an additional constraint on
numerical models and their parameterizations, in an attempt to make the simulated climate
more realistic (e.g. Gohar (2002)). Interest in climate entropy is further stimulated by two
intriguing and controversial hypotheses.
Paltridge (1975) presented a simple, thermodynamical multi-box model of the atmosphere
and ocean system containing only parameterized dynamics, and showed that the observed mean
meridional distribution of temperature and cloud cover could be predicted simply by maximis-
ing the entropy produced by poleward heat transport. This work gave birth to the Maximum
Entropy Production or MEP hypothesis, namely that the climate system continually adjusts it-
self to the state in which this entropy generation term is maximised, the global dynamics acting
as ‘something of a passive variable’ in the words of Paltridge.
Several studies have provided supporting evidence for the MEP hypothesis; we only give
a few recent examples. Lorenz et al. (2001) used the MEP principle to realistically predict
the zonal mean temperatures of other planetary bodies in a simple thermodynamical model.
Shimokawa and Ozawa (2001, 2002) applied perturbations to an oceanic GCM to induce tran-
sitions between multiple steady states of the thermohaline circulation. In their experiments all
the transitions led to a final state with a higher rate of entropy production by heat transport than
the initial state, and they took this as evidence that ‘a nonlinear system is likely to move to a
Entropy Production in the IGCM 135
state with maximum entropy production by perturbation’.
There is, however, widespread skepticism over the hypothesis, mostly because all the ev-
idence is empirical in nature and no physical explanation has been agreed upon for why the
system might seek such a state. Recently Ozawa et al. (2003) gave a review of the theory,
including attempts to provide a physical basis. MEP supporters are also hailing the work of
Dewar (2003) as an answer to the objections; information theory is used to show that, of all
possible states, that of MEP is statistically the most probable. Despite this, whether or not the
MEP hypothesis is correct still remains to be proven indisputably. It has certainly motivated
many studies of the entropy of the climate system. The fact that it has neither been proved
or disproved in the past 30 years testifies that much remains to be understood of the entropy
exchange processes at work, and of their importance.
A direct motivation for the work of this chapter is the second hypothesis, that of Johnson
(1997), that errors in entropy are to blame for what he refers to as the ‘general coldness of
climate models’. Boer et al. (1992) compared the current climates predicted by 14 atmospheric
GCMs with observations. All 14 suffered a common deficiency; not only were the simulated
climates too cold in a global average, but all showed remarkably similar zonal-mean distribu-
tions of the cold bias, leading to classification of the problem as ‘systematic’ and ‘tenacious’.
By far the largest biases (up to 20 K) were in the polar upper troposphere and lower strato-
sphere (UTLS) region, hence this is often referred to as the cold pole problem. Smaller, but
equally systematic, biases occurred in the ‘nonpolar’ lower troposphere. The same systematic
bias was seen in the AMIP intercomparison (Gates et al. (1999)), although the magnitude was
slightly reduced. It was noted that ‘the cause of this error has remained elusive’. The GRIPS
intercomparison of middle atmosphere climate models (Pawson et al. (2000)) recorded a cold
pole bias in the range 3 to 15 K in almost all of the models. This bias is still the largest and
most systematic error in the numerical simulations of the current climate.
The theory put forward by Johnson (1997) to explain this is as follows. Climate models all
contain spurious, aphysical sources of entropy from processes such as excessive numerical dis-
persion/diffusion and inadequacies of parameterizations. These models are tuned to simulate
a climate state without drift, so the total entropy of the system must be constant in a long time
average. In order to maintain this driftless state there must then be a corresponding entropy
sink to offset the spurious sources, and the cold bias develops to provide for this sink. The ba-
sic mechanism is clear from equation (5.1); more entropy is lost to space if the same amount of
energy is radiated at a lower temperature. Johnson used entropy balance arguments to explain
Entropy Production in the IGCM 136
the distribution of the bias, and estimated that it would take an error of just 4% in the entropy
source (or 2 mWm−2K−1, Goody (2000)) to give a cold bias of 10 K. Johnson’s theory is far
from being unanimously accepted, partly since all the evidence is somewhat circumstantial,
but it has attracted widespread interest and has further motivated investigation of the entropy
budgets of both real and modelled atmospheres.
Explicit numerical diffusion is cited in the theory as being one of the major spurious en-
tropy sources, since any mixing of air parcels of differing θ will increase the entropy of the sys-
tem. Implicit dispersion from numerical transport schemes is also a spurious entropy source,
as shown by Egger (1999). The magnitude of these two sources in atmospheric simulations has
never been studied in depth. In the dry, adiabatic life cycle studied in the previous chapter these
are the only two ways that entropy can be changed, and they must therefore be responsible for
the systematic increase of global entropy predicted by the IGCM. This idealised simulation of
typical mid-latitude synoptic dynamics presents a good opportunity to quantify and study the
entropy production by these processes in isolation, i.e. in the absence of physical parameterisa-
tions such as heating, and the large real entropy sources and sinks they involve. In the following
sections the entropy sources in the IGCM LC1 life cycle are studied in detail, identifying their
locations and causes, and investigating their sensitivity to changes of model resolution and dif-
fusivity. Key questions are: is the size of the sources in agreement with Johnson’s theory, and
what physical processes, and hence real entropy sources, are being represented by the model
diffusion?
5.2 Entropy Production in the LC1 Cycle
5.2.1 Magnitude and Sensitivity of the Source
Figure 4.20 of the previous chapter shows a change in global entropy during the IGCM LC1
cycle of roughly 600 mWm−2K−1. However, as noted in Section 4.11, and described in Black-
burn (1983), there is a slight drift in the total energy during the life cycle since kinetic energy
lost by diffusion is not put back into the model as heat. This constitutes a flux of energy, and
therefore entropy, out of the system. The entropy generation during the life cycle occurs by
the implicit or explicit mixing of air masses of different potential temperatures, and is in part
masked by the energy drift. This should be taken into account in evaluating the entropy genera-
tion; after all, in a complete climate model simulation there would hopefully be negligible drift
Entropy Production in the IGCM 137
0 1 2 3 4 5 6 7 8 90
1
2
3
4
x 10−3
Rat
e of
Ent
ropy
Cha
nge
(WK
−1m
−2)
Timescale (days−1)
0 5 10 150
500
1000
1500
2000
2500
3000
Day
Cum
ulat
ive
Ent
ropy
Cha
nge
(JK
−1m
−2) Effect of varying diffusivity (decay time on shortest wavelength)
3 Hours4 Hours6 Hours12 Hours1 Day2 DaysNo Diffusion
Figure 5.1: Entropy production in the IGCM LC1 cycle at T42 resolution as a function ofdiffusivity. The upper panel shows the total entropy change, corrected for the energy drift asdescribed in the text. The lower panel shows the rate of production averaged over the final 10days of the life cycle. Here the dotted line is uncorrected, the dashed line the correction and thesolid line the corrected total. Errorbars on the total are calculated using extremal temperaturesin the correction term.
in the total energy of the system. Here we first calculate the total entropy as the mass integral
of cp ln θ, and then correct for the energy drift by adding ∆ETa
, simply an estimate of the entropy
lost in the energy drift using a representative average temperature of the atmosphere Ta.
The results are shown in Figure 5.1 for various LC1 simulations which differ only in the
strength of the IGCM’s hyper-diffusion. The top panel shows the total entropy change after
correction for the energy drift using a value of 250 K for Ta. The lower panel plots the average
rate of entropy generation over the last 10 days of the cycle as a function of diffusivity. Here the
effect of the correction is evident: the dotted line is the uncorrected rate, the dashed line is that
due to the correction term and the solid line is the corrected total rate of entropy production.
Entropy Production in the IGCM 138
The errorbars on the total indicate the effect of varying Ta in the correction term; the lower limit
results from choosing 280 K and the upper from choosing 200 K. In this life cycle most of the
atmosphere, at least up to 100 mb, lies between these two extremes, and 250 K is a suitable
mean value. The figure shows that the choice of Ta has relatively little effect on the total rate
of generation.
The intention of Figure 5.1 is to show the effect on the entropy generation of varying the
diffusivity. The IGCM, in common with almost all numerical models, contains diffusion terms
in the discrete equations which have no analogues in the continuous equations, and are there
to perform several duties, for example to soak up dispersion errors and Gibbs oscillations,
and as an arguably crude representation of the sub-gridscale flow. In these experiments a
hyper-diffusion term K∇6 acts on each of the prognostic variables vorticity ζ , divergence δ
and temperature T . The hyper-diffusion on T is the only explicit process in these experiments
which mixes air across isentropic surfaces and hence generates entropy. The constantK defines
the strength of the diffusion which is referred to here, following MacVean (1983), by the e-
folding decay time of features at the shortest wavelength (here that of wavenumber 42).
There is little change in the entropy in the period before day 5, during the exponential
growth stage of the life cycle when the flow is unrealistically zonal in nature. After day 5
the highest rate of entropy production is seen, coinciding with the low level saturation stage
described in Section 4.1. Of central importance here are the collapse of the surface fronts and
the occlusion-like decay of the warm sector (Section 4.3). The hyper-diffusion is acting in both
cases to reduce temperature gradients, effectively mixing air of different potential temperatures
and hence creating entropy. The life cycle was also run with no hyper-diffusion in the model,
resulting in little entropy production as shown by the dashed line in the upper panel, and the
zero entry in the lower. Considerable noise is evident in the fields predicted in this run, and this
is even visible in the global entropy sum. Structures at the gridscale become unrealistic as the
cascade of potential enstrophy to small scales is artificially truncated; avoiding this is, after all,
one of the main aims of the hyper-diffusion. That entropy production is significantly reduced
in this run serves to illustrate that the diffusion scheme is responsible for most of the entropy
source.
The key result of this experiment is that varying the diffusivity has relatively little effect
on the entropy source. This holds even when it is varied over a range as large as that used
here, which certainly includes most values used in practise by models (4-6 hours is common
for models at this resolution). The time averaged rate of generation in the lower panel clearly
Entropy Production in the IGCM 139
0 5 10 150
2
4
6
8
10
12x 10
5 Effect of varying horizontal resolution
EK
E (
Jm−2
)
T42T63T106T213
0 5 10 150
500
1000
1500
2000
2500
3000
Day
Cum
ulat
ive
Ent
ropy
Cha
nge
(JK
−1m
−2)
T42T63T106T213
Figure 5.2: The effect of varying the horizontal resolution. The upper panel shows the EddyKinetic Energy evolution and indicates that the synoptic evolution is significantly different afterabout day 8. The total corrected entropy change is shown in the lower panel.
bears testament to this, changing by less than 1 mWm−2K−1, i.e. under a third of its value,
when the diffusivity is increased by a factor of 16. Furthermore, the upper panel shows that the
rate of production is almost identical for all diffusivities until after about day 6, i.e. after the
fronts have collapsed to their smallest scale (e.g. Figure 4.2). The entropy production clearly
does not asymptote to the case of no diffusion as the diffusivity is reduced.
In a similar experiment the horizontal resolution was varied while keeping the diffusivity
fixed, with a decay timescale of four hours on features of the shortest resolved scale. The
results are shown in Figure 5.2. Changing the horizontal resolution has a large effect on the
synoptic evolution. At higher resolutions a secondary wave development grows from the debris
of the first, which is still baroclinically unstable (as described by MacVean (1983)). The EKE
evolution in the upper panel indicates that the four life cycles are very similar until about day 8
when this secondary wave develops. This synoptic difference means that the entropy changes
Entropy Production in the IGCM 140
shown in the lower panel should not be compared after this time. Again, the rate of production
during frontal collapse is remarkably similar for all four runs. The only difference before day
8 is in the timing of the large entropy source corresponding to the low level saturation, which
begins slightly earlier and lasts longer at lower resolution. As a result, more entropy has been
generated by the end of this phase, but again the difference is surprisingly small. For example,
the entropy source during the first wave is reduced only slightly by doubling the resolution
from T106 to T213.
The results so far serve to motivate further investigation. The rate of entropy generation
is certainly of a similar size to the 2 mWm−2K−1 predicted by Johnson to yield a 10 K cold
bias. Furthermore, this source is shown to be relatively insensitive to the choice of horizontal
resolution or diffusivity. If Johnson’s theory is correct and explicit diffusion is a spurious
entropy source, then the problem cannot be solved by going to higher resolution or weaker
diffusion. The entropy generation is therefore investigated further, firstly demonstrating two
points which have so far only been suggested: that the entropy production is attributable in the
greater part to the hyper-diffusion, and that this largely occurs at the surface.
5.2.2 Location and Attribution of the Source
The IGCM timestep is split into an adiabatic step and a diabatic one. Here the diabatic step
consists only of the hyper-diffusion term, and hence simulations such as this are often referred
to as adiabatic, though this is obviously not strictly correct. The entropy changes shown above
were diagnosed by a simple global mass integral of entropy s = cp ln θ, with θ in turn diagnosed
from the model variables T , ps and σ = p/ps. This gives the total entropy change but does
not distinguish between the two anticipated sources: the hyper-diffusion and the numerical
transport errors. Now the entropy source due to the diffusion is calculated using a different
technique and the results compared. In this technique the temperature tendencies from the
diabatic step are used to calculate the heat addition, and thus the entropy source due to the
diffusion only. This source is given as a rate of generation sdi by the global mass integral ofcp∆T diT∆t
, where ∆T di is the temperature change over a diabatic step of time ∆t.
The results are shown in Figure 5.3. Here the red line is sdi and the black lines are the
total s given by differentiating the corrected entropy change of Figure 5.1 in time. The close
similarity between the two measures supports the use of the simple energy drift correction in
s, since this correction is not needed in calculating sdi from local heat additions.
Entropy Production in the IGCM 141
0 5 10 15−2
0
2
4
6
8
10x 10
−3
Day
Rat
e of
cha
nge
of e
ntro
py (
Wm
−2K
−1)
Figure 5.3: Total corrected rate of entropy production s in black. The solid line correspondsto a temperature of 250 K for the energy drift correction, and the dashed lines to the extremaltemperatures of 200 K and 280 K. The source from diffusion only (sdi) is shown in red; thisterm does not need correction for the energy drift. The data was diagnosed from an LC1 cycleat T42 horizontal resolution with 4 hour∇6 hyper-diffusion.
It is clear that the diffusion is responsible for the vast majority of the entropy source up
until about day 7, i.e. until after the frontal collapse and much of the low level saturation event
is complete. The diffusion certainly plays a large part in the decay of the surface warm sector
by mixing air of different temperatures. A simple estimate using Figure 4.2 puts the scale
of the warm sector at roughly twice the smallest resolvable scale and, bearing in mind the
strength of gradients here, this means that the diffusion will have a large effect. In addition to
this, Methven (1996) showed that it is the diffusion on temperature that limits the scale of the
surface fronts, doing so just above the smallest resolvable scale.
Though the diffusive contribution dominates the entropy source during the low level satu-
ration, it is interesting that there are other periods when it does not, namely around days 4 and
8 to 9. Numerical transport errors are presumably responsible for the entropy source at these
times, and the effect is not negligible. It looks, for example, as if the second, slightly smaller
peak in total s (around days 8 and 9) is attributable to numerical transport errors rather than
explicit diffusion. Near the end of the cycle there are periods where sdi exceeds s, implying
a sink of entropy elsewhere. One possible explanation is that Gibbs oscillations occurring as
Entropy Production in the IGCM 142
Figure 5.4: A snapshot on day 7.5 of the entropy source sdi due to explicit numerical diffusion,in Wm−2K−1 on the lowest model level of the IGCM. The solid contours show temperatureevery 5 K down from 295 K.
a side-effect of the spectral technique could constitute a small spurious sink of entropy, acting
to add a small amount of order over the equivalent continuous field, and so to increase the
variance of θ.
In general though, Figure 5.3 shows that the diffusion is the dominant source of entropy
in the LC1 cycle. Since sdi is determined locally, the distribution of the source throughout
the domain can be analysed. Figure 5.4 shows a snapshot of the horizontal distribution of sdi
on the lowest model level. The effect of the diffusion is seen everywhere in this picture, but
the largest sources are in the vicinity of the fronts and the warm sector. The diffusion acts to
reduce temperature gradients by heating cool regions and cooling hot regions, resulting in local
entropy sources and sinks, respectively. For example, there is an entropy sink along the warm
sector with sources either side, reflecting the contribution of the diffusion to the occlusion-like
decay of the sector. The diffusion essentially transports heat, and thus entropy, from one region
to another, effectively moving mass across isentropic surfaces. Adjacent sources and sinks
Entropy Production in the IGCM 143
−8 −6 −4 −2 0 2 4 6
10 20 30 40 50 60 70 80
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
Latitude
σ=p/
p s
(∂ s / ∂ t)diffusion
on day 6.5 (Wm−2K−1 per unit ∆σ)
0 5000 10000
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
Entropy change (Jm−2K−1 per unit ∆σ)
Horizontally integrated source
Figure 5.5: The left panel shows the source sdi in a meridional section through the warm sectorat day 6.5. The solid contours are isentropic surfaces drawn every 5 K. The right hand panelshows the entropy change due to diffusion as a vertical profile, i.e. sdi integrated over the whole15 day life cycle and horizontally over the globe for each model level.
could be viewed as fluxes of entropy from one region to another. Note that the sources shown
in Figure 5.4 are in Wm−2K−1 so that the local sources and sinks are three orders of magnitude
larger than the global averages shown in the previous section. The global entropy drift is thus
seen as the small residual left after many cancellations between much larger individual sources
and sinks.
A similar picture is seen in the vertical, an example snapshot of which is shown in Fig-
ure 5.5. As expected, the largest entropy fluxes are at the surface since this is where the
strongest temperature gradients occur. These fluxes do, however, extend a significant way
up from the surface, and there is also evidence of weaker fluxes higher up, indicating the effect
of diffusion at the tropopause level, for example at tropopause folds and upper level fronts.
This picture is just a snapshot; the overall effect of the diffusion is seen in the horizontally and
time integrated source shown as a vertical profile on the right. Diffusion at the tropopause level
is seen to be only a small contribution to the total entropy change. Surface effects dominate,
but these extend higher in the atmosphere than might be expected, with approximately a third
of the entropy produced outside of the lowest two levels, i.e. above p/ps = 0.84. This implies
that a significant amount of entropy is produced outside the region below about 1 km, which is
Entropy Production in the IGCM 144
−6 −4 −2 0 2
x 105
240
260
280
300
320
340
360
380
400
θ (K
)
∆ s (JK−1m−2)
−5000 0 5000240
260
280
300
320
340
360
380
400
∆ sdiff
(JK−1m−2)
Figure 5.6: Horizontally and time integrated vertical profiles of s and sdi as in the right panelof Figure 5.5, but shown here as functions of θ. The left panel is thus the total entropy changeand the right panel is the entropy change due to the hyper-diffusion.
generally represented by a boundary layer scheme in comprehensive models. Numerical dif-
fusion would, therefore, be a significant source of entropy, even if the surface processes were
modelled perfectly by such a scheme.
Figure 5.5 shows an overall source which is positive at all normalised pressure levels.
The entropy source is due to effective mixing of air across isentropic surfaces, and so looks
significantly different when viewed in isentropic coordinates. Figure 5.6 shows the vertical
profiles of the entropy sources as a function of θ. The calculation of these inevitably involved
some interpolation from the model’s normalised pressure levels. This was performed as in
the isentropic PV analysis of Section 4.12 by dividing the atmosphere into a set of ‘isentropic
layers’, or regions between two isentropic surfaces, roughly coincident with the vertical levels
of the isentropic model. The entropy within each isentropic level was evaluated by calculating
the pressure at each of the bounding θ values, then using these pressures as bounding values
in a mass integral. The pressure at a given θ bound was calculated by interpolating from the
model levels assuming a linear variation of ln θ with ln p. Some error is to be expected from the
interpolation procedure. To show that this is small, the sum was repeated several times with the
isentropic levels shifted by 1 K each time (the level spacing is 6 K for most of the troposphere)
and each of the resulting profiles is plotted in a different colour in Figure 5.6. The general
structure is the same for all the profiles, which is taken to imply that this structure is robust.
Entropy Production in the IGCM 145
The right hand panel shows that the diffusion results in a local entropy sink in the isentropic
levels around 300 K and a source of similar size in levels below. This simply corresponds to
a flow of mass downwards in isentropic space. The diffusion is limiting surface temperature
gradients by transporting mass into isentropic layers whose thickness is tending to become very
small. The figure shows that the net transport is downwards everywhere, from levels above
about 290 K to those below. These lower levels are those which are involved in the surface
fronts (e.g. Figure 4.15), so it is not surprising that mass is transported into these layers. It is
interesting, though, that all this mass comes from the levels above rather than those below.
The total entropy change is shown on the left and here some other entropy fluxes are
visible, for example from the region around 270 K to its surroundings. An important point to
note is that the scales of the two horizontal axes are different; local sources and sinks in the
total entropy are two orders of magnitude larger than can be explained by the action of the
diffusion. Despite this, Figure 5.3 reminds us that the overall entropy source, given as a small
residual between these local sources and sinks, is largely attributable to the diffusion. This
can be understood as follows. Numerical errors in temperature advection, for example, will
effectively move air parcels across isentropic surfaces. If an air parcel moves to a higher θ the
entropy of the whole system increases slightly, while if it moves to a lower θ the total entropy
decreases. If there is no preferred direction for the temperature error then many such local
sources and sinks will cancel with little effect on the total entropy. The diffusion, on the other
hand, has a systematic effect of heating cool air and cooling warm air. This systematic mixing
across isentropic surfaces acts to bring the temperature of neighbouring parcels closer together,
and is always a positive source of entropy.
Note that the residual effect of the numerical transport errors is also positive, shown as
the difference between red and black lines in Figure 5.3, and is doubtless due to the implicit
entropy generation by numerical schemes, as studied by Egger (1999).
These large local sources and sinks show that the dynamics of the IGCM clearly violate the
principle of Lagrangian conservation of entropy to a greater extent than indicated by the overall
entropy changes. This is in agreement with the error analyses of Johnson et al. (2000, 2002),
who demonstrated large errors in the Lagrangian conservation of moist potential temperature
under moist adiabatic conditions in the NCAR Community Climate Models.
Entropy Production in the IGCM 146
−100 −80 −60 −40 −20 0 20200
250
300
350
400
450
500
Cross−isentrope mass transport (kg m−2)
θ (K
)
Figure 5.7: Total cross-isentrope mass transport (positive upwards) over the 15 day LC1 cycle.First the mass change of each isentropic level was calculated by interpolating from model levelsas in Section 4.12 and Figure 5.6, and then this was integrated down from the top of the domainto any given isentropic surface to give the mass transport across that surface.
5.2.3 Cross-Isentrope Mass Flow
In a perfectly adiabatic simulation there should be no transport across isentropic surfaces. In
the IGCM LC1 cycle, however, there is; to quantify this, Figure 5.7 shows the vertical profile
of net cross-isentrope mass flow accumulated over the 15 day life cycle. The largest mass flux
is of almost 100 kg m−2 through an isentropic surface just under 300 K, an amount equal to
roughly 1% of the total column mass of the atmosphere.
Thuburn (1993) advocated the use of accumulated cross-isentrope mass flow as a diagnos-
tic of the diabatic circulation showing, for example, localised fluxes across the 358 K surface
of 1-4 kg m−2day−1 associated with regions of tropical convection. From Figure 5.7 the global
average mass flux at this level of roughly 10 kg m−2 represents a flux of the same order (given
that the active part of the cycle lasts roughly 10 days). The effects of diffusion and numerical
transport errors in the IGCM are therefore significant. This serves to illustrate one of the major
advantages of the isentropic coordinate. The diabatic motion is generally of great importance
to the behaviour of the atmosphere but is much smaller than the adiabatic component, and so
is easily contaminated by small transport errors in the adiabatic flow (e.g. in vertical advection
terms in non-isentropic models).
Another example is in the field of stratosphere-troposphere exchange, which is of par-
Entropy Production in the IGCM 147
ticular interest for the transport of trace chemical species between the two regions. Schoeberl
(2004), following Appenzeller et al. (1996), estimated the net mass flux across the extratropical
tropopause by considering the mass of the lowermost stratosphere, i.e. the region bounded by
the tropopause and the 380 K surface. Subtracting the mass flux across the 380 K surface from
the rate of change of mass of the lowermost stratosphere gave a downwards flux across the
Northern Hemisphere tropopause of roughly 1010 kg s−1 in two assimilated data sets. Analysis
of the output from a GCM gave a slightly lower value of 0.7 × 1010 kg s−1. Figure 5.7 shows
a flux of order 10 kg m−2 across isentropic surfaces near 380 K over the life cycle. When as-
sumed to occur over 10 days, and multiplying by the surface area of the Northern Hemisphere
extratropics (poleward of 30) this represents a flux of 1.7× 109 kg s−1. This is clearly a crude
estimate of the extratropical flux but, since most of the dynamical activity in the life cycle
occurs in the extratropics, it is more likely to be an underestimate than an overestimate. The
diabatic mass flux due to diffusion and numerical errors in the tropopause region of the IGCM
is thus seen to be roughly one sixth of the observed flux, and so counts as a significant term.
This is unlikely to be realistic. It is also of a similar size to the discrepancy between Schoeberl’s
(2004) estimates from GCM and assimilated data. It should be noted that the life cycle is a very
idealised simulation; this should not therefore be taken to suggest that this error is the cause of
the discrepancy, but simply that errors of this size are to be expected in non-isentropic models.
It seems likely that an unrealistically weak Brewer-Dobson circulation (see Section 5.4) could
also be contributing.
By far the strongest dynamical feature in the tropopause region during the life cycle is the
folding of the tropopause, so it would be reasonable to assume that a significant amount of
the cross-isentrope mass flux is associated with this feature. This has not been proven here,
but if true would have interesting consequences. The stratosphere-troposphere exchange due
to tropopause folding is primarily downwards, i.e. from the stratosphere to the troposphere
(e.g. Holton et al. (1995)). In the life cycle, however, the diffusion and numerical errors have
had mixed effects, acting to transport mass upwards across isentropic surfaces above 350 K
and downwards in the region just below. This is clearly not a rigorous or robust result, merely
an indication that further investigation might be worthwhile. In general, it is believed that too
much mass is transported downwards in stratosphere to troposphere transport events simulated
by GCMs due to excessive numerical diffusion (Stohl et al. (2003a)). Models therefore overes-
timate the ozone flux from the stratosphere to the troposphere and, furthermore, stratospheric
ozone reaches the surface more often than expected, perhaps indicating that there is too much
Entropy Production in the IGCM 148
mixing across isentropic surfaces.
Finally, it should be noted that the cross-isentrope flow due to diffusion and numerical
errors are presumably rather shallow vertical motions in which a parcel’s potential temperature
does not change by a large amount. In the stratosphere-troposphere exchange problem it is the
deep exchange events which bring air parcels of radically different chemical compositions into
contact, and are thus of greatest importance (Stohl et al. (2003b)).
5.3 The Entropy Theory for the Cold Bias
Are the entropy sources shown in these experiments consistent with the coldness theory of
Johnson (1997) introduced in Section 5.1.3; i.e. is there any evidence here to support or con-
tradict the theory? We begin with a brief summary of the arguments made by Johnson.
5.3.1 Summary of Johnson (1997)
Models do contain spurious, aphysical sources of entropy. In his Appendix B, Johnson demon-
strates positive definite aphysical entropy production from numerical diffusion/dispersion since
any ‘mixing of energy’, be it implicit or explicit, will increase the total entropy of the system.
In his Section 9 he asserts that the primary source stems from the action of this numerical mix-
ing during the long-range transport of energy, both dry and moist, and of potential temperature
θ. As described by Egger (1999), the total entropy S of a discrete system of grid cells i, each
of volume δV , is given by
S =∑
i
(cp ρ δV ln θi)
= cp ρ δV ln
(∏
i
θi
), (5.5)
where here the density ρ is assumed constant for simplicity. Hence, in a discrete system the
entropy is essentially the product of all gridpoint values of θ, and is thus a moment of high
order. Inevitably the variance of θ will decrease as a result of the implicit or explicit numerical
mixing, and the total entropy given by (5.5) will increase. (Note that while most numerical
schemes will mix unrealistically, some can un-mix, for example the ‘VL2’ scheme studied by
Thuburn and McIntyre (1997), though this is obviously unrealistic as well.)
Johnson defines the following entropy sources: s is the total entropy source in a model,
Entropy Production in the IGCM 149
and se
is the true source which the model is aiming to represent. (He uses a wavy underbar
wherever a variable applies to the true atmospheric state.) The spurious positive definite source
is labelled sa. Departures of heat addition ∆Q, and of temperature ∆T , from the true climate
state are defined by
Q = Qe
+ ∆Q and T = Te
+ ∆T, (5.6)
where Q and T represent the model’s heat flux and temperature, respectively. Departure from
the true entropy source is expected in the model as a result of these heat and temperature
departures; this entropy source is labelled ∆s (∆Q,∆T ). Johnson then writes the relation
between the entropy sources as
〈s〉 =⟨se
⟩+ 〈∆s (∆Q,∆T )〉+ 〈sa〉 , (5.7)
where 〈〉 indicates a temporal and areal mass-weighted average value over an incremental isen-
tropic layer. This is his equation (23). If both real and modelled climate states are assumed to
be without drift then
〈∆s (∆Q,∆T )〉 = −〈sa〉 < 0, (5.8)
i.e. a departure entropy sink develops to counter the spurious source. The theory is that this
happens through a departure ∆T of the model temperature field from that of the real climate.
Note that in any case the presence of an aphysical source sa precludes the unbiased representa-
tion of the true state, since there must be a departure in either the heat addition or temperature
fields to allow for ∆s.
By considering the dry entropy balance, Johnson derives his equation (38):
〈sa〉 ∼=1⟨Te
⟩
〈s?(∆T )?〉 − 〈∆T 〉⟨
Te
⟩⟨s?Te?⟩− 〈∆Q〉+
1⟨Te
⟩⟨
(∆Q)?Te?⟩ (5.9)
showing that the aphysical source can be balanced by a combination of the four terms on the
right. Here ? represents a deviation from the average, i.e.
f ? = f − 〈f〉 . (5.10)
Note that some assumptions have been made in this derivation, namely that ∆T is small com-
pared to 〈T 〉, and that the locations of the spurious source and the temperature departure are
Entropy Production in the IGCM 150
uncorrelated, so that the covariance⟨s?a
(∆TTe
)?⟩is negligible. Johnson then assumes that
‘the emphasis given by climate modelers to replicating the atmospheric energy balance leads
to successfully modeled estimates of heat addition’, and sets ∆Q to zero. This leaves only the
first two terms on the right-hand side of (5.9) to balance the aphysical source.
Consider the second term. The circulation of the atmosphere is maintained by radiative
heating where it is hot and cooling where it is cool, so the covariance⟨s?Te?⟩
must be positive.
For this term to balance the positive aphysical source therefore requires 〈∆T 〉 to be negative,
i.e. there must be a mean cold bias of this infinitesimal isentropic layer. Similarly, for the first
term to offset the aphysical source requires a particular distribution of the cold bias; that is
coldest where the net entropy flux is out of the layer, and less cold where it is inwards.
The cold bias shown by Boer et al. (1992) has a clear distribution, with the severest bias
in the extratropics at high altitude and a weaker, but still systematic, cold bias in the tropical
troposphere. Johnson notes the striking similarity between these regions and that bounded by
the 300 and 325 K isentropic surfaces, which roughly fills the troposphere from the ground up
to 500 mb in the tropics, and sits in the band 300 to 200 mb in the polar regions. He also notes
that this region contains both areas of heating where the atmosphere is already hot and cooling
where it is already cool, and is therefore the region where the covariance in the second term is
likely to be largest. Furthermore, the distribution of the bias is such that it is coldest where the
net entropy flux is to space in the polar high altitudes, and less cold in the tropical troposphere
where the net flux is inwards; the covariance in the first term will therefore be positive. In
incremental isentropic layers within this region, he claims, both terms contribute to offsetting
the spurious, aphysical entropy source, as the model develops a cold bias which becomes more
pronounced towards the poles. Note, however, that only the second term on the right of (5.9)
was considered in Johnson’s estimate of a 4% error in the entropy source being required to
cause a 10 K mean cold bias in an isentropic layer.
5.3.2 Analysis of the IGCM Results
The global entropy source shown in Section 5.2.1 is of a similar size to Johnson’s prediction.
Section 5.2.2 shows that the entropy is produced largely at the surface in the extratropics in
connection with the low-level evolution of the wave. The source, therefore, is outside the 300-
325 K region identified by Johnson, and so would not appear directly in the entropy balance
equation (5.9) for infinitesimal isentropic layers in this region. This, however, does not mean
Entropy Production in the IGCM 151
that the theory does not apply (Johnson (2004), personal communication). If a model is able
to develop additional entropy sinks to balance aphysical sources, it is clearly able to support
additional fluxes of entropy between different isentropic levels. (After all, the results of Sec-
tion 5.2.2 show that non-isentropic models do not conserve the entropy between two isentropic
surfaces, even over a short period of time.) It is within the limits of the theory for entropy to
be spuriously produced in one isentropic level and radiated to space in another; this simply
implies a spurious flux of entropy between the two.
As noted in Section 5.2.2, the difference between the red and black lines in Figure 5.3
represents a source of entropy which occurs during the adiabatic timestep of the model. This,
therefore, is without doubt a spurious source. Entropy should be conserved under adiabatic
flow, but will be spuriously increased by numerical schemes as described by Egger (1999).
In the IGCM this would apply, for example, to the nonlinear advection terms calculated in
gridspace. Gregory and West (2002) showed that numerical advection schemes can have a large
diffusive effect when the wind field is oscillating. The vertical wind will oscillate in a model
such as the IGCM during the vertical propagation of planetary waves, as isentropic surfaces
are displaced relative to isobaric surfaces. This could partly explain this systematic entropy
source from the adiabatic step, since this occurs at a similar time as the propagation phase of
the saturation-propagation-saturation picture of the life cycle. Averaged over the ‘active’ latter
10 days of the cycle, the source during the adiabatic step is of the order of 0.4 mWm−2K−1,
which is smaller than Johnson’s estimate.
Including the diabatic step (i.e. the effect of the diffusion), however, the overall entropy
source is of the right size to give a bias of 10 K according to Johnson’s theory. The diffusion
is a source of entropy, but it must now be asked whether it is a spurious source; i.e. what real
processes does the diffusion represent, and how well does this entropy source compare to any
real sources?
The hyper-diffusion in the IGCM is one, very common, type of the scale-selective dissi-
pation which almost all numerical models include for various reasons, for example:
• To clean up small-scale noise introduced by various components of the model, e.g. physical
parameterizations, dispersion and truncation errors introduced by discretization of the
dynamical equations, and Gibbs oscillations introduced in the use of the spectral method.
• To help suppress any nonlinear instabilities.
• To maintain realistic flow features by dissipating potential enstrophy near the gridscale
Entropy Production in the IGCM 152
as it cascades to small scales.
• As a crude representation of the Reynolds stress terms, and hence of the sub-gridscale
flow.
Some of these are now discussed in detail.
Numerical diffusion can be interpreted as a representation of the Reynolds stress terms in
the large-scale equations derived by Reynolds averaging. As described by Kalnay (2003), for
example, variables such as velocity u and a tracer mixing ratio q can be expressed as the sum
of resolved (i.e. cell average) and unresolved components so that, for example,
u = u+ u′, where u′ = 0. (5.11)
By substituting these expansions into the continuous prognostic equations for u and q, alterna-
tive equations are formed which predict u and q, i.e. the large-scale or resolved variables. The
only dependence on the unresolved variables is in nonlinear terms such as − ∂∂x
(ρ u′q′) which
represent transport by the unresolved eddies. Diffusion, and similarly hyper-diffusion, essen-
tially parameterizes these eddy fluxes in terms of the large-scale variables by, for example,
−ρ u′q′ = K∂q
∂x. (5.12)
This is a first order closure for the Reynolds stress terms and is generally considered to be
crude. There is no a priori reason for closing the large-scale equations in this way, and there
can be significant sensitivity to the choice of the eddy diffusivity K (e.g. MacVean (1983),
Stephenson (1994)). It is widely believed that diffusion mixes the model atmosphere more
than the unresolved eddies would. The real atmosphere shows no clear separation of scales
at the resolution limit of models; the necessity to artificially truncate this range of scales and
simultaneously avoid noise at the gridscale inevitably leads to flow features near the gridscale
which are smoother than their counterparts in the real atmosphere.
The stretching out of vortex filaments to ever smaller horizontal scales is a familiar feature
in fluid dynamics. This occurs under the action of the large-scale, adiabatic deformation field as
seen, for example, in the contour advection study of Waugh and Plumb (1994). Charney (1971)
showed that in the atmosphere rotation and stratification apply a ‘geostrophic constraint’ that
acts to inhibit strong vortex stretching, and thus also to inhibit the cascade of energy to small
scales which is seen in three-dimensional turbulence. In the atmosphere it is the potential
Entropy Production in the IGCM 153
enstrophy (Q2), or variance of the PV field, which cascades to small scales rather than the
energy (e.g. Salmon (1998)). In the absence of a dissipation scheme this variance will collect
at the model’s smallest resolvable scale resulting in noisy and unrealistic small-scale features.
This is often referred to as ‘spectral blocking’.
In the real atmosphere quasi-horizontal chaotic advection along isentropic surfaces acts
to stir the air creating thin filaments of varying PV which are not mixed (e.g. Pierrehumbert
and Yang (1993)). Actual mixing, and hence dissipation of potential enstrophy, occurs under
the action of radiative heating, and in intermittent encounters with localised patches of three-
dimensional turbulence (e.g. Haynes and Ward (1993)). So it is, therefore, diabatic processes
which ultimately dissipate the potential enstrophy, and real entropy sources are to be expected.
In the troposphere, three-dimensional turbulence is encountered in the atmospheric boundary
layer, in connection with convective events or in strong frontal zones where Kelvin-Helmholtz
instability is expected. As shown by Figure 5.5, most of the entropy source from diffusion
occurs near the surface in response to strong horizontal temperature gradients. It could then
be argued that the diffusion gives a good representation of these processes, which act to mix
across isentropic surfaces, and so reduce temperature gradients at or near the surface.
In the IGCM LC1 cycle the scale limitation of the surface fronts described by Methven
(1996) clearly represents a large part of the entropy source from diffusion. The formation of
fronts is driven by the large-scale deformation field (e.g. Hoskins and Bretherton (1972)). Dif-
fusion limits the frontal scale by transporting mass across isentropic surfaces, and Figure 5.2
shows that the entropy source sdi during the frontal collapse is remarkably similar at all reso-
lutions, i.e. regardless of the scale at which the diffusion is operating. Furthermore, Figure 5.1
shows that the entropy source is also largely independent of the eddy diffusivity.
This is in agreement with the findings of Roberts and Marshall (1998), who studied spuri-
ous potential temperature tendencies resulting from diffusion on temperature in ocean models.
They found that the diapycnal velocities(DθDt/∂θ∂z
)due to diffusion on temperature were inti-
mately related to the dissipation of vorticity gradients in the cascade of potential enstrophy to
the gridscale. In numerical experiments, the spurious diapycnal transfers were relatively in-
sensitive to changes in the model resolution, indicating that the same diapycnal transfer was
needed to dissipate the vorticity gradients no matter which scale it occurred on. Compare the
evolution of the low-level temperature field in the IGCM at T42 and T341 resolutions (Fig-
ures 4.2 and 4.4). In both cases a warm sector extends towards the pole, collapses to near the
gridscale and is eroded by the hyper-diffusion. The only difference between the two is the scale
Entropy Production in the IGCM 154
at which this occurs.
The argument of Roberts and Marshall also holds for atmosphere models. As in the orig-
inal argument, thermal wind balance must be assumed, expressed here on pressure surfaces
by
− fRp∂v
∂p=
(p
p0
)κ∂θ
∂x,
f
Rp∂u
∂p=
(p
p0
)κ∂θ
∂y, (5.13)
where R is the gas constant and f the Coriolis parameter (assumed constant). A ∇4 hyper-
diffusion acting on temperature on constant pressure model levels can then be expressed as
∇4 T = ∇4
(θ
(p
p0
)κ)
= ∇2
(f
Rp
(− ∂2v
∂p ∂x+
∂2u
∂p ∂y
))using (5.13)
= − fRp∂
∂p
(∇2ξg
). (5.14)
Hyper-diffusion on temperature in pressure coordinate models is therefore seen to dissipate
gradients of geostrophic vorticity ξg.
The following picture thus emerges. Entropy is produced by the limitation of frontal tem-
perature gradients by diffusion in the IGCM. The size of this source is determined not by the
details of the scale-limiting process, however, but by the rate of frontal collapse, itself deter-
mined by the deformation field of the large-scale flow. It is to be hoped that the large-scale flow
will be represented well, so that the rate of collapse will be very similar in both real and mod-
elled atmospheres. This suggests that, on sufficiently long timescales, the entropy source will
also be very similar, even though in the real atmosphere it is turbulent and radiative processes
which act to limit the frontal scale.
It is proposed, therefore, that the entropy source due to the action of diffusion to limit the
frontal gradients is a good representation of the real entropy source involved in limiting such
gradients in the continuous atmosphere. The IGCM has no mechanism other than diffusion or
spectral blocking to limit the gradient. What, then, is the part played by the diffusion in the
entropy budget of a full GCM?
Goody (2000) analysed the entropy budget of the GISS GCM and compared it to his
analytical estimations of the entropy sources of the atmosphere. The numerical scale-selective
dissipation of this model is in the form of a binomial filter which was found to dissipate energy
at an average rate of 0.684 Wm−2. Goody combined this with the dry convection term since,
Entropy Production in the IGCM 155
though it has no direct physical meaning, it ‘most resembles the action of a turbulent viscosity’.
A temperature of 260 K was assumed for the dry convection, and using the same value gives an
entropy source of 2.6 mWm−2K−1 for the action of the filter. This is in close agreement with
the source from diffusion in the IGCM which is shown in Section 5.2.1. The closeness of this
agreement may, of course, be a coincidence, but the size of the source does indicate that the
diffusion still plays a large part in dissipating enstrophy during the cascade, even in this, more
comprehensive model.
Goody quantifies the GISS entropy sources from parameterized dissipative processes such
as dry turbulence and boundary layer stresses, which would be expected to play a part in limit-
ing frontal gradients. The total dissipative source compares well with his theoretical estimate,
but given the uncertainties emphasised by Goody in the atmospheric budget, it cannot be in-
ferred from this that these sources in the model are correct. It is surprising that the source due
to the binomial filter is so large even with radiative and convective parameterizations acting in
the model. Here, however, it is hypothesised that the overall source due to truncation of the
potential enstrophy cascade, by a combination of parameterizations and the binomial filter, is
realistic. If potential enstrophy is dissipated by the filter then there will be weaker temperature
gradients and presumably less mixing will be done by the parameterizations. In the extreme
case, if the fluid is well mixed it simply cannot be mixed any more.
Hence it is argued here that, even in a full GCM, the action of diffusion to limit frontal
gradients is not a spurious source of entropy, since the entropy generation is determined by the
deformation field of the large-scale flow. This is not to say that there is no spurious source due
to diffusion. Here we have focussed on the surface fronts; Figure 5.5 shows that while these
have a dominant contribution, the diffusion does also act to increase entropy elsewhere. At the
surface fronts significant sub-gridscale features would be expected, and so this crude closure
of the large-scale equations is perhaps acceptable. It could well be less so, however, in other
regions where the sub-gridscale ‘eddies’ are expected to be weaker, for example in the vicinity
of stratosphere-troposphere exchange events where the diffusion is generally considered to do
too much mixing (Section 5.2.3). As emphasised by Salmon (1998), one of the fundamental
concerns over this closure is that there is no dependence of the eddy diffusivity on the nature
of the flow.
As another example, consider the blocking anticyclone shown in Figure 11 of Hoskins
et al. (1985), which is reproduced in Figure 5.8. A large pool of low PV air from the subtropics
gets cut off in the high PV air to the north, where it remains for four days, with little change in
Entropy Production in the IGCM 156
Figure 5.8: Reproduction of Figure 11 of Hoskins et al. (1985), showing PV on the 330 Kisentropic surface from analyses for the period 30 September - 7 October 1982. The contourinterval is 1 PVU with the region between 1 and 2 PVU shaded.
magnitude, before being re-absorbed into the low PV region to the south. In a model simulation
of this event scale-selective dissipation was needed to reduce noise; numerical diffusion thus
has some effect on such features. In contrast to features such as the surface fronts, here the
potential enstrophy is not cascading to small scales in this region, so the argument presented
above does not apply. It is not clear in situations such as this that the entropy source will be
independent of the scale, or the rate at which the dissipation acts. It is possible that the entropy
produced by diffusion will be spuriously large in such events. This is, of course, entirely
speculative.
In summary this study has shown that there are spurious, aphysical entropy sources during
the life cycle due to numerical errors in the adiabatic transport terms. It also seems likely that
a part of the entropy generated by diffusion constitutes a spurious source. The dominant part
of this source is that due to the limiting of the surface frontal gradients. It is proposed that
this dominant part is of a realistic size, however, despite the unrealistic nature of the actual
scale-limiting mechanism.
Entropy Production in the IGCM 157
5.4 Alternative Theories
Johnson’s theory is currently not widely accepted. Alternative explanations for the cold bias
have been suggested, and several studies published demonstrating a reduction in the bias. These
are now briefly reviewed. In the original intercomparison of Boer et al. (1992) it was suggested
that the simplest explanation would be a minor error in albedo or emissivity due to, for example,
the treatment of clouds, or of trace gases and aerosols in radiation schemes. Consistent with this
the temperature structure was ‘much more sensitive to changes in physical parameterizations
than to resolution’. Later, Boer (2000) commented that ‘the error is diminishing, but still exists,
in recent models’, although it was unclear which aspects of model development had reduced
the bias; improved resolution and treatment of clouds or radiation were given as possibilities.
Clearly there is much uncertainty in the representation of clouds and cloud-radiation inter-
actions in climate models. Improvements have been achieved by developing parameterizations
of these effects, but never to the extent that the cold bias is fully corrected in either size or dis-
tribution. For example Gu et al. (2003) tested a new cloud-radiation parameterization scheme
in the UCLA GCM which corrected, to some extent, the cold bias in the troposphere of the
original model. The lower stratosphere, however, became even colder under the new scheme,
leading the authors to comment that ‘the general coldness of climate models is not simply a
radiation problem’.
One feature of climate models which is clearly improving is the resolution, both in the
horizontal and the vertical. The effects of this, however, appear to be mixed. Hamilton et al.
(1999) achieved a significant reduction in the cold pole bias by increasing the horizontal res-
olution in the GFDL SKYHI model, which extends with good vertical resolution up into the
mesosphere. They note that the bias reflects an extratropical stratospheric circulation that is
too close to radiative equilibrium. The middle atmosphere is driven away from radiative equi-
librium by the Brewer-Dobson circulation, a global-scale, meridional overturning circulation
with large-scale ascent across the tropopause in the tropics and descent nearer the poles (see
e.g. Holton et al. (1995)). As air parcels are pushed downwards near the poles they are warmed
adiabatically above the radiative temperature. In the eyes of Hamilton et al. (1999) and many
others the cold pole bias is therefore a symptom of an unrealistically weak Brewer-Dobson
circulation in climate models. This circulation is driven by momentum deposited in the extrat-
ropical stratosphere by the breaking of Rossby and gravity waves; this is the downward control
mechanism of Haynes et al. (1991). An unrealistically weak circulation is thus, in turn, a symp-
Entropy Production in the IGCM 158
tom of weak wave fluxes. Increasing the resolution means that shorter waves are resolved and
the wave fluxes are stronger, leading to a stronger Brewer-Dobson circulation and a warmer
polar stratosphere.
At first sight the results of Brankovic and Gregory (2001) appear to contradict this. In
experiments with the ECMWF model an increase in horizontal resolution reduces the cold bias
throughout the troposphere but not in the extratropical UTLS region. A closer look shows
that the two results are, however, consistent. In the SKYHI model used by Hamilton et al.
(1999) there is no parameterization of sub-gridscale gravity wave effects; any such effects must
come from resolved gravity waves, so the wave forcing of the circulation would be expected
to increase with resolution. An increase in temperature with resolution is seen everywhere
above a few hundred millibars, but by far the largest warming (up to 40 K) is in the region
of 10 to 0.1 mb, i.e. the upper stratosphere and lower mesosphere. In the polar UTLS region
originally identified as cold by Boer et al. (1992) (about 200 mb) there is a warming with
increased resolution, but this is slight, especially above N90 (i.e. 90 grid rows between pole and
equator). Brankovic and Gregory (2001) focus on this UTLS region and so the lack of warming
with increased resolution is consistent. Furthermore, the ECMWF model used in their study
contains a gravity wave drag parameterization, and so it is not obvious that a large increase
in wave forcing would result from an increase in resolution. These parameterizations aim to
represent the momentum deposited during the dissipation above the tropopause of unresolved
gravity waves which are triggered almost exclusively in the troposphere, mainly by orography
but also by dynamical events such as convection. An example is the scheme of McFarlane
(1987) who also shows that, while much of the wave dissipation is in the upper stratosphere
and mesosphere, there is an effect on the extratropical UTLS region with temperatures rising
by up to 7 K.
Clearly the strength of the wave driving in the stratosphere has an effect on the polar
temperatures. A common opinion in the modelling community is that the cold bias is due
to the omission or underestimation of some element of gravity wave forcing. Most of the
middle atmosphere climate models in the intercomparison of Pawson et al. (2000) contain some
kind of gravity wave drag parameterization and yet still show significant cold biases. Among
these models two main types of systematic error in 100 mb temperatures were identified, and
different causes suggested for each. Many models were too cold at all latitudes, yet showed a
realistic annual temperature cycle; here an error in radiation was suggested. Some models, on
the other hand, had a realistic annual cycle but were too warm in the Tropics and cold at high
Entropy Production in the IGCM 159
latitudes, suggesting that the residual circulation described above was too weak, i.e. with too
little downwelling at high latitudes and upwelling at low latitudes. This raises an interesting
point: if an unrealistically weak meridional overturning circulation is to blame, why are models
not systematically too warm in the tropical stratosphere due to underestimated upwelling? The
answer to this, however, could be that the descent near the poles is more concentrated than the
ascent in the tropics, which occurs over a larger horizontal area.
Williamson and Olson (1994) noted a reduction in polar tropopause cold bias in changing
from Eulerian to semi-Lagrangian model dynamics. This was confirmed by Chen and Bates
(1996), who achieved a reduction of the polar cold bias by roughly half everywhere above
200 mb, showing this was due to more efficient poleward heat transport in the polar regions.
This, in turn, is likely to be a result of increased wave-driving of the stratospheric circulation.
Finally, Zhu and Schneider (1997) achieved a reduction in the cold pole bias in the third
generation ECHAM GCM by switching from a hybrid σ − p to a σ − θ vertical coordinate.
Warming over the poles in the 100-300 mb region had also been noted by Thuburn (1993) in a
similar experiment. Zhu and Schneider show the temperature error of the original model which
appears to contain two distinct cold biases; one of up to 9 K centred at 200 mb in the polar
region of both hemispheres, and a second centred at 30 mb only in the winter hemisphere (9 K
in the Southern Hemisphere and up to 27 K in the Northern Hemisphere). Both of these biases
are reduced in the σ − θ version, consistent with differences in eddy heat-flux convergence
between the two models. Changes in Eliassen-Palm fluxes are observed in the same regions,
indicating a change in wave forcing of the zonal mean.
Webster et al. (1999) and Schaack et al. (2004) have also achieved reductions in the cold
bias by using hybrid-isentropic coordinates. Webster et al. noted that errors in the vertical
advection of temperature and moisture are reduced in both isentropic models, and in those em-
ploying semi-Lagrangian advection schemes, and suggested that this could be one reason for
the reduced bias in these models. Increased wave-driving contributed significantly to the re-
duction, especially in the winter hemisphere. In the hybrid-isentropic model this was attributed
to two factors. Firstly, model diffusion acts along isentropic surfaces in the stratosphere, and
therefore is expected to be a better representation of the isentropic mixing of PV which occurs
in wave-breaking events in the real atmosphere (Haynes and McIntyre (1987, 1990)). Secondly,
the vertical propagation of planetary waves involves large displacements of isentropic surfaces
from isobaric surfaces, so that with a pressure coordinate there are large vertical mass fluxes,
with associated truncation errors, that do not occur with an isentropic coordinate. Schaack et al.
Entropy Production in the IGCM 160
(2004) comment that the improvements in advection, and in wave-driving, in hybrid isentropic
models ‘are consistent with Johnson’s (1997) theoretical study’, though no further explanation
of this is given.
5.5 A Comment on the MEP Theory
The work presented here was not performed with the Maximum Entropy Production, or MEP,
theory in mind. It is, however, of relevance. According to the MEP theory it is the entropy
produced by poleward heat transport in the climate system which is at a maximum. The size of
this entropy production in the atmosphere can be estimated using a very simple model. Suppose
the atmosphere can be represented as a warm reservoir of temperature T+ (the tropics) and a
cool reservoir at T− (the polar regions), with a poleward heat flux F between the two. Using
(5.1) the total entropy source is then
s =F
T−− F
T+
=T+ − T−T+T−
F > 0. (5.15)
Assuming values of T+ = 295 K, T− = 275 K and F = 5 PW this gives an entropy production
rate of 4 mWm−2K−1. This is obviously a very rough value, but indicates the order of magni-
tude of the source; it is certainly a small number compared to terms in the atmospheric entropy
budget (Peixoto et al. (1991), Goody (2000)).
It is the large-scale circulation which transports heat towards the poles, but this is, in
principle, a reversible process. Entropy is only produced by the mixing of energy, and it is
the small-scale dissipation processes such as turbulence which actually mix the warm tropical
and cool extratropical air masses, and hence generate the entropy to which the MEP theory
applies (Ozawa et al. (2003)). The dry dissipation processes, however, are also precisely those
represented by the diffusion in the IGCM LC1 cycle.
In investigating the MEP theory, Kleidon et al. (2003) used a simple, dry GCM to perform
experiments similar to those described in this chapter. The only physical parameterizations
included were Rayleigh friction to represent the turbulent boundary layer and Newtonian heat-
ing to represent the effect of radiation by relaxing temperatures towards a radiative-convective
equilibrium state. In 10 year simulations the total entropy source was determined by comput-
ing the change in heat content of each grid cell and dividing by the temperature. The size of
Entropy Production in the IGCM 161
the source was seen to be independent of horizontal resolution above T42, having converged
to 2.5 mWm−2K−1. In these experiments the heating, surface friction and numerical diffusion
all contribute to dissipating potential enstrophy and generating entropy, yet the overall source
is remarkably similar to that seen in Figure 5.1 for the source during the LC1 cycle (roughly
3 mWm−2K−1).
It is perhaps a coincidence that these values are so similar, particularly given the short
duration of the IGCM runs. However, here it is argued that this is not the case. The generation
of entropy in both experiments is by the mixing of air masses with different potential temper-
atures, and is a direct consequence of the dissipation of potential enstrophy to halt the cascade
to small scales. The similarity of the two values could then be taken to support the hypothe-
sis that the size of the source is determined not by the details of the dissipation mechanism,
here strikingly different in the two experiments, but by the rate at which the potential enstro-
phy is cascading, itself determined by the characteristics of the large-scale flow. It could be
argued that the large-scale flows are significantly different in the two experiments, one driven
by parameterized heating and the other evolving from an idealised initial condition. The same
dominant dynamics act in each, however; the evolution of synoptic scale eddies in response
to baroclinic instability. It is entirely possible, if not anticipated, that both flows will generate
small scales in PV at the same rate.
This comparison is, of course, entirely circumstantial evidence for the hypothesis that the
entropy source is determined by the large-scale flow. It could, in fact, be considered weak evi-
dence given the idealistic nature of the LC1 cycle, and its limited duration. Longer simulations
in the absence of any heating parameterization would, however, be unrealistic. Clearly further
investigation is warranted.
It is a widely held view that there is no physical basis for the climate system to choose to
be at a state of MEP, and the fact that it appears to be close to this is merely a coincidence.
Here it is suggested that the rate of entropy production is determined by the large-scale flow,
and hence by the degree of baroclinic instability. It is much easier to imagine that there is some
critical level of stability at which the stabilising effect of the baroclinic eddies balances the
destabilising effect of the uneven solar heating. The atmosphere is thus continually adjusting
towards this state, and the entropy source is indirectly determined by this through the strength
of the eddy field, rather than itself determining the nature of the flow.
As an example, consider again the two box model introduced at the start of this section.
In slightly more complex models (e.g. Lorenz et al. (2001), Kleidon and Lorenz (2004)) the
Entropy Production in the IGCM 162
temperatures are allowed to vary, and the heat flux is expressed as a diffusive flux by
F = k(T+ − T−), (5.16)
where k is considered an effective heat diffusivity and is also allowed to vary. The heat flux
F is fixed by energy conservation. The parameter k measures how efficient the atmosphere
is at transporting heat, i.e. how quickly heat is transported for a given temperature gradient
(T+ − T−). The mechanism inducing heat transport is baroclinic instability, so k would be
expected to decrease if the temperature gradient decreases. This system now shows potential
for feedback: if k is increased then heat is transported more efficiently and the equator to pole
temperature gradient is reduced, in turn reducing k. There is therefore a stable state at some
value of k; the system will act to correct any perturbation of k away from this state. MEP
supporters claim that this state is the one in which the entropy production given by (5.15) is
a maximum, though clearly this depends on the exact nature of the dependence of k on the
temperature gradient.
Now suppose the system is at the critical stability state described above, and is perturbed
so it is slightly more unstable. This would correspond to an increase of APE in the continuous
system, or an increased gradient (T+ − T−) in the two box model. Stronger eddies would
form, effectively increasing k and reducing the temperature gradient again by more efficient
heat transport. There is therefore a clear physical argument for the system to seek this critical
stability state. This may correspond to the state of maximum entropy production via APE
arguments (Ozawa et al. (2003)) but if so this is a symptom of the system’s behaviour rather
than the cause.
5.6 Discussion
It should be noted that the experiments presented in this chapter represent very idealised situ-
ations. Any conclusions drawn are valid only to the extent that the flows modelled are repre-
sentative of both real flows and those simulated by more comprehensive models. Furthermore,
several of the ideas presented here are clearly hypotheses rather than proven theories.
In addition it is important to remember that the effect of diffusion on model simulations is
not limited to its characteristics as an entropy source. MacVean (1983) also used the IGCM to
study the effect of varying diffusion on idealised baroclinic waves. The linear growth rate of
Entropy Production in the IGCM 163
the normal modes was found to be very sensitive; without diffusion, the growth rate monoton-
ically increased with wavenumber so that waves of unrealistically high wavenumber (e.g. 20)
would be more unstable than the commonly observed events of wavenumber 6 to 9. Adding
sufficiently scale-selective diffusion had the beneficial effect of damping the high wavenum-
bers while leaving the more realistic, lower wavenumber disturbances unchanged. The details
of the diffusion, therefore, greatly affected the growth of high wavenumber secondary waves
after the main development, resulting in wildly different energetics as seen in Figure 5.2.
Later, Stephenson (1994) studied the sensitivity of simulations by a more realistic GCM to
the horizontal diffusion. Significant effects were observed, for example that ‘weakened diffu-
sion results in weakened equator-pole temperature gradients, weakened baroclinicity and much
weaker barotropic energy conversions’. In some ways, then, the details in representation of the
small-scale processes clearly do profoundly affect the larger scales.
While entropy is generated by hyper-diffusion on temperature in the IGCM LC1 cycle, it
is argued here that the majority of this does not constitute a spurious entropy source in climate
models. The combination of diffusion and physical parameterizations acts to dissipate PV
gradients and to mix energy. The amount of mixing needed is determined by the large-scale
flow and is not overestimated; if air masses of different potential temperature have been mixed
by diffusion then there is simply nothing remaining to be mixed by the parameterizations and
the entropy will not be further increased. If it is true that, in this context at least, the effect of
small-scale processes is determined by the large-scale flow, this provides further justification
for predicting the flow evolution from a discretized equation set, providing perhaps that some
minimum resolution is used so that the large-scale deformation field is accurately simulated.
This is not to say that there is no spurious entropy generated by the diffusion, only that
the dominant contribution from the limiting of surface frontal gradients is not spurious, since
these gradients are limited by diabatic processes in the real atmosphere. Diffusion, however,
also acts on features which are not cascading to small scales in the way that fronts and PV
filaments are. Here the above arguments do not apply; the amount of mixing performed by the
diffusion would be expected to be dependent on the details of the scheme, and is likely to be
an overestimate. Entropy generated in this way would constitute a spurious source.
Entropy is also produced in the life cycle by numerical schemes representing the adiabatic
dynamics. This is considered to be a spurious source, though it is five times smaller than that
predicted by Johnson (1997) to yield a 10 K mean cold bias in the 300-325 K region. This
does not contradict Johnson’s theory. Large spurious entropy sources are expected from the
Entropy Production in the IGCM 164
representation of moist processes (Johnson et al. (2000, 2002)), which are not included here.
If the dry dynamics alone generated entropy at 2 mWm−2K−1 then the total source would be
larger and the predicted bias would significantly exceed that actually seen in climate models.
It is worth noting that the entropy exchanges associated with moist processes are clearly the
most uncertain (Goody (2000)). Any subtlety in whether, for example, the entropy generated
by dry dynamics during a model occlusion event is realistic will be swamped by uncertainty in
the moist effects.
It appears that, in general, the cold bias has reduced since the original study of Boer et al.
(1992), but has not disappeared. It is clear that the representation of gravity waves plays a part;
any underestimation of the gravity wave forcing will result in a weakened Brewer-Dobson
circulation, less downwelling and therefore reduced adiabatic heating over the poles. This
cannot explain all of the bias, however. The effect of increasing gravity wave forcing is seen
mainly in the upper stratosphere and lower mesosphere, while the tropopause region is also too
cold in models. Furthermore, if an unrealistically weak Brewer-Dobson circulation is to blame
then the tropical tropopause region would show a systematic warm bias. There does appear
to be a mean warm bias here (see Figure 8 of Gates et al. (1999)), but also a large ensemble
standard deviation, indicating a large spread between models. The fact that the warm bias is
not as systematic as the cold bias could be because the polar downwelling is more concentrated
than the tropical upwelling. However, both Pawson et al. (2000) and Shepherd (2002) conclude
that the stratospheric circulation is too weak in some models, but not in others.
Entropy errors of the size required by Johnson’s theory certainly exist (Goody (2000))
and the theory does provide an explanation for the cold bias in the 300-325 K region, i.e. the
region where little warming is expected from improved representation of gravity wave effects.
Perhaps the greatest problem with Johnson’s theory is the neglect of the ∆Q terms in (5.9). If
the temperature of an air parcel decreases then the energy that it loses by longwave radiation
will also decrease. It is then hard to imagine departures of the temperature structure from that
of the real atmosphere, without also experiencing some departure of the energy exchanges.
This is hinted at in Goody’s final comment that ‘there may be compensating changes in energy
sources’. Despite this, Johnson’s entropy theory is attractive, and seems likely to account for
part of the observed cold bias. Aphysical sources must be offset by additional sinks and, given
the strong constraints on energy exchanges, a systematic bias in temperatures seems to be an
obvious mechanism to provide for this. (Another candidate could be a systematic bias in the
hydrological cycle; moist effects enter Johnson’s theory only through the heat addition Q.)
Entropy Production in the IGCM 165
Hamilton et al. (1999), Zhu and Schneider (1997) and Webster et al. (1999) all show two,
seemingly distinct, regions of bias, one centred at roughly 200 mb and another higher up. Both
are reduced in the switch to a hybrid isentropic vertical coordinate in the latter two studies.
Here, it is hypothesised that the reduction in spurious entropy sources associated with the
isentropic coordinate has reduced the cold bias in the 300-325 K region, in line with Johnson
(1997). The improved eddy heat fluxes resulting from more accurate planetary wave dynamics
would account for the reduction in bias centred on 30 mb in the winter hemisphere, where the
large-scale circulation has a strong warming effect.
5.7 Summary
• The entropy source during the IGCM LC1 cycle averages approximately 3 mWm−2K−1,
and is remarkably insensitive to changes in horizontal resolution and diffusivity.
• The effect of the model’s hyper-diffusion dominates the source, and this component is,
in turn, dominated by the entropy generation near the surface. A source of roughly
0.4 mWm−2K−1 is generated in the evaluation of the adiabatic terms in the prognostic
equations; this is certainly spurious.
• It is hypothesised here that much of the entropy generated by the hyper-diffusion does
represent a real, physical source. The diffusion generates entropy as it limits the scale
of surface fronts, and as it dissipates potential enstrophy in the cascade to small scales.
The entropy source is determined by the rate at which these small scales are generated,
and this is determined by the large-scale deformation field rather than the dissipation
mechanism.
• The diffusion clearly acts on features which are not cascading to small scales. In this case
the above argument does not apply. The resulting entropy source is expected to depend
on the formulation of the diffusion scheme, and is likely to be overestimated. Some part
of the entropy source due to diffusion is, therefore, likely to be spurious.
• The spurious entropy sources demonstrated in the IGCM are of a size which would result
in a cold bias of up to a few Kelvin, according to Johnson’s theory. It does appear
probable that this mechanism accounts for a part of the bias seen in current climate
models.
Entropy Production in the IGCM 166
• The Lagrangian conservation of entropy in the IGCM is much worse than the net source
suggests. The global increase in entropy is the residual effect of frequent increases and
decreases in the entropy of individual fluid parcels.
Chapter 6
Conclusions
The general conclusions of this thesis are that:
• There are errors in the representation of the entropy balance in conventional atmosphere
models, and these are significant.
• Isentropic models do show improved accuracy in certain areas, such as Lagrangian trans-
port and conservation, and are therefore a useful alternative to conventional atmosphere
models.
• Some of the potential benefits expected from using PV as a prognostic variable have also
been realised.
On the first of these, the most basic argument is simply that the real system obeys physical laws
such as the second law of thermodynamics, and the Lagrangian conservation of entropy under
adiabatic motion. Models which do not obey these laws are therefore unphysical in some way.
On the second, it is important that the atmosphere is modelled from a range of perspec-
tives, and using a range of different techniques. Isentropic models offer the trivial satisfaction
of Lagrangian conservation laws which are not satisfied in other models. Several hybrid isen-
tropic models are at an advanced stage of development, but technical difficulties do still exist
at the Earth’s surface in pure isentropic models, including the one developed here. These mod-
els are still useful, however, as the only tools currently available that are able to predict the
completely adiabatic evolution of a primitive equation atmosphere. The interplay between adi-
abatic and diabatic processes in the atmosphere is not fully understood, and these models have
the potential to provide valuable insight.
Conclusions 168
Before discussing these conclusions in more detail, we briefly review some specific results
of this work. More detailed summaries can be found at the end of each chapter.
6.1 Summary of Results
• Charney-Phillips type vertical grids best represent the normal modes of both hydrostatic
and non-hydrostatic atmospheres in isentropic coordinates. The isentropic coordinate
is the only one out of three coordinates tested which has the potential to easily achieve
good conservation and dispersion properties in the same grid.
• An isentropic model is presented which uses a new variation of the massless layer tech-
nique to handle the intersection of model levels with the ground. Unfortunately the
massless region is unstable, and a relaxation method is employed to control this. Simple
flows are modelled successfully, and some known problems analysed, in Chapter 3. The
model is not stable enough to simulate a whole 15 day baroclinic wave life cycle, but
does successfully simulate the growth and most of the decay phase.
• The isentropic model life cycle is much more similar to an IGCM version with∇4 hyper-
diffusion than it is to the standard∇6 version. This is in agreement with estimates of the
implicit diffusion in the advection scheme by Thuburn (1995). There is, therefore, a
slight mixing of interior and boundary PV during advection. While PV is conserved bet-
ter in the isentropic model, it is the∇6 IGCM version of the cycle which most resembles
a high resolution simulation, because the scale of PV features is limited in the isentropic
model as it is in the ∇4 IGCM.
• Despite this, some of the expected benefits of using PV as a prognostic variable have
been realised. For example, there is no spurious amplification of PV maxima, as is seen
in the IGCM life cycle, and there is no spurious generation of potential enstrophy.
• Lagrangian conservation of entropy is significantly better in the isentropic model. How-
ever, the model exhibits a total energy drift which is more serious than the error in total
entropy shown by the IGCM. This energy drift may be partly caused by the implicit
mixing of interior and boundary PV.
• As described in Section 1.2.3, recent pure isentropic models have suffered from two
outstanding technical problems. The first of these, i.e. lack of smoothness in ∇2M , is
Conclusions 169
avoided here because the horizontal and vertical resolutions are well matched. If this
is not the case, this problem still exists. The second, that of inability to model surface
fronts, appears to have been overcome. This model predicts a discontinuity in surface
potential temperature, and so supports the hypothesis that frontal discontinuities are a
feature of purely inviscid atmospheres. The model forms fronts without entropy produc-
tion, and with only slight mixing of interior and boundary PV, so these are considered
better representations of truly adiabatic fronts than those in the IGCM. As hoped, the
isentropic model is able to simulate realistic fronts at much lower resolution than the
IGCM.
• There is a spurious entropy source in the IGCM due to numerical transport errors; this
source averages 0.4 mWm−2K−1 over the lifecycle.
• Entropy is generated by numerical diffusion at a rate of approximately 2.5 mWm−2K−1.
This varies only slightly with resolution and diffusivity, and is dominated by the con-
tribution from the lowest two levels. It is argued in Chapter 5 that the majority of this
source does represent a real, physical entropy source, because it is diabatic processes
which ultimately limit both the potential enstrophy cascade, and the gradient of surface
fronts. The rate of cascade, and therefore the entropy source, is determined by the defor-
mation field of the large-scale flow, rather than the details of the dissipation mechanism.
Some fraction of this entropy source from diffusion is spurious, however, since model
diffusion does not only act on features which are cascading to small scales.
• No evidence has been found to disprove the theory of Johnson (1997) that spurious en-
tropy production in climate models is the cause of their systematic cold bias. On the
contrary, the spurious entropy source seen in the IGCM life cycle is of the size required
by the theory to cause a bias of at least a few Kelvin, according to Johnson’s estimate. It
appears likely that this is one factor responsible for the general coldness of climate mod-
els, with an unrealistically weak wave-driven stratospheric circulation also contributing.
• The IGCM, in any case, does not respect the Lagrangian conservation laws of entropy
and PV. This would undoubtedly lead to errors in the transport of moisture and chemical
species in a more complete model.
Conclusions 170
6.2 Discussion
The instability of the isentropic model clearly needs addressing. The definition of PV and
static stability values throughout the massless region appears to be justified, since the flow near
the surface is realistic. However, evolution according to the primitive equations in this region
is less justifiable, especially since a strong relaxation needs to be applied. Future work will
aim to determine these values in the massless region diagnostically using data from just above
the surface. This would eliminate the need for a relaxation scheme, preventing the surface
divergence effect of Sections 3.7 and 4.13.1, and hopefully reducing the mixing of internal and
boundary PV by reducing PV gradients. This, in turn, should reduce the energy drift.
In general the model has shown promise, especially in its Lagrangian conservation prop-
erties. Despite this, however, some synoptic PV features simulated by the ∇6 IGCM are more
similar to features in the high resolution IGCM run than those predicted by the isentropic model
are. This is because the scale limitation in the isentropic model acts like a ∇4 operator, and
at this resolution features such as the high PV troughs are affected. It is hoped that at higher
resolution this scale limitation will act to dissipate potential enstrophy at a more realistic scale,
with less effect on features such as the troughs. Unfortunately, higher resolution runs with the
current model have been unsuccessful due to the instability of the massless region.
The surface flow is well simulated by the isentropic model, which is a significant improve-
ment over similar previous models. Arakawa et al. (1992) describe how the surface evolution in
their model is inhibited by the lack of degrees of freedom in the surface potential temperature.
Baroclinic waves are too weak because the low-level wave does not evolve correctly (Konor
(2004), personal communication). Their isentropic model levels slump excessively and pile up
along the surface, as seen in Hsu and Arakawa (1990), so that the fronts where they intersect
the surface lie too far towards the equator. This is believed to be partly a result of the model
adjusting itself to smooth the ∇2M distribution. This does not happen in the isentropic model
developed here; the fronts it predicts are no further towards the equator than those in the IGCM
(compare Figures 4.2 and 4.8), and the EKE generated is almost identical to that shown by the
∇4 IGCM. The sub-gridscale fit of Section 3.4.3 is key in the accurate representation of the
surface potential temperature distribution.
One potential objection to the modelling philosophy used here is based on the scale effect
of PV. An isolated PV anomaly corresponds to wind and temperature fields that have a larger
scale, in both the horizontal and vertical, than the anomaly itself. The anomaly is therefore
Conclusions 171
harder to resolve on a discrete grid than its associated flow field. This is perhaps the reason
why the PV distribution predicted by the isentropic model in Figure 4.10 still appears slightly
more diffused than that predicted by the∇4 IGCM in Figure 4.12. However, there is very little
difference between the two simulations in either the synoptic scale features, or the eddy kinetic
energy evolution, so this effect appears to be only minor, at least in the simulation studied here.
Another objection is that the ‘inversion’ step needed to calculate the winds from the vor-
ticity and divergence, while cheap in a spectral model such as the IGCM, is expensive in a grid-
point model. Multigrid methods, as used in the isentropic model, have significantly reduced
this computational burden, but not removed it. The horizontal scheme offers many benefits: it
allows the balanced flow to evolve only by the action of a carefully constructed conservative
advection scheme on PV and density, it allows gravity modes to be damped separately if need
be, and it effectively gives the vorticity and divergence on the unstaggered Z grid, which sim-
ulates geostrophic adjustment better than any of the well-known A-E grids (Randall (1994)).
(Note that it may be possible to achieve the benefits of the Z grid in a model predicting momen-
tum directly, and so not requiring an inversion step (Ringler and Randall (2002a,b)), though
this does require the filtering of computational modes.) In future development of the isentropic
model, it will be important to establish whether these potential benefits have been realised, and
the expense justified.
Spurious entropy production occurs in the IGCM life cycle by two mechanisms, as argued
in Chapter 5. Firstly, numerical transport errors during the evaluation of adiabatic terms in the
primitive equations result in a net production of entropy. This positive source is the residual of
many errors, both positive and negative, in the Lagrangian transport of heat. While the overall
source is small, local errors are large; the potential temperature of an air parcel is certainly not
conserved by the IGCM. This supports the conclusion of Johnson et al. (2000, 2002) that non-
isentropic Eulerian atmosphere models are very bad at Lagrangian conservation. The second
mechanism is by explicit numerical diffusion. Here we claim that some of the entropy produced
by hyper-diffusion in the IGCM constitutes a spurious entropy source.
Both of these spurious sources can be removed by using an isentropic vertical coordinate,
as demonstrated by the model developed here. In this model, potential enstrophy is dissipated
on isentropic levels with no mass exchange between levels. Real entropy sources could then
be carefully parameterized as diabatic mass fluxes. It should be remembered, however, that
the energy drift shown by this isentropic model constitutes a larger error in the second law of
thermodynamics than the entropy drift in the IGCM. Given the ongoing technical problems
Conclusions 172
with pure isentropic models such as this, it is wise to also consider some alternatives.
In hybrid models not all of the atmosphere is modelled isentropically, but a large propor-
tion is. In the RUC model of Benjamin et al. (2004), 70-80% of the atmosphere is modelled
with a purely isentropic coordinate (Benjamin (2004), personal communication). In the Uni-
versity of Wisconsin model this figure is 85% (Schaack et al. (2004)). Furthermore, these
models both demonstrate exceptionally good Lagrangian entropy conservation in the tests laid
out by Johnson et al. (2000, 2002). The use of a non-isentropic coordinate in the region near
the lower boundary does not appear to damage these Lagrangian conservation properties sig-
nificantly. These coordinates are, therefore, a good solution to the problem of how to represent
entropy in an atmosphere model. These hybrid models currently employ standard physical
parameterizations which were designed, and tuned, for models based on other coordinate sys-
tems. The community is now looking to design parameterization schemes specifically to take
advantage of the benefits of the isentropic coordinate, for example, the direct link between
diabatic heating and vertical velocity.
It might be possible to reduce errors in the transport of entropy in non-isentropic Eulerian
models by developing more accurate transport schemes, but this would be very challenging.
There is, however, an option for reducing the second spurious entropy source, that from nu-
merical diffusion. The ocean is dominated by adiabatic processes to an even greater extent
than the atmosphere, so much attention has been focussed on efforts to reduce spurious dia-
batic motion in ocean models. Gent and McWilliams (1990), and subsequently Griffies (1998),
parameterize the sub-gridscale flow by effectively diffusing isopycnal layer thickness, i.e. by
moving fluid along isopycnal surfaces rather than across them. It is hoped that such a scheme
would improve the accuracy of atmosphere models, reducing both the spurious entropy source,
and the flow of moisture and chemical species across isentropic surfaces unless explicitly pa-
rameterized. Some benefit is expected even in comprehensive models which incorporate a
separate boundary layer scheme, since Figure 5.5 shows that roughly a third of the entropy
generation by the hyper-diffusion occurs above the boundary layer. Numerical diffusion is
generally believed to transport too much mass across isentropic surfaces in the tropopause re-
gion (Stohl et al. (2003a)); it is hoped that more accurate transport could be achieved if some
kind of adiabatic diffusion operator is used. Work is currently in progress to implement diffu-
sion schemes in the IGCM which are based on that of Gent and McWilliams (1990), and on
the more scale-selective scheme of Roberts and Marshall (1998).
The atmosphere is strongly constrained by the Lagrangian conservation of entropy and
Conclusions 173
PV, and exhibits a delicate balance between numerous entropy sources and sinks. If numerical
models can be made to reproduce these features, their realism can only be improved.
Appendix A
The LC2 Cycle
The second of the two life cycles studied by Thorncroft et al. (1993) is the LC2 cycle, whose
initial conditions differ from those of the LC1 cycle only by an enhanced cyclonic horizontal
shear (see Section 4.1 and Figure 4.1). A simulation of this life cycle was also attempted using
the isentropic model, but suffered from imbalances in the initial conditions. The initial state
was created in the same way as in Section 4.5 for the LC1 cycle; LC2 however proved more
sensitive to errors in the interpolation procedure. Imbalances were introduced because the
mass field was interpolated to the centre of each hexagonal or pentagonal cell, while the wind
fields were interpolated onto the cell edges. The imbalances were successfully removed using
an adjustment technique, but this significantly changed the zonal mean initial conditions. The
purpose of this appendix is to document this adjustment technique, and the effect on the mean
state, and to show that a closer fit to the IGCM LC2 cycle can be obtained by using a higher
resolution intermediate grid during the interpolation of initial conditions.
If no attempt is made to balance the initial state, then the simulation is unsuccessful. The
imbalances result in small-scale noise, and in the initial forcing of gravity waves. The noise is
largest at the pentagons, and therefore has a wavenumber 5 shape which seeds a wavenumber
5 wave in the same way as the surface pressure perturbation seeds a wavenumber 6 wave. The
resulting evolution shows a combination of waves of these two wavenumbers.
A first attempt to remove the noise was made by enforcing a period of adjustment, or bal-
ancing, following Thuburn (1995). Firstly, the normal mode perturbation was removed from
the PV field by subtracting off the difference from the zonal mean. This perturbation was then
stored in order to be re-applied later. The model was run with the density and PV fields ad-
vected only by the divergent wind, and with the timescale for hyper-diffusion on divergence
The LC2 Cycle 175
0 20 40 60 80 100 120
−80
−60
−40
−20
0
20
40
60
80
Latit
ude
Zonal Mean Divergence
0 20 40 60 80 100 120289
301
313
325
337
358
451
Time (hours)
θ (K
)
Figure A.1: Evolution of the zonal mean divergence as a function of latitude at 345 K (top),and as a function of potential temperature at 45N (bottom), during adjustment. Contours aredrawn every 2× 10−6s−1.
The LC2 Cycle 176
reduced by a factor of 10. The intention was to allow the system to adjust towards a balanced
state by emitting gravity waves. These waves would be damped by the additional divergence
damping, with the balanced dynamics essentially frozen. Note that this approach is only pos-
sible because this model is formulated with PV and divergence as prognostic variables.
Figure A.1 shows that gravity waves are indeed emitted from the region near 60N, where
the imbalance is greatest, and propagate away while being damped by the hyper-diffusion.
After 96 hours of balancing, the normal mode PV perturbation was added back on, and the
model integrated as usual from these ‘adjusted’ initial conditions. Figure A.2 compares the
subsequent PV evolution with that predicted by the IGCM. The isentropic model does predict
the correct general behaviour, with cut-off cyclones dominating in the extratropics. The inten-
sity of these cyclones, however, is much weaker than it is in the IGCM simulation, and the
disagreement between the two models is greater than that seen in the LC1 cycle.
The cause of these differences is obvious when the adjusted initial conditions are examined
in detail. Figure A.3 shows the zonal mean initial conditions before, and after, the adjustment
process. This process has significantly affected the state, as is seen most clearly in the low
level winds. After adjustment the zonal mean wind near the surface reaches 20 ms−1, a value
that is not attained until over 600 mb in the original state. The adjustment has also resulted in
a uniformity of wind on isentropic model levels in the mid-latitude troposphere, so that in this
figure the wind contours lie more parallel to the isentropes. The life cycle is very sensitive to the
zonal mean initial state, as described by Thorncroft et al. (1993); it is, after all, only changes in
the zonal mean wind shear that distinguish the LC2 cycle from LC1. The changes in the initial
state shown in Figure A.3 result in a wave which is significantly different, as is to be expected,
and so this simulation cannot be directly compared to the IGCM LC2 cycle. While adjustment
techniques such as this are clearly successful in bringing the state back towards balance, this
example shows that care must be taken since the mean state can be altered significantly.
A second attempt was then made to remove the imbalances. As described in Section 4.5,
the spectral fields from the IGCM are first evaluated on a latitude-longitude grid, and then
interpolated onto the hexagonal-icosahedral grid. The spectral coefficients define the initial
conditions exactly at a given point on the sphere, so using a higher resolution latitude-longitude
grid will result in smaller interpolation errors, and in more accurate fields on the hexagonal
grid. The resolution of this intermediate grid was doubled, and the LC2 cycle run from the
resulting initial conditions with no adjustment. Figure A.4 shows the eddy kinetic energy
of this life cycle, compared with that predicted by the IGCM, and by the isentropic model
The LC2 Cycle 177
Figure A.2: PV on the 330 K isentropic surface for the LC2 cycle in the isentropic model (left)and the IGCM (right). Contours and colour are as in Figure 4.3.
The LC2 Cycle 178
0 10 20 30 40 50 60 70 80 90
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
Pre−adjustment
p/p s
0 10 20 30 40 50 60 70 80 90
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
Post−adjustment
Latitude
p/p s
Figure A.3: Zonal mean sections, as in Figure 4.7, but for the LC2 initial conditions before, andafter, the adjustment process. The black lines are isentropes corresponding to the isentropicmodel half levels, which are spaced every 6K from 298K (dotted) to 341K (dashed). Thesloping red line is the 2 PVU surface, and the other coloured contours show the zonal windfield at 5 ms−1 intervals, with the zero contour dotted and negative contours dashed. All fieldsare zonal means.
The LC2 Cycle 179
0 5 10 150
5
10
15x 10
5
Day
Eddy Kinetic Energy (Jm−2)
IGCMAdjustedUnadjusted
Figure A.4: Eddy kinetic energy evolution for the LC2 simulations. Two runs are shown fromthe isentropic model, demonstrating the two methods for creating balanced initial conditions.The ‘adjusted’ run only used the adjustment technique, while the ‘unadjusted’ run only usedthe refined intermediate grid.
cycle using the adjustment technique. The life cycle that evolves from the adjusted initial
conditions has dramatically weaker EKE than the IGCM cycle, a result that is consistent with
the weaker cyclones seen in Figure A.2. By using the refined intermediate grid, rather than
the adjustment technique, an EKE evolution is obtained which is much closer to that predicted
by the IGCM. It is anticipated that further refinement of the intermediate grid would lead to
even closer agreement. This has not been pursued here, however, because the isentropic model
proved unstable in this life cycle, as in LC1. Addressing this deficiency is considered a higher
priority than further corrections to the initial conditions in this test case.
References
Acheson, D. J. Elementary Fluid Dynamics. Oxford University Press, 1990.
Andrews, David G., J. R. Holton, and C. B. Leovy. Middle Atmosphere Dynamics. Academic
Press, 1987.
Appenzeller, C., J. R. Holton, and K. Rosenlof, 1996. Seasonal variation of mass transport
across the tropopause. Journal of Geophysical Research, 101:15071–15078.
Arakawa, Akio. Future development of general circulation models. In Randall, D. A., edi-
tor, General Circulation Model Development, chapter 23, pages 721–780. Academic Press,
2000.
Arakawa, A. and V. R. Lamb. Computational design of the basic dynamical processes of the
UCLA general circulation model. In Chang, J., editor, General Circulation Models of the
Atmosphere, volume 17 of Methods in Computational Physics, pages 173–265. Academic
Press, 1977.
Arakawa, A., C. R. Mechoso, and C. S. Konor, 1992. An isentropic vertical coordinate model:
Design and application to atmospheric frontogenesis studies. Meteorology and Atmospheric
Physics, 50:31–45.
Arakawa, Akio and S. Moorthi, 1988. Baroclinic instability in vertically discrete systems.
Journal of the Atmospheric Sciences, 45:1688–1707.
Bacon, D. P., N. Ahmad, Z. Boybeyi, T. Dunn, M. S. Hall, C. S. Lee, A. Sarma, M. D. Turner,
K. T. Waight, S. H. Young, and J. W. Zack, 2000. A dynamically adapting weather and
dispersion model: The Operational Multiscale Environment model with Grid Adaptivity
(OMEGA). Monthly Weather Review, 128:2044–2076.
Bates, J. R., Y. Li, A. Brandt, S. F. McCormick, and J. Ruge, 1995. A global shallow-water
numerical model based on the semi-Lagrangian advection of potential vorticity. Q. J. R.
Meteorol. Soc., 121:1981–2005.
180
References 181
Benjamin, Stanley G., G. A. Grell, J. M. Brown, T. G. Smirnova, and R. Bleck, 2004.
Mesoscale weather prediction with the RUC hybrid isentropic-terrain-following coordinate
model. Monthly Weather Review, 132:473–494.
Bishop, Craig H. and A. J. Thorpe, 1994. Potential vorticity and the electrostatics analogy:
Quasi-qeostrophic theory. Q. J. R. Meteorol. Soc., 120:713–731.
Bishop, Craig H. and A. J. Thorpe, 1995. Potential vorticity and the electrostatics analogy:
Ertel-Rossby formulation. Q. J. R. Meteorol. Soc., 121:1477–1495.
Blackburn, M. An Energetic Analysis of the General Atmospheric Circulation. PhD thesis,
University of Reading, 1983.
Bleck, Rainer, 1973. Numerical forecasting experiments based on the conservation of potential
vorticity on isentropic surfaces. Journal of Applied Meteorology, 12(5):737–752.
Bleck, Rainer, 1974. Short-range prediction in isentropic coordinates with filtered and unfil-
tered numerical models. Monthly Weather Review, 102(12):813–829.
Bleck, Rainer, 1978. On the use of hybrid vertical coordinates in numerical weather prediction
models. Monthly Weather Review, 106(9):1233–1244.
Bleck, Rainer, 1984. An isentropic coordinate model suitable for lee cyclogenesis simulation.
Rivista di Meteorologia Aeronautica, 43:189–194.
Bleck, Rainer and S. G. Benjamin, 1993. Regional weather prediction with a model combining
terrain-following and isentropic coordinates. Part I: Model description. Monthly Weather
Review, 121:1770–1785.
Bleck, Rainer and D. Boudra, 1981. Initial testing of a numerical ocean circulation model
using a hybrid (quasi-isopycnic) vertical coordinate. Journal of Physical Oceanography, 11:
755–770.
Bleck, Rainer and D. Boudra, 1986. Wind-driven spin-up in eddy-resolving ocean models
formulated in isopycnic and isobaric coordinates. Journal of Geophysical Research, 91(C6):
7611–7621.
References 182
Bleck, Rainer, C. Rooth, D. Hu, and L. T. Smith, 1992. Salinity-driven thermocline transients in
a wind- and thermohaline-forced isopycnic coordinate model of the North Atlantic. Journal
of Physical Oceanography, 22:1486–1505.
Boer, G. J. Climate model intercomparison. In Mote, Phillip and A. O’Neill, editors, Numerical
modelling of the global atmosphere in the climate system, chapter 19. Kluwer Academic
Publishers, 2000.
Boer, G. J. et al., 1992. Some results from an intercomparison of the climates simulated by 14
atmospheric general circulation models. Journal of Geophysical Research, 97(D12):12771–
12786.
Boris, J. P. and D. L. Book, 1973. Flux-corrected transport I. SHASTA, a fluid transport
algorithm that works. Journal of Computational Physics, 11:38–69.
Brankovic, C. and D. Gregory, 2001. Impact of horizontal resolution on seasonal integrations.
Climate Dynamics, 18:123–143.
Bretherton, F. P., 1966. Critical layer instability in baroclinic flows. Q. J. R. Meteorol. Soc., 92
(393):325–334.
Charney, J. G., 1947. The dynamics of long waves in a baroclinic westerly current. Journal of
Meteorology, 4:135–162.
Charney, Jule G., 1971. Geostrophic turbulence. Journal of the Atmospheric Sciences, 28:
1087–1095.
Chen, Minghang and J. R. Bates, 1996. A comparison of climate simulations from a semi-
Lagrangian and an Eulerian GCM. Journal of Climate, 9:1126–1149.
Chipperfield, M. P., 1999. Multiannual simulations with a three-dimensional chemical trans-
port model. Journal of Geophysical Research, 104(D1):1781–1805.
Cooper, Iain M., A. J. Thorpe, and C. H. Bishop, 1992. The role of diffusive effects on potential
vorticity in fronts. Q. J. R. Meteorol. Soc., 118:629–647.
Cullen, M. J. P. and R. J. Purser, 1984. An extended Lagrangian theory of semi-geostrophic
frontogenesis. Journal of the Atmospheric Sciences, 41(9):1477–1497.
References 183
Curry, Judith A. and P. J. Webster. Thermodynamics of Atmospheres and Oceans. Academic
Press, 1999.
Danielsen, E. F., 1959. The laminar structure of the atmosphere and it’s relation to the concept
of a tropopause. Arch. Meteor. Geophys. Bioklim, A11:293–332.
Dewar, R., 2003. Information theory explanation of the fluctuation theorem, maximum entropy
production and self-organized criticality in non-equilibrium stationary states. J. Phys. A
Math. Gen., 36:631–641.
Dritschel, D. G. and M. H. P. Ambaum, 1997. A contour-advective semi-Lagrangian numerical
algorithm for simulating fine-scale conservative dynamical fields. Q. J. R. Meteorol. Soc.,
123:1097–1130.
Dutton, J. A. The Ceaseless Wind. McGraw-Hill, 1976.
Eady, E. T., 1949. Long waves and cyclone waves. Tellus, 1:33–52.
Edmon, H. J., B. J. Hoskins, and M. E. McIntyre, 1980. Eliassen-Palm cross sections for the
troposphere. Journal of the Atmospheric Sciences, 37:2600–2616.
Egger, Joseph, 1999. Numerical generation of entropies. Monthly Weather Review, 127:2211–
2216.
Eliassen, Arnt and E. Raustein, 1968. A numerical integration experiment with a model atmo-
sphere based on isentropic surfaces. Meteorologiske Annaler, 5(2):45–63.
Eliassen, Arnt and E. Raustein, 1970. A numerical integration experiment with a six-level
atmospheric model with isentropic information surfaces. Meteorologiske Annaler, 5(11):
429–449.
Fox-Rabinovitz, Michael S., 1994. Computational dispersion properties of vertically staggered
grids for atmospheric models. Monthly Weather Review, 122:377–392.
Fox-Rabinovitz, Michael S., 1996. Computational dispersion properties of 3D staggered grids
for a nonhydrostatic anelastic system. Monthly Weather Review, 124:498–510.
Fulton, Scott R. and W. H. Schubert, 1991. Surface frontogenesis in isentropic coordinates.
Journal of the Atmospheric Sciences, 48(23):2534–2541.
References 184
Gall, R. L., R. T. Williams, and T. L. Clark, 1987. On the minimum scale of surface fronts.
Journal of the Atmospheric Sciences, 44(18):2562–2574.
Garner, Stephen T., 1989. Fully lagrangian numerical solutions of unbalanced frontogenesis
and frontal collapse. Journal of the Atmospheric Sciences, 46(6):717–739.
Garner, Stephen T., N. Nakamura, and I. M. Held, 1992. Nonlinear equilibration of two-
dimensional eady waves: A new perspective. Journal of the Atmospheric Sciences, 49(21):
1984–1996.
Gates, W. Lawrence et al., 1999. An overview of the results of the Atmospheric Model In-
tercomparison Project (AMIP 1). Bulletin of the American Meteorological Society, 80(1):
29–55.
Gent, Peter R. and J. C. McWilliams, 1990. Isopycnal mixing in ocean circulation models.
Journal of Physical Oceanography, 20:150–155.
Gill, Adrian E. Atmosphere-Ocean Dynamics. Academic Press, 1982.
Gohar, Laila K. A study of the radiative contribution to the entropy budget of the atmosphere.
PhD thesis, Imperial College, London, 2002.
Goody, Richard, 2000. Sources and sinks of climate entropy. Q. J. R. Meteorol. Soc., 126:
1953–1970.
Goody, Richard and W. Abdou, 1996. Reversible and irreversible sources of radiation entropy.
Q. J. R. Meteorol. Soc., 122:483–494.
Gregory, Andrew. Numerical simulations of winter stratosphere dynamics. PhD thesis, Uni-
versity of Reading, 1999.
Gregory, A. R. and V. West, 2002. The sensitivity of a model’s stratospheric tape recorder to
the choice of advection scheme. Q. J. R. Meteorol. Soc., 128:1827–1846.
Griffies, Stephen M., 1998. The Gent-McWilliams skew flux. Journal of Physical Oceanog-
raphy, 28:831–841.
Grose, William L. and B. J. Hoskins, 1979. On the influence of orography on large-scale
atmospheric flow. Journal of the Atmospheric Sciences, 36:223–234.
References 185
Gu, Y., J. Farrara, K. N. Liou, and C. R. Mechoso, 2003. Parameterization of cloud-radiation
processes in the UCLA general circulation model. Journal of Climate, 16:3357–3370.
Hamilton, Kevin, R. J. Wilson, and R. S. Hemler, 1999. Middle atmosphere simulated with
high vertical and horizontal resolution versions of a GCM: Improvements in the cold pole
bias and generation of a QBO-like oscillation in the tropics. Journal of the Atmospheric
Sciences, 56(22):3829–3846.
Haynes, P. H., C. J. Marks, M. E. McIntyre, T. G. Shepherd, and K. P. Shine, 1991. On
the ‘downward control’ of extratropical diabatic circulations by eddy-induced mean zonal
forces. Journal of the Atmospheric Sciences, 48:651–678.
Haynes, P. H. and M. E. McIntyre, 1987. On the evolution of vorticity and potential vorticity
in the presence of diabatic heating and frictional or other forces. Journal of the Atmospheric
Sciences, 44(5):828–841.
Haynes, P. H. and M. E. McIntyre, 1990. On the conservation and impermeability theorems
for potential vorticity. Journal of the Atmospheric Sciences, 47(16):2021–2031.
Haynes, P. H. and W. E. Ward, 1993. The effect of realistic radiative transfer on potential
vorticity structures, including the influence of background shear and strain. Journal of the
Atmospheric Sciences, 50(20):3431–3453.
Heikes, R. P. and D. A. Randall, 1995. Numerical integration of the shallow water equations
on a twisted icosahedral grid. Part II: Grid refinement, accuracy and computational perfor-
mance. Monthly Weather Review, 123:1881–1887.
Held, Isaac M. and M. J. Suarez, 1994. A proposal for the intercomparison of the dynamical
cores of atmospheric general circulation models. Bulletin of the American Meteorological
Society, 75(10):1825–1830.
Holton, James R. An Introduction to Dynamic Meteorology. Academic Press, 1992.
Holton, James R., P. H. Haynes, M. E. McIntyre, A. R. Douglass, R. B. Rood, and L. Pfister,
1995. Stratosphere-troposphere exchange. Reviews of Geophysics, 33(4):403–439.
Hoskins, Brian J., 1991. Towards a PV-θ view of the general circulation. Tellus, 43AB:27–35.
References 186
Hoskins, Brian J. and P. Berrisford, 1988. A potential vorticity perspective of the storm of
October 1987. Weather, 43:122–129.
Hoskins, Brian J. and F. P. Bretherton, 1972. Atmospheric frontogenesis models: Mathematical
formulation and solution. Journal of the Atmospheric Sciences, 29:11–37.
Hoskins, B. J., M. E. McIntyre, and A. W. Robertson, 1985. On the use and significance of
isentropic potential vorticity maps. Q. J. R. Meteorol. Soc., 111(470):877–946.
Hoskins, B J. and A. J. Simmons, 1975. A multi-layer spectral model and the semi-implicit
method. Q. J. R. Meteorol. Soc., 101:637–655.
Hoskins, B. J. and N. V. West, 1979. Baroclinic waves and frontogenesis. Part II: Uniform
potential vorticity jet flows - cold and warm fronts. Journal of the Atmospheric Sciences, 36:
1663–1680.
Hsu, Yueh-Jiuan G. and A. Arakawa, 1990. Numerical modeling of the atmosphere with an
isentropic vertical coordinate. Monthly Weather Review, 118:1933–1959.
Johnson, Donald R., 1997. General coldness of climate models and the second law: Implica-
tions for modelling the earth system. Journal of Climate, 10:2826–2846.
Johnson, Donald R., A. J. Lenzen, T. H. Zapotocny, and T. K. Schaack, 2000. Numerical
uncertainties in the simulation of reversible isentropic processes and entropy conservation.
Journal of Climate, 13:3860–3884.
Johnson, Donald R., A. J. Lenzen, T. H. Zapotocny, and T. K. Schaack, 2002. Numerical
uncertainties in the simulation of reversible isentropic processes and entropy conservation:
Part II. Journal of Climate, 15(4):1777–1804.
Johnson, D. R., F. M. Reames, B. J. Wolf, and R. B. Pierce, 1993. A comparison of simulated
precipitation by hybrid isentropic-sigma and sigma models. Monthly Weather Review, 121:
2088–2114.
Kalnay, Eugenia. Atmospheric modeling, data assimilation and predictability. Cambridge
University Press, 2003.
Kasahara, Akira, 1974. Various vertical coordinate systems used for numerical weather pre-
diction. Monthly Weather Review, 102:509–522.
References 187
Kleidon, Axel, K. Fraedrich, T. Kunz, and F. Lunkeit, 2003. The atmospheric circulation
and states of maximum entropy production. Geophysical Research Letters, 30(23). 2223,
doi:10.1029/2003GL018363.
Kleidon, Axel and R. Lorenz. Entropy production by earth system processes. In Kleidon, Axel
and R. Lorenz, editors, Non-Equilibrium Thermodynamics and the Production of Entropy:
Life, Earth and Beyond. Springer Verlag, 2004.
Konor, Celal S. and A. Arakawa, 1997. Design of an atmospheric model based on a generalized
vertical coordinate. Monthly Weather Review, 125:1649–1673.
Konor, Celal S. and A. Arakawa, 2000. Choice of a vertical grid in incorporating condensation
heating into an isentropic vertical coordinate model. Monthly Weather Review, 128:3901–
3910.
Li, Yong, J. Ruge, J. R. Bates, and A. Brandt, 2000. A proposed adiabatic formulation of 3-
dimensional global atmospheric models based on potential vorticity. Tellus, 52A:129–139.
Lin, Shian and R. B. Rood, 1997. An explicit flux-form semi-Lagrangian shallow-water model
on the sphere. Q. J. R. Meteorol. Soc., 123(544):2477–2498.
Lorenz, E. N., 1955. Available potential energy and the maintenance of the general circulation.
Tellus, 7:157–167.
Lorenz, Ralph D., J. I. Lunine, P. G. Withers, and C. P. McKay, 2001. Titan, Mars and Earth:
Entropy production by latitudinal heat transport. Geophysical Research Letters, 28(3):415–
418.
MacVean, M. K., 1983. The effects of horizontal diffusion on baroclinic development in a
spectral model. Q. J. R. Meteorol. Soc., 109:771–783.
Mahowald, Natalie M., R. A. Plumb, P. J. Rasch, J. del Corral, F. Sassi, and W. Heres, 2002.
Stratospheric transport in a three-dimensional isentropic coordinate model. Journal of Geo-
physical Research, 107(D15).
McFarlane, N. A., 1987. The effect of orographically excited gravity wave drag on the general
circulation of the lower stratosphere and troposphere. Journal of the Atmospheric Sciences,
44(14):1775–1800.
References 188
McKenna, D. S. et al., 2002. A new chemical Lagrangian model of the stratosphere (CLaMS):
Part 1, Formulation of advection and mixing. Journal of Geophysical Research, 107(D16).
Methven, John. Tracer behavoir in baroclinic waves. PhD thesis, University of Reading, 1996.
Mohebalhojeh, A. R. and D. G. Dritschel, 2004. Contour-advective semi-Lagrangian algo-
rithms for many-layer primitive-equation models. Q. J. R. Meteorol. Soc., 130:347–364.
Nakamura, Noboru and I. M. Held, 1989. Nonlinear equilibration of two-dimensional eady
waves. Journal of the Atmospheric Sciences, 46(19):3055–3064.
Ozawa, Hisashi, A. Ohmura, R. D. Lorenz, and T. Pujol, 2003. The second law of thermo-
dynamics and the global climate system: A review of the maximum entropy production
principle. Reviews of Geophysics, 41(4):1–24.
Paltridge, G. W., 1975. Global dynamics and climate - a system of minimum entropy exchange.
Q. J. R. Meteorol. Soc., 101:475–484.
Pawson, S. et al., 2000. The GCM-Reality Intercomparison Project for SPARC: Scientific
issues and initial results. Bulletin of the American Meteorological Society, 81(4):781–796.
Peixoto, Jose Pinto, A. H. Oort, M. de Almeida, and A. Tome, 1991. Entropy budget of the
atmosphere. Journal of Geophysical Research, 96(D6):10981–10988.
Pierce, R. B. et al., 2003. Regional air quality modelling system (RAQMS) predictions of
the tropospheric ozone budget over east Asia. Journal of Geophysical Research, 108(D12):
8825. doi:10.1029/2002JD003176.
Pierce, R. B., D. R. Johnson, F. M. Reames, T. H. Zapotocny, and B. J. Wolf, 1991. Numerical
investigations with a hybrid isentropic-sigma model 1: Normal mode characteristics. Journal
of the Atmospheric Sciences, 48:2005–2024.
Pierrehumbert, R. T. and H. Yang, 1993. Global chaotic mixing on isentropic surfaces. Journal
of the Atmospheric Sciences, 50(15):2462–2480.
Polavarapu, S. M. and W. R. Peltier, 1990. The structure and nonlinear evolution of synoptic
scale cyclones: Life cycle simulations with a cloud-scale model. Journal of the Atmospheric
Sciences, 47(22):2645–2672.
References 189
Purser, R. James, S. K. Kar, S. Gopalakrishnan, and M. Iredell. Hybrid coordinate employing
a generalization of the montgomery potential. Technical report, NOAA/NCEP Office note,
2004.
Randall, David A., 1994. Geostrophic adjustment and the finite-difference shallow-water equa-
tions. Monthly Weather Review, 122:1371–1377.
Randall, D. A., R. Heikes, and T. D. Ringler. Global atmospheric modelling using a geodesic
grid with an isentropic vertical coordinate. In Randall, D. A., editor, General Circulation
Model Development, chapter 17, pages 509–538. Academic Press, 2000.
Redi, M. H., 1982. Oceanic isopycnal mixing by coordinate rotation. Journal of Physical
Oceanography, 12:1154–1158.
Ringler, Todd D. and D. A. Randall, 2002. A potential enstrophy and energy conserving nu-
merical scheme for the solution of the shallow-water equations on a geodesic grid. Monthly
Weather Review, 130:1397–1410.
Ringler, Todd D. and D. A. Randall, 2002. The ZM grid: An alternative to the Z grid. Monthly
Weather Review, 130:1411–1422.
Roberts, Malcolm and D. Marshall, 1998. Do we require adiabatic dissipation schemes in
eddy-resolving ocean models? Journal of Physical Oceanography, 28:2050–2063.
Robertson, Andrew W. Ertel potential vorticity as a tropospheric variable. PhD thesis, Uni-
versity of Reading, 1984.
Rotunno, R., W. C. Skamarock, and C. Snyder, 1994. An analysis of frontogenesis in numerical
simulations of baroclinic waves. Journal of the Atmospheric Sciences, 51:3373–3398.
Sadourny, R. and C. Basdevant, 1985. Parameterization of subgrid scale barotropic and baro-
clinic eddies in quasi-geostrophic models: Anticipated potential vorticity method. Journal
of the Atmospheric Sciences, 42:1353–1363.
Salmon, R. Lectures on Geophysical Fluid Dynamics. Oxford University Press, 1998.
Schaack, Todd K., T. H. Zapotocny, A. J. Lenzen, and D. R. Johnson, 2004. Global climate sim-
ulation with the University of Wisconsin global hybrid isentropic coordinate model. Journal
of Climate, 17:2998–3016.
References 190
Schar, Christoph, 1993. A generalization of bernoulli’s theorem. Journal of the Atmospheric
Sciences, 50(10):1437–1443.
Schneider, Tapio, I. M. Held, and S. T. Garner, 2003. Boundary effects in potential vorticity
dynamics. Journal of the Atmospheric Sciences, 60:1024–1040.
Schoeberl, Mark R., 2004. Extratropical stratosphere-troposphere mass exchange. Journal of
Geophysical Research, 109(D13303).
Shapiro, M. A., 1975. Simulation of upper-level frontogenesis with a 20-level isentropic coor-
dinate primitive equation model. Monthly Weather Review, 103:591–604.
Shapiro, M. A., T. Hampel, D. Rotzoll, and F. Mosher, 1985. The frontal hydraulic head: A
microscale (∼1km) triggering mechanism for convective weather systems. Monthly Weather
Review, 113:1166–1183.
Shepherd, Theodore G., 2002. Issues in stratosphere-troposphere coupling. Journal of the
Meteorological Society of Japan, 80(4B):769–792.
Shimokawa, Shinya and H. Ozawa, 2001. On the thermodynamics of the oceanic general
circulation: Entropy increase rate of an open dissipative system and its surroundings. Tellus,
53A:266–277.
Shimokawa, Shinya and H. Ozawa, 2002. On the thermodynamics of the oceanic general
circulation: Irreversible transition to a state with higher rate of entropy production. Q. J. R.
Meteorol. Soc., 128:2115–2128.
Simmons, A. J. and D. M. Burridge, 1981. An energy and angular-momentum conserving
vertical finite-difference scheme and hybrid vertical coordinates. Monthly Weather Review,
109:758–766.
Simmons, A. J. and B. J. Hoskins, 1976. Baroclinic instability on the sphere: Normal modes
of the primitive and quasi-geostrophic equations. Journal of the Atmospheric Sciences, 33:
1454–1477.
Simmons, A. J. and B. J. Hoskins, 1977. Baroclinic instability on the sphere: Solutions with a
more realistic tropopause. Journal of the Atmospheric Sciences, 34:581–588.
References 191
Simmons, A. J. and B. J. Hoskins, 1978. The life cycles of some nonlinear baroclinic waves.
Journal of the Atmospheric Sciences, 35:414–432.
Simmons, A. J. and B. J. Hoskins, 1980. Barotropic influences on the growth and decay of
nonlinear baroclinic waves. Journal of the Atmospheric Sciences, 37:1679–1684.
Snyder, C., W. C. Skamarock, and R. Rotunno, 1993. Frontal dynamics near and following
frontal collapse. Journal of the Atmospheric Sciences, 50(18):3194–3211.
Stephenson, David B., 1994. The impact of changing the horizontal diffusion scheme on the
northern winter climatology of a general circulation model. Q. J. R. Meteorol. Soc., 120:
211–226.
Stohl, A. et al., 2003. Stratosphere-troposphere exchange: A review, and what we have
learned from STACCATO. Journal of Geophysical Research, 108(D12). 8516, doi:
10.1029/2002JD002490.
Stohl, Andreas, H. Wernli, P. James, M. Bourqui, C. Forster, M. A. Liniger, P. Seibert, and
M. Sprenger, 2003. A new perspective of stratosphere-troposphere exchange. Bulletin of the
American Meteorological Society, 84:1565–1573.
Temperton, C. and A. Staniforth, 1987. An efficient two-time-level semi-Lagrangian semi-
implicit integration scheme. Q. J. R. Meteorol. Soc., 113(477):1025–1039.
Thorncroft, C. D. and B. J. Hoskins, 1990. Frontal cyclogenesis. Journal of the Atmospheric
Sciences, 47(19):2317–2336.
Thorncroft, C. D., B. J. Hoskins, and M. E. McIntyre, 1993. Two paradigms of baroclinic-wave
life-cycle behaviour. Q. J. R. Meteorol. Soc., 119:17–55.
Thuburn, John, 1993. Baroclinic-wave life cycles, climate simulations and cross-isentrope
mass flow in a hybrid isentropic coordinate GCM. Q. J. R. Meteorol. Soc., 119:489–508.
Thuburn, J., 1995. Dissipation and cascades to small scales in numerical models using a shape-
preserving advection scheme. Monthly Weather Review, 123:1888–1903.
Thuburn, John, 1996. Multidimensional flux-limited advection schemes. Journal of Computa-
tional Physics, 123:74–83.
References 192
Thuburn, John, 1997. A PV-based shallow-water model on a hexagonal-icosahedral grid.
Monthly Weather Review, 125:2328–2347.
Thuburn, John and M. E. McIntyre, 1997. Numerical advection schemes, cross-isentropic
random walks, and correlations between chemical species. Journal of Geophysical Research,
102(D6):6775–6797.
Thuburn, J. and T. J. Woollings, 2005. Vertical discretizations for compressible Euler equation
atmospheric models giving optimal representation of normal modes. Journal of Computa-
tional Physics, 203:386–404.
Tomita, Hirofumi, M. Satoh, and K. Goto, 2002. An optimization of the icosahedral
grid modified by spring dynamics. Journal of Computational Physics, 183:307–331.
doi:10.1006/jcph.2002.7193.
Trevisan, Anna, 1975. Numerical experiments on the influence of orography on cyclone for-
mation with an isentropic primitive equation model. Journal of the Atmospheric Sciences,
33:768–780.
Uccellini, Louis W., D. R. Johnson, and R. E. Schlesinger, 1979. An isentropic and sigma
coordinate hybrid numerical model: Model development and some initial tests. Journal of
the Atmospheric Sciences, 36:390–414.
Waugh, Darryn W. and R. A. Plumb, 1994. Contour advection with surgery: A technique for
investigating finescale structure in tracer transport. Journal of the Atmospheric Sciences, 51
(4):530–540.
Webster, Stuart, J. Thuburn, B. Hoskins, and M. Rodwell, 1999. Further development of a
hybrid-isentropic GCM. Q. J. R. Meteorol. Soc., 125:2305–2331.
Wernli, Heini, S. Dirren, M. A. Liniger, and M. Zillig, 2002. Dynamical aspects of the life
cycle of the winter storm ‘Lothar’. Q. J. R. Meteorol. Soc., 128(580):405–429.
White, A. A. A view of the equations of meteorological dynamics and various approxima-
tions. In Norbury, J. and I. Roulstone, editors, Large-Scale Atmosphere-Ocean Dynamics I:
Analytical methods and numerical models., pages 1–100. Cambridge University Press, 2002.
Williamson, D. L. and J. G. Olson, 1994. Climate simulations with a semi-Lagrangian version
of the NCAR Community Climate Model. Monthly Weather Review, 122:1594–1610.
References 193
Zalesak, S. T., 1979. Fully multidimensional flux-corrected transport algorithms for fluids.
Journal of Computational Physics, 31:335–362.
Zapotocny, Tom H., D. R. Johnson, and F. M. Reames, 1994. Development and initial test of
the University of Wisconsin global isentropic-sigma model. Monthly Weather Review, 122:
2160–2178.
Zhu, Z. and E. K. Schneider, 1997. Improvement in stratosphere simulation with a hybrid σ−θcoordinate GCM. Q. J. R. Meteorol. Soc., 123:2095–2113.
Zhu, Zhengxin, J. Thuburn, B. J. Hoskins, and P. H. Haynes, 1992. A vertical finite-difference
scheme based on a hybrid σ-θ-p coordinate. Monthly Weather Review, 120:851–862.
Ziemianski, M. Z. and A. J. Thorpe, 2000. The dynamical consequences for tropopause folding
of PV anomalies induced by surface frontal collapse. Q. J. R. Meteorol. Soc., 126:2747–
2764.
Ziemianski, M. Z. and A. J. Thorpe, 2002. Nonlinear balanced simulation of a baroclinic wave.
Q. J. R. Meteorol. Soc., 128:1879–1897.