How should trends in hydrological extremes be estimated?

How should trends in hydrological extremes be estimated?

Robin T Clarke

Instituto de Pesquisas Hidraulicas

Universidade Federal do Rio Grande do Sul (UFRGS)

Porto Alegre – RS

Brazil

Keywords: Trend estimation; hydrological extremes; extreme value distributions; Theil-

Sen.

This article has been accepted for publication and undergone full peer review but has not beenthrough the copyediting, typesetting, pagination and proofreading process which may lead todifferences between this version and the Version of Record. Please cite this article as an‘Accepted Article’, doi: 10.1002/wrcr.20485

2

Summary.

A comparison of six procedures for estimating the linear trend parameter β in

annual maximum one-day river flows at five sites in southern Brazil showed marked

differences between, on the one hand, estimates obtained by incorporating trend into the

GEV location parameter with all parameters estimated by maximum likelihood (ML); and

on the other hand, estimates found by least squares, trend-removal prior to fitting the

GEV by ML, boot-strap sampling, and Theil-Sen estimation. ML estimates of trend were

considerably smaller than those given by all other procedures. The same was true where

trend had been incorporated into the Gumbel location parameter. Where 95% confidence

intervals were calculated for the “true” trend β by different procedures, some confidence

intervals bracketed zero (indicating that the trend was not “significant” at the 5% level)

but there was no consistency between results from different procedures; Theil-Sen

confidence intervals always bracketed zero, confidence intervals given by de-trending

never did. It is concluded that not only do different estimation procedures give different

measures of trend uncertainty, as reported elsewhere, but the estimated trends themselves

may differ, and the paper suggests an explanation why this may occur. Some

philosophical issues relating to estimation of trend in climatological and hydrological

extremes are discussed, and it is concluded that selection of a method to estimate trend

must depend on context.

3

Introduction.

Recent decades have seen a surge in analyses of hydrological and climatological

data to seek evidence of trends brought about by anthropogenic influences. In terms of

trends in river flows, the international literature reports many recent studies of trends in

annual and seasonal flows [e.g., Hannaford and Buys, 2012; Burn et al., 2010; Wilson et

al., 2010; Fiala et al., 2010; Novotny and Stefan, 2007, Hodgkins and Dudley, 2006]

some of which have included analyses of trends in high flows [Marsh and Harvey 2012;

MacDonald, 2012; Petrow and Merz, 2009; Hannaford and Marsh, 2008; Svensson et

al., 2006; Robson et al., 1998]. The present paper addresses some issues relating to the

analysis of trends in high flows, in a region where land-use change from native forest to

arable cropping is likely to be at least as influential on extreme river flows as any existing

or potential climate change over the last eighty years. In particular, the paper is concerned

with the apparently simple issue of how to estimate trends in annual maximum one-day

river flows, although the methods discussed also have relevance to the estimation of

trends in annual maximum rainfalls of any given duration, and trends in annual maxima

of climatological variables.

There is a very extensive literature on the use of Extreme Value (EV) distributions

for describing the variability amongst “block maxima” (such as the series obtained by

abstracting maximum values during periods or blocks – typically years) with the

Generalized Extreme Value (GEV) distribution given by

4

G (q) = exp{- [ 1 + ξ ( q – μ ) / σ ] -1/ξ } 1 + ξ ( q – μ ) / σ > 0; σ > 0 (1)

where μ, σ, ξ are parameters of location, scale and shape respectively; q denotes the

annual maximum one-day flow. The methods of Hosking and Wallis [1997] based on

estimation of the GEV parameters (μ, σ, ξ) by L-moments have proved simple to use, and

software is also widely available for fitting GEV distributions by maximum likelihood

(ML). Coles [2001, Chapter 6] has described how, in the presence of trend in the series of

block maxima, the GEV distribution can be adapted to estimate trends in any of the

parameters, typically and most commonly by fitting the modified distribution GEV

(μ (t), σ, ξ) where μ (t) = α + β t ; extensions to GEV (μ (t), σ (t), ξ(t)) are also possible.

In all such approaches, it is assumed that annual maxima are statistically independent.

This paper considers only the simpler model GEV (α + β t , σ, ξ), with particular

emphasis on the estimation of β, and on the uncertainty in this estimate as measured by

its 95% confidence interval. The linear trend parameter β is estimated by ML using the

statistical package GenStat [VSN International, 2012]; other packages (e.g., ismev,

http://www.ral.ucar.edu/~ericg/softextreme.php, Heffernan and Stephenson, 2013, and

related packages evd, evdbayes, lmom, POT) allow more general GEV models to be fitted

with link functions relating parameters to predictors [Coles, 2001, Section 6.1] , which

may include other predictors as well as time. Also, since the GEV distribution reduces to

the widely-used Gumbel distribution when the GEV shape parameter ξ is zero, the paper

includes this too, using a form in which the Gumbel parameter μ is μ (t) = α + β t.

5

A GEV distribution with time-variant location parameter is not the only way to

estimate trend parameters when annual maxima are statistically independent. The many

alternatives include bootstrap estimation [e.g., Venables and Ripley, 1999], Theil-Sen

distribution-free estimation [e.g., Hannaford and Buys, 2012], two methods that are used

in this paper. Ordinary least squares also provides a valid estimate of the trend parameter

β, but the fact that extreme flows are commonly heteroscedastic, with larger extremes

having greater variances, means that confidence limits for β calculated on the

assumption that the residual variance σe2 is constant will no longer be valid. Another

estimate of β explored in this paper is given by using least squares to estimate the trend

coefficient β, then calculating the residuals {ε t} to which a GEV (or Gumbel) is fitted,

the location parameter then being no longer time-dependent. An advantage of such a

procedure is that ML estimation requires a search over the three-dimensional (μ, σ, ξ)

space instead of the four-dimensional (μ, σ, ξ, β): a considerable advantage if many

sequences are to be analysed for trend Other methods of trend estimation (M-estimators;

least median of squares (LMS); least trimmed squares (LTS); S-estimation; MM-

estimation) might have been included, but most have drawbacks [Venables and Ripley,

1999], and in any case, are not widely used by hydrologists and climatologists. Nor does

the paper include the estimation of trends in “peaks-over-threshold” (PoT) models, which

can also be modelled, in very general forms, by the ismev software referred to above.

Thus the purpose of the paper is to compare estimates ^

β of the trend coefficient

β obtained from a number of possible estimation procedures, and to compare their

uncertainties as measured by these estimates’ approximate confidence intervals. The

6

following sections describe the data used, the analytical procedures by which they were

analysed, and the results. A discussion follows, and conclusions are stated.

Data.

The primary data source was measurements of daily mean flow over the period

1940 to 2010 at five gauging stations in the southernmost Brazilian state of Rio Grande

do Sul, a region where clearance of native forest and land-use change to agriculture has

been extensive over the period of record. For one site, 14 de Julho, the record was even

longer: 1931 to 2010, but with the two years 2004 and 2005 missing. The five sites were

used because they are the longest in the region, and because they are almost complete.

Table 1 give details of location, upstream drainage area, and mean annual rainfall for the

five sites. From the records of daily mean flow, the maximum daily flow in each year of

record was abstracted. Figure 1 shows a plot of the five sequences of annual maximum

flows for the period 1940-2010, standardized by subtracting each sequence mean and

dividing by each sequence’s standard deviation, so that the vertical axis is dimensionless.

Visual inspection suggests a slight positive trend in annual maxima over the period at

some sites, as well as substantial cross-correlation between sites. When each record is

divided into four roughly equal periods, a Bartlett test shows evidence of variance

heterogeneity at four of the five sites (χ2 values 9.32, 8.85, 8.08, 13.32, 6.60; P-values

0.025, 0.031, 0.044, 0.004, 0.086, all with three degrees of freedom); but where variances

differ significantly, they do not show an increasing trend with time. Variance

heterogeneity was also found where data were divided into four groups by magnitude, as

7

well as by time. Modelling the variance of annual maxima, and of cross-correlation, are

not discussed further in this paper, which focuses on the estimation of linear trend in the

measured variable.

Analytical method.

The analysis is to compare estimates of trend, and the uncertainties in those estimates (as

measured by 95% confidence intervals), when different analytical procedures were used

for trend estimation. Because the flow sequences at the five gauge sites are annual

maxima, it is appropriate to assume that serial correlation is absent, and this was

quantified by calculating the correlogram and a portmanteau test for each sequence [Box

et al., 2009]. For the site 14 de Julho, Figure 2 gives a Q-Q plot comparing sequence

quantiles with quantiles of a standard GEV distribution. Figures for the other four sites

are similar, and are not shown. In all cases, quantiles of the flow sequence lay

comfortably within the 95% confidence interval for the Q-Q plot, suggesting that a GEV

distribution, possibly with super-imposed linear trend, should be an adequate description

of the statistical characteristics of annual maximum flows. ML procedures [Coles, 2001,

Chapter 6] were used to estimate the in-built linear trend of the form μ (t) = α+ β t (Coles

also used an exponential trend in the dispersion parameter σ, but this was not explored in

this paper). Τhus the following procedure was used to calculate an estimate ^

β of a linear

trend parameter β, at each of the five sites in Table 1:

Method 1: G (q) = exp{- [ 1 + ξ ( q – α − β t ) / σ ] -1/ξ (2)

8

with the four parameters α, β, σ and ξ estimated by ML. Standard errors for estimates of

the parameters were calculated from the information matrix 12 ]/log[ −∂∂−∂ jiL θθ evaluated

at the maximum of the log-likelihood function log L (where ji θθ , are the four parameters

taken in pairs) and were used to calculate large-sample confidence intervals for the trend

β. The mean of the distribution in (2) is

α + β t + σ [Γ (1 + ξ ) – 1 ] / ξ (3)

which provides a linear trend with trend coefficient β. It is shown below that, at each of

the five sites, the shape parameter ξ was not large relative to its standard error, suggesting

that the simpler Gumbel distribution, with linear trend incorporated, might also be an

acceptable model for the purposes of this study. Therefore, the GEV reduces to:-

Method 2: G (q) = exp{- exp [- ( q – α − β t ) / σ ] } (4)

which was also used to estimate the trend parameter β , and the standard error of^

β , from

which approximate 95% confidence intervals were calculated . These estimates are also

shown in Table 2.

As a third method, a linear trend was estimated at each site, using the same expression as

that used in ordinary linear regression, with

9

^

β = = =

−−N

i

N

iiii ttttq

1 1

2__

)(/)( (5)

where ti is the i-th year and N is the number of years in the sequence (i =1…N). For any

linear trend model such that q t = α + β t + (possibly, a function of model parameters

which does not involve the data q t) + ε t, the expression in (5) is an unbiased estimator of

β, whatever the distribution of the residuals ε t , whether or not residuals are uncorrelated,

and whether or not the residuals ε t have homoscedastic variance. After estimating the

trend β using (5), the sequence was “de-trended” and the GEV distribution given by (1)

was fitted by ML to give estimates of the remaining three parameters α, σ, ξ. The

variance of a random variable Q with a GEV distribution is (σ / ξ)2 (g 2 - g1 2) where gk =

Γ (1 + k ξ), k= 1,2, so that if the N years are consecutive, the standard error of the trend

estimate ^

β given by (5) is the square root of

var[^

β ] = 12 (σ / ξ)2 (g 2 - g1 2) / [N (N 2 – 1 )] (6)

As mentioned, an advantage of this procedure is that, if the trend can be estimated

efficiently by least squares, estimation of the GEV parameters then requires a search over

the three-dimensional space of α, σ, ξ instead of the four dimensional space for α, σ, ξ ,

β.

Therefore the following method was also used:-

Method 3: Trend estimated by ^

β = 12 =

N

i 1

qi { i - (N+1) / 2 } /[N (N 2 – 1 )]

10

with standard error given by the square root of (6), and with σ, ξ estimated from the de-

trended data sequence.

The variance of a random variable Q with a Gumbel distribution is σ2 π2 / 6, so

that a fourth estimator is given by

Method 4: Trend estimated by ^

β = 12 =

N

i 1

qi { i - (N+1) / 2 } /[N (N 2 – 1 )]

with standard error the square root of var[^

β ] = 2 σ 2 π2 / [N (N 2 – 1 )].

In both Methods 3 and 4, the de-trended sequences no longer consist of

statistically-independent values, because the estimated residuals are linear functions of

the original data sequence { q t }, t = 1…N. It can be shown [e.g., Johnson and Wichern ,

2007] that the de-trended residuals, denoted by r*, are given by r* = (I- H) q where H is

the ‘hat’ matrix H = X (XTX)-1XT, I is the unit matrix, and X is the N×2 matrix with 1’s in

the first column and the year numbers in the second. Thus the covariance between the r-

th and s-th residuals, obtained after de-trending when data are from a GEV distribution, is

[ ≠ srii ,,

h r i h s i – (2 – h r r – h s s) h r s]. (g 2 - g1 2) (σ / ξ)2 (7)

In the case of the Gumbel distribution, the expression in square brackets is multiplied by

σ2 π2 / 6. This covariance structure differs from the covariance structure of (for example)

a lag-one auto-regression, so the type of portmanteau test used to test for serial

11

correlation between values in the original data sequence is no longer appropriate. Thus,

whilst the estimate of trend given by (5) is valid, the presence of correlation between

residuals after de-trending means that the usual log-likelihood function is incorrect. This

may affect estimates of the parameters μ, σ, ξ of the GEV distribution, and hence the

standard error of the trend estimate given by (5).

Methods 1 to 4 above are all based on probability distributions for the sequences

of annual maximum flows. To compare their performance with methods which do not use

such assumptions, two distribution-free procedures were used: namely, a bootstrap

estimate of trend, and the Theil-Sen estimate. Thus:

Method 5: For a data-set of length N years, N pairs of values (t i , q i) were drawn with

replacement, where t i is the year number and q i (i = 1…N) is the annual maximum for

that year. The slope (given by (5) ) and intercept of the sample were calculated, and the

procedure was repeated 600 times; all simulations and boot-strap calculations used 600

repetitions, since this number was recommended by Wilcox [2012] for calculating

confidence intervals for the Theil-Sen trend estimate described below. Quantiles

corresponding to probabilities 0.025 and 0.975 then gave a 95% confidence interval for

the trend β. This procedure is equivalent to a randomly-weighted regression [Venables

and Ripley, 1999]; as an alternative, they suggest (Section 6.6) model-based re-sampling

in which the residuals about the fitted regression are re-sampled. However the calculated

residuals r* do not then have the correct variance, or even the same variance as each

other. Corrections are possible [Venables and Ripley, 1999] but model-based re-sampling

12

was not pursued further in this paper. Methods 3, 4 and 5 all estimate trend by the

expression in (5).

Method 6 estimated trend by means of the Theil-Sen distribution-free procedure.

Trends were calculated as follows: (i) at each site (except for 14 de Julho), the set of

M=70×71/2=2485 differences (qk - qi)/(k - i) for i < k. The median of these differences

gave the Theil-Sen estimate of trend β; (ii) 600 samples of size N annual maxima were

drawn, with replacement, from the record of length N years, and the Theil-Sen estimate of

trend was calculated from each of the 600 samples. Quantiles corresponding to

probabilities 0.025 and 0.975 then gave a 95% confidence interval for the trend β . For

the site 14 de Julho, where two years were missing, the longest period of unbroken record

(1931-2003) was used, giving M = 2701. This procedure differs from that given by

Wilcox [2011] who re-sampled the M differences with replacement. It also differs from

the variant of the Theil-Sen estimator given by Siegel [1982] who determined, for each

sample point (ti, qi), the median mi of the slopes (qj − qi)/(tj − ti) of lines through that

point, and then calculated the overall estimator as the median of these medians.

Results.

Table 2(a) shows estimates of the linear trend parameter estimated by these six

methods. The table shows that differences between estimates of trend obtained by

Method 1 (fitting a GEV by ML, with trend incorporated in the location parameter) and

Method 3 (de-trending by least squares, before fitting a GEV distribution to the de-

13

trended series) are large. Trend estimates from Method 3 are in all cases much larger than

the ML estimates, and in one case (14 de Julho) are several times larger. The same is true

for trend estimates obtained by Method 2 (Gumbel, with trend incorporated in the

location parameter) and Method 4 (Gumbel fitted after de-trending). Where the location

parameters of GEV and Gumbel distributions have trend incorporated, Gumbel estimates

of the trend coefficient β are always larger than those from fitting the GEV distribution,

despite the fact that the shape parameter ξ is not large, relative to its standard error, at any

of the five sites, as shown at the bottom of Table 2(a). For Site 14 de Julho, Figure 3

compares the trend estimated by fitting a GEV with trend parameter incorporated

(Method 1), with the least-squares estimate of trend (Methods 3, 4 and 5); visually, the

least-squares fit looks the better fit. Both Theil-Sen estimates of trend β, and the least-

squares estimate of β, are very considerably greater than estimates of β obtained from

both GEV and Gumbel distributions, with or without de-trending. As reported [e.g., Frei,

2011], the Theil-Sen estimates of β are smaller than the ^

β found by least squares.

Table 2(b) shows 95% upper and lower confidence limits, and the width of the

95% confidence intervals, for linear trends estimated by the six methods at the five sites.

For fits by ML, it is assumed that large-sample properties hold so that the approximate

95% confidence intervals are given by ± 2×SE, where the standard error SE is found as

described above. Where the confidence interval brackets zero, the estimated trends are

consistent with a null hypothesis of zero trend; where zero is not bracketed, the null

hypothesis of zero trend would be rejected. In terms of statistical significance, therefore,

results given by the five methods are very different. For the site 14 de Julho, the

14

confidence interval from Method 1 (GEV with incorporated trend) brackets zero, and so

does the confidence interval for Method 2 (Gumbel with incorporated trend), Method 5

(bootstrapped linear regression) and Method 6 (Theil-Sen), whilst the ‘detrended’

methods (Methods 3 and 4) both indicated that the null hypothesis of zero trend should be

rejected. At all five sites, Theil-Sen confidence intervals bracketed zero, showing that

Method 6 always gave estimates of trend consistent with the zero-trend hypothesis.

Confidence-interval widths often differed substantially between methods at any given

site, but no consistent pattern emerges; the widths of confidence intervals where data are

de-trended are usually greater than where trend is built into a distribution’s location

parameter, and the differences are sometimes large.

By comparing the upper and lower confidence limits with the least-squares

estimates of trend shown in Table 2(a), it can also be seen that, at three of the five sites,

the least-squares estimate of trend lay outside the confidence interval for the ML

estimate; at two of the five sites, the Theil-Sen estimate lay outside the confidence

interval for the ML estimate. Interpretation of this result requires caution, however,

because both the least-squares and Theil-Sen estimates are also subject to uncertainty,

and because records at the five sites are to some extent cross-correlated.

Table 2(c) shows how estimates of the location and scale parameters, α (= μ) and

σ, differ when they are estimated from GEV distributions with the trend parameter

incorporated, and from GEV distributions after linear trend has been removed (i.e.,

Methods 1 and 3). Estimates of the GEV distribution’s α (i.e., the location parameter of

the de-trended residuals) are consistently greater when data are de-trended, but the

15

differences are not large; the same is true of estimates of σ, the scale parameter, but here

the differences between Methods 1 and 3 are substantially greater. For the Gumbel

distribution, differences between estimates of α with and without trend removal (Methods

2 and 4) are very small; estimates of the scale parameter are greater where data are de-

trended, but the increases are smaller than those for the GEV distribution. De-trending

has a marked effect on estimates of the shape parameter ξ, shown in Table 2(a); these,

and their standard errors, were substantially reduced where the data were de-trended

before fitting a GEV.

The main point emerging from Tables 2(a), (b) and (c) is that when a trend

coefficient is incorporated into the location parameter (whether GEV or Gumbel) of

annual maximum one-day flows, the estimates of trend were markedly different from any

estimates of trend obtained by least squares, or by Theil-Sen, or even fitting by eye (see

Fig. 3). To explore this in greater detail, samples of size 70 were drawn from five GEV

distributions with parameter values equal to those given in Table 2 (in the case of the

gauge 14 de Julho, for example, the GEV parameters were α = 2522, σ = 1063, ξ =

0.1251, β = 5.626). Six hundred samples were drawn in each case. For each sample, the

trend β was estimated (i) by ML (Method 1), (ii) by ordinary least squares (giving the

estimated trend for Methods 3, 4 and 5), and (iii) by the Theil-Sen estimator (Method 6).

Table 3 gives statistics derived from the 600 generated samples. Comparisons by t-test

between the means of the ^

β ’s over all 600 simulations and the “true” trends shown at

the top of Table 3, show that none of the three methods shows evidence of major bias,

and indeed theory shows that the ML estimates ^

β are unbiased when the sample of years

16

is large. Provided that the trend is linear, least-squares estimates ^

β are unbiased whatever

the length of record. Theory also shows that, given certain regularity conditions [Kendall

et al., 1983] no other estimation procedure will give estimates^

β with smaller variance

than ML estimates when the length of record is sufficiently long (when ML estimates are

“asymptotically efficient”). Table 3 also shows standard deviations (SDs) of the 600

samples at the five sites. The SD for ML estimates ^

β is substantially smaller than the

SDs of either least squares or Theil-Sen estimates, and the SDs of the Theil-Sen estimates

are substantially smaller than the SDs given by least squares. Figure 4 shows a scatter-

diagram for the Site 14 de Julho; the greater variability in the least-squares estimate

“pulls” the line towards the horizontal, giving a slope (0.393) considerably less than 45o.

Correlations between the estimates ^

β obtained from the 600 samples by ML,

least squares and Theil-Sen are shown at the bottom of Table 3; the correlation between

ML and least-squares estimates of trend is lowest at all sites, ranging from about 0.5 to

0.75. Correlation between ML and Theil-Sen estimates is considerably greater, whilst

correlations between least-squares and Theil-Sen estimates are in the range 0.82 to 0.92.

These correlations, together with the asymptotic efficiency of ML estimates

mentioned above, suggest an explanation for the differences between ML and other trend

estimates shown in Table 2(a). The argument is as follows. Consider the probability

distributions of the ML and least-squares estimates of trend (given, in the case of Figure

4, by the projections of points on to the vertical and horizontal axes respectively). The

dispersion of the former (ML) distribution will be less than that of the latter because of

17

ML asymptotic efficiency; both distributions will be centred about the true trend value,

however, because both ML and least-squares estimators are unbiased (the former in large

samples, the latter in samples of any size, when the trend is truly linear). Hence, when the

sample of years in a record of annual maxima gives a trend that is large and positive

when estimated by least squares, the ML estimate of trend will be smaller (closer to the

central value common to both distributions) but still positive (because ML and least-

squares estimates are positively correlated). The converse must also be true: when the

sample of years in the record of annual maxima shows a trend that is small or even

negative when estimated by least squares (i.e., the least squares estimate lies in the left-

hand tail of its probability distribution), the ML estimate will be greater (again, closer to

the central value common to both distributions). A similar argument is developed by

substituting ‘Theil-Sen estimator’ for ‘least-squares estimator’.

Discussion.

The trends in the estimation methods described above are linear trends, and it

could be argued that changes may be episodic rather than linear. The many methods for

modeling episodic changes over time include piece-wise or “broken stick” regression,

with or without continuity at the break-points, the number of which may be known or

unknown [Khodadadi and Asgharian, 2008; Toms and Lesperance, 2003]. Erdman and

Emerson [2007] following Barry and Hartigan [1993] used Bayesian methods to estimate

posterior distributions of parameters θ i {i=1…b} where a data sequence {Xi}, i=1…n is

modeled as b “blocks” with the i-th block specified by θ i which may be (for example) a

distribution mean or a regression coefficient. In the physical context of hydrological

18

extremes, however, results from any analysis of episodic trends will be more acceptable

where changes are known to have occurred at specific times, and from specific causes,

rather than found by using a computer algorithm to determine the number of change

points and the change magnitudes. As far as is known, data used to illustrate methods

described above were not subject to any such specific changes; in the absence of such

knowledge, a linear trend gives a simple description of the way that annual maxima

changed over the period of record.

In an important paper on the calculation of trends and their uncertainties, Cohn

and Lins [2005] explored aspects of the statistical significance of trends in

hydroclimatological time series, concluding that “while trend magnitude can be

determined with little ambiguity, the corresponding statistical significance … is less

certain because significance depends critically on the null hypothesis, which in turn

reflects subjective notions about what one expects to see”. They also concluded that “it

may be preferable to acknowledge that the concept of statistical significance is

meaningless when discussing poorly-understood systems.” The results presented above

suggest that uncertainty is a characteristic not only of the statistical significance of a

trend, as Cohn and Lins state, but also of its magnitude. Two sets of procedures for

estimating trend in annual maximum one-day flows, both arguably valid, have been

shown to give very different results in some specific cases: one set, based on the almost

universal assumption in frequency analyses that annual maxima are statistically

independent with some kind of extreme-value distribution (GEV or Gumbel), and another

set which assumed statistical independence but no distributional form (bootstrap, Theil-

Sen). Table 2(a) shows that estimates of trend given by the two sets of procedures can be

19

markedly different. Furthermore, as confirmation of the statement from Cohn and Lins

[2005] quoted above, the two sets of procedures lead (in the case of the five records here

considered) to different conclusions about the statistical significance of trends, shown by

Table 2(b).

It can be argued [Koutsoyiannis, 2006] that analysing time series of hydrological

or climatic data for trends is in any case illogical, and that “a stochastic approach

hypothesizing stationarity and simultaneously admitting a scaling behaviour reproduces

climatic trends (considering them as large-scale fluctuations) in a manner that is logically

consistent.” Such an approach, based on the well-known Hurst coefficient, “does not

require the separation of the time series into two or more components, so it does not

attempt to de-trend the original series. It admits that the existence of trends is the normal

behaviour of real world time series” [Koutsoyiannis, 2006]. However a careful reading of

his text suggests that Koutsoyiannis’ criticism is levelled principally at the fallacy of

concluding that such “trends” are deterministic. But the term “deterministic” must imply

that present behaviour determines a system’s future behaviour; in the present context

where annual extreme one-day flows are analysed over a period of 70 to 80 years,

changes – where they exist – are almost certainly the consequence of land-use change

from native forest to arable cropping, with no “deterministic” interpretation about future

behaviour possible. It can therefore be argued that in such circumstances, a linear trend

coefficient is a useful summary of past behaviour (although it can never be more than

that; certainly no extrapolation to future years is possible). In a sense, a trend coefficient

b and H coefficient are complementary entities: the former is a statistical summary of

time-series behaviour over a specified period of the past, and the latter is a statistical

20

summary of future long-term behaviour over an indeterminate (but long) period; neither

conveys information about deterministic influences that caused, or will cause, variability

observed in the time-series.

It has been mentioned that the Methods 3 and 4, which de-trended the sequence of

annual maxima before fitting GEV or Gumbel distributions to the de-trended residuals,

introduces a correlation (although not a serial correlation) amongst them. The issue of

introduced correlation appears to be a much wider issue than in the present paper. Where,

for example, future “scenarios” of annual climatic extremes are produced by Global

Climate Models (GCMs), the annual (or seasonal) extremes in the scenario will all be

functions of GCM parameters and their initial values, and will remain so however many

“members” are calculated in the scenario. A similar argument would appear to hold

where coupled climate-hydrological models are used to produce future scenarios of

extreme flows. Further research is required to explore whether such introduced

correlations are important or whether they can be safely ignored.

In conclusion, we return to the title of this paper: how should trends in hydrologic

extremes be estimated? The answer is that the method must depend on the context, and

no single method can be recommended for all circumstances. We illustrate with three

cases: (i) where the context requires exploration of the causes of trend (i.e., determination

of whether certain predictors explain the trend fully, partially, or not at all) then a

parametric approach is required, with assumptions – to be verified subsequently – about

underlying probability distributions that allow the use of likelihood theory for hypothesis

21

testing (or, in a Bayesian context, for calculating the posterior probability distribution of a

trend). This would rule out the use of distribution-free methods such as bootstrapping or

Theil-Sen estimation. If, however, (ii) the context requires the exploration of a regional

average trend of, say, annual maximum n-day rainfalls measured at P sites within the

region, distribution-free methods could be used; the trend could be estimated by either

bootstrapping or Theil-Sen at each of the P sites, with standard geostatistical methods

used to calculate the regional trend and its uncertainty. Again (iii) if it were required to

estimate the difference between trends at two or more sites, so that estimating differences

between trends is more important than obtaining good estimates of the trend at each site

separately, then any of the methods compared above could be used (although if the

uncertainty in the differences were also to be estimated, this would complicate the use of

Theil-Sen and bootstrap methods). Whichever methods are used, the estimates of trend

may differ from method to method; the uncertainty as measured by confidence intervals

may differ; and (with due reminder of the warning of Cohn and Lins [2005] referred to

above) if significance-testing is required, conclusions about significance may vary from

one method to another.

Conclusion.

A comparison of six procedures for estimating trend β in annual maximum one-

day river flows at five sites showed marked differences between, on the one hand,

estimates obtained by incorporating trend into the GEV location parameters; and on the

other hand, estimates found by (i) trend removal by least squares prior to fitting the GEV,

22

(ii) boot-strap sampling, and (iii) Theil-Sen estimation. The same was true where trend

had been incorporated into the Gumbel location parameter. Comparison of approximate

95% confidence intervals for β given by the different methods, showed that 95%

confidence limits given by the different methods bracketed zero for some of the six

methods, whilst for others, they did not, so that different conclusions about statistical

significance would be drawn according to which method was used to estimate the trend.

Acknowledgement.

The author is grateful to Professor Juan Martin Bravo for making daily flow records

available, and to anonymous reviewers for constructive comments.

23

References.

Barry, D., and J. A. Hartigan (1993) A Bayesian analysis for Change-Point Problems, J. American Statist. Ass., 35(3), 309-319. Box, G. E. P., G. M. Jenkins and G. C. Reinsel (2009) Time Series Analysis: Forecasting and Control, Pearson Education (US), N. J. Burn, D. H., M. Sharif and K. Zhang (2010) Detection of trends in hydrological extremes for Canadian watersheds, Hydrol. Process., 24, 1781-1790. Cohn, T. A. and H. F. Lins (2005), Nature’s style: Naturally trendy, Geophys. Res. Lett., 32, L23402, doi:10.1029/2005GL024476. Coles, S. (2001) An Introduction to Statistical Modeling of Extreme Values, Springer- Verlag, London. Erdman, C., and J. W. Emerson (2007) bcp: An R Package for Performing a Bayesian Analysis of Change Point Problems, J. Statist. Software, 23(3), 1-13. Frei, C. (2011) Analysis of Climate and Weather Data. Section 3 of “Trend Analysis, |HS 2011” http://www.iac.ethz.ch/edu/courses/master/electives/acwd/Trend.pdf. Hannaford, J. and G. Buys (2012) Trends in seasonal river flow in the UK, J. Hydrol.,, 475, 158-174. Hannaford, J. and T. J. Marsh (2008) High flow and flood trends in a network of undisturbed catchments in the UK, Int. J. Climatol., 28, 1325-1338. Heffernan, J. E. and A. G. Stephenson (2013) An Introduction to Statistical Modeling of Extreme Values; package ‘ismev’ , http://www.ral.ucar.edu/~ericg/softextreme.php. Hodgkins, G. A. and R. W. Dudley (2006) Changes in the timing of winter-spring streamflows in eastern North America, 1913-2002, Geophys. Res. Lett., 33, L06402. Hosking, J. R. M. and J. R. Wallis (1997) Regional Frequency Analysis, Cambridge University Press, Cambridge, UK. Johnson, R. A. and D. W. Wichern (2007) Applied Multivariate Statistical Analysis (6th edition), Pearson Education (US), N.J. Kendall, M., A. Stuart and J. K. Ord (1983). The Advanced Theory of Statistics, vol.2,

24

Design and Analysis and Time Series, 4th ed., 780 pp., Oxford Univ. Press, Oxford, UK. Khodadadi, A. and M. Asgharian (2008) Change-point Problem and Regression: An Annotated Bibliography. COBRA Preprint Series. Working Paper 44, Montreal, Canada. Koutsoyiannis D (2006) Nonstationarity versus scaling in hydrology, J. Hydrol., 324, 239–254. Marsh, T. J. and C. L. Harvey (2012) The Thames Flood Series – a lack of trend in flood magnitude and a decline in maximum levels, Hydrol. Res., 43, 203-214. Novotny, E. V. and H. G. Stefan (2007) Streamflow in Minnesota: indicator of climate change, J. Hydrol., 334, 319-333.

Petrow, T. and B. Merz (2009) Trends in flood magnitude, frequency and seasonality in Germany in the period 1951-2002. J. Hydrol., 371, 129-141.

Siegel, A. F. (1982) Robust regression using repeated medians, Biometrika 69 (1): 242–

244, doi:10.1093/biomet/69.1.242. Svensson, C., J. Hannaford, Z. W. Kundzewicz and T. Marsh (2006) Trends in river floods: why is there no clear signal in observations? In: Frontiers of Flood Research (Proceedings of Kovacs Colloquium, Paris, June 2006), Publication 305, IAHS Press, Wallingford, UK. Toms, J. D. and M L Lesperance (2003). Piecewise regression: A tool for identifying ecological thresholds, Ecology, 84, 2034-2041, http://dx.doi.org/10.1890/02-0472 Venables, W. N. and B. D. Ripley (1999) Modern Applied Statistics with S_PLUS (Third Edition) Springer-Verlag, New York.

VSN International (2012). GenStat for Windows 15th Edition. VSN International, Hemel Hempstead, UK. Web page: GenStat.co.uk

Wilcox, R R (2012) Introduction to Robust Estimation and Hypothesis Testing (3rd edition, revised), Academic Press Inc, Amsterdam.

25

Wilson, D., H. Hisdal and D. Lawrence (2010) Has streamflow changed in the Nordic countries? Recent trends and comparisons to hydrological projections, J. Hydrol., 394, 334-346.

26

Table 1(a): Descriptions of drainage basins, Rio Grande do Sul, Southern Brazil

Gauge site: Code River: Latitude: Longitude: Area(km2): Annual P

14 de Julho 284 R. Antas -29.0648 -51.6749 12835 1613 1

Castro Alves 98 R. Antas -29.0056 -51.3844 7784 1617 2

Chapeco 94 R. Uruguai -27.1416 -53.0449 52949 1650 3

Ernestina 110 R. Uruguai -28.5556 -52.5456 1047 1646 4

Ita 92 R. Uruguai -27.2767 -52.3822 43954 1623 5

Periods: 1944-2010 1; 1944-2010 2; 1957-2010 3; 1930-1985 4; 1958-2008 5

Table 1(b): Summary statistics of annual maximum one-day flows. Q1 and Q3 are the

first and third quantiles:

Min: Q1: Median: Q3: Max:

14 de Julho 1170 2045 2938 3910 6912

Castro Alves 559 1097 1499 2391 4507

Chapeco 1434 5946 8583 12731 25466

Ernestina 54 154 221 326 729

Ita 1446 5067 7821 11332 25544

27

Table 2(a): Trends (m3s-1yr-1)obtained when trend is incorporated in location parameter

of GEV and Gumbel distributions: i.e., GEV(μ + β t, σ, ξ) and Gumbel (μ + β t, σ )

(Methods 1 and 2),; be removing trend and fitting GEV(μ , σ, ξ) and Gumbel (μ , σ ) to

the residuals (Methods 3 and 4); by bootstrapped Linear Regression (Method 5) with 600

boot-strapped samples, and by using the Theil-Sen estimate (Method 6) with 600 boot-

strapped samples.

Site:

14 de Julho Castro Alves Chapeco Ernestina Ita

Method:

1 5.63 9.80 58.00 1.94 44.61

2 8.47 12.68 62.47 2.11 47.45

3 ]

4 ] 20.88 18.99 86.50 3.85 60.00

5 ]

6 15.63 17.55 79.50 3.67 59.42

Estimates of shape parameter ξ in GEV(μ + β t, σ, ξ), and of ξ in GEV(μ , σ, ξ) after de-

trending:

Method 1 0.1251 0.2249 0.0894 0.0372 0.1426

SE(ξ ) ± 0.1345 ± 0.1306 ± 0.1171 ± 0.1220 ± 0.1311

Method 3: -0.0725 0.0597 0.0340 -0.1133 0.1202

SE(ξ ) ± 0.1083 ± 0.1029 ± 0.0996 ± 0.0716 ± 0.1262

28

Table 2(b): Approximate 95% lower and upper confidence limits (denoted by L, U), and

width of confidence intervals. Units: (m3s-1yr-1).

Site:


Method 1:

L -5.64 3.32 17.76 0.27 11.47

U 16.89 16.26 98.24 3.60 77.75

Width: 22.53 12.94 80.48 3.33 66.28

Method 2:

L -3.20 4.92 19.37 0.78 8.89

U 20.15 20.43 105.57 3.44 86.01

Width: 23.35 15.51 86.20 2.66 77.12

Method 3:

L 9.48 11.93 38.08 2.36 19.22

U 32.34 25.41 135.72 5.36 98.78

Width: 22.86 13.48 97.64 3.10 79.56

Method 4:

L 7.79 10.31 32.78 2.29 12.64

U 34.03 27.03 141.02 5.43 105.36

Width: 26.03 16.72 108.24 3.14 92.72

Method 5:

29

L -5.53 7.80 36.80 2.03 17.20

U 37.24 31.07 133.90 5.62 104.60

Width: 42.37 23.27 97.10 3.59 87.40

Method 6:

L -15.54 -7.82 -51.74 -1.43 -47.45

U 17.02 8.66 50.19 1.44 49.64

Width: 32.56 16.48 101.93 2.87 97.09

30

Table 2(c): Estimates of location parameter α (= μ) and scale parameter σ, from methods

with and without de-trending by least squares. Values of ξ, where relevant, are shown at

the bottom of Table 2(a).

Method 1 (GEV, trend parameter incorporated in location parameter):

α: 2522 1364 7514 195.9 6392

σ: 1063 566.9 3670 106.4 3143

Method 2 (Gumbel, trend parameter incorporated in location parameter):

α: 2598 1439 7699 198.3 6647

σ: 1129 636.5 3818 108.2 3364

Method 3 (GEV, data de-trended):

α: 2639 1418 7622 205.0 6429

σ: 1221 644.8 3838 120.1 3200

Method 4 (Gumbel, data de-trended):

α: 2593 1438 7694 197.8 6646

σ: 1151 660.2 3889 118.1 3383

31

Table 3: Statistics derived from 600 simulated samples, at each of the 5 sites, drawn from

GEV(α + β t, σ, ξ), with parameters α , σ, ξ, β given in Table 2. Means, standard

deviations and standard errors of means, are given for 600 maximum-likelihood estimates ^

β ML, 600 least-squares estimates ^

β LS , and 600 Theil-Sen estimates ^

β TS. Correlations

between maximum-likelihood and least-squares estimates are denoted by r ML,LS .

Similarly for r ML,TS and r TS,LS.


“True” value: 5.626 9.795 58.00 1.938 44.610

Mean ^

β ML 5.742 9.727 58.757 1.931 44.122

SE(Mean) ± 0.248 ±0.125 ±0.883 ± 0.027 ± 0.698

SD(^

β ML ) ±6.083 ±3.059 ±21.612 ±0.659 ±17.083

Mean ^

β LS 6.094 9.608 56.901 1.929 44.437

SE(Mean) ± 0.388 ±0.265 ±1.291 ± 0.035 ± 1.258

SD(^

β LS ) ±9.496 ±6.489 ±31.588 ±0.853 ±30.797

Mean^

β TS 5.879 9.704 57.546 1.931 44.619

SE(Mean) ± 0.310 ± 0.169 ± 1.041 ± 0.027 ± 0.698

SD(^

β TS ) ±7.589 ±4.138 ±25.469 ±0.753 ±22.367

32

r ML,LS 0.613 0.472 0.686 0.751 0.557

r ML,TS 0.791 0.706 0.827 0.858 0.757

r TS,LS 0.877 0.820 0.832 0.917 0.863

33

Figure legends.

Figure 1: Annual maximum one-day flows at five sites, standardized by subtracting the

mean of the sequence and dividing by its standard error.

Figure 2: Site 14 de Julho: quantile-quantile plot when GEV distribution is fitted.

Figure 3. Site 14 de Julho: trends estimated from GEV distribution with trend parameter

incorporated (Method 1: broken black line), and by least-squares fit (Methods 3, 4 and 5:

broken red line). Symbols: YML: trend by maximum likelihood; YR: year; YLS: least

squares; Q: annual maximum discharge.

Figure 4 : Estimate b of trend coefficient β in 600 samples of size 70 “years” drawn

from a GEV distribution with parameters equal to those estimated for the site 14 de Julho.

Red line shows the 45o line. Ordinates along vertical axis are the estimates of β when

trend was incorporated into the GEV distribution; abscissa along the horizontal axis are

estimates of β found be least squares (as for boot-strapped estimate, and de-trended

values subsequently fitted to GEV(α, σ, ξ) ).

34

Figure 1: Annual maximum one-day flows at five sites, standardized by subtracting the

mean of the sequence and dividing by its standard error.

35

Figure 2: Site 14 de Julho: quantile-quantile plot when GEV distribution is fitted.

36

Figure 3. Site 14 de Julho: trends estimated from GEV distribution with trend parameter

incorporated (Method 1: broken black line), and by least-squares fit (Methods 3, 4 and 5:

broken red line). Symbols: YML: trend by maximum likelihood; YR: year; YLS: least

squares; Q: annual maximum discharge.

37

Figure 4 : Estimate b of trend coefficient β in 600 samples of size 70 “years” drawn

from a GEV distribution with parameters equal to those estimated for the site 14 de Julho.

Red line shows the 45o line. Ordinates along vertical axis are the estimates of β when

trend was incorporated into the GEV distribution; abscissa along the horizontal axis are

estimates of β found be least squares (as for boot-strapped estimate, and de-trended

values subsequently fitted to GEV(α, σ, ξ) ).

Documents

How should trends in hydrological extremes be estimated?