Bayesian averaging of classical estimates in forecasting macroeconomic indicators with application...

Preview:

Citation preview

ORI GIN AL PA PER

Bayesian averaging of classical estimates in forecastingmacroeconomic indicators with application of businesssurvey data

Piotr Białowolski • Tomasz Kuszewski •

Bartosz Witkowski

� Springer Science+Business Media New York 2013

Abstract In this paper, we develop a methodology for forecasting key macro-

economic indicators, based on business survey data. We estimate a large set of

models, using an autoregressive specification, with regressors selected from busi-

ness and household survey data. Our methodology is based on the Bayesian aver-

aging of classical estimates method. Additionally, we examine the impact of

deterministic and stochastic seasonality of the business survey time series on the

outcome of the forecasting process. We propose an intuitive procedure for incor-

porating both types of seasonality into the forecasting process. After estimating the

specified models, we check the accuracy of the forecasts.

Keywords Bayesian averaging of classical estimates � Business survey data �Seasonality � Automatic forecasting

JEL Classification C10 � C83 � E32 � E37

1 Introduction

Business and consumer surveys have a long history of describing changes in

economic activity and providing information on the expected future shape of

P. Białowolski (&)

Institute of Statistics and Demography, Warsaw School of Economics,

Warsaw, Poland

e-mail: piotr.bialowolski@sgh.waw.pl

T. Kuszewski � B. Witkowski

Institute of Econometrics, Warsaw School of Economics, Warsaw, Poland

e-mail: tkusze@sgh.waw.pl

B. Witkowski

e-mail: bwitko@sgh.waw.pl

123

Empirica

DOI 10.1007/s10663-013-9227-x

economic processes. In Poland, the history of tendency survey based research dates

from the mid-1980s, as traced in detail in Adamowicz (2008).

The main advantage of business and consumer survey data is their accessibility.

In particular, information gained from surveys can be included in models before

changes in economic indicators are reported in official statistics. Although there is

debate on the soundness and reliability of survey data, economic agents are believed

to have deep intuitive insight into various aspects of macroeconomic processes

(Drabarek 2006). An intuitive understanding of economic processes is only partly

determined by the level of knowledge and expertise one has acquired. Most

economic knowledge and acquisition of economic concepts is believed to be

obtained during an economic socialisation process (Tyszka 2004).

Business survey data are widely used in econometric analyses aimed at

forecasting key macroeconomic indicators. A comprehensive review of existing

approaches and models can be found in Bialowolski et al. (2007). However, the only

macroeconomic multi-equation prognostic model, for the Polish economy, that

uses business survey data is the CLIMA model developed by the team led by

M. Drozdowicz-Biec (Bialowolski et al. 2007, 2008). This model describes cyclical

changes in gross domestic product, unemployment and inflation and is constructed

as a tool for generating medium-term forecasts for these variables on a quarterly

basis. The specification of the model’s equations is based on the fundamentals

of economic theory, while business sentiment indicators are used to improve the

model fit.

The aim of this study is to show that it is possible to build a range of econometric

models based on regressors selected through the Bayesian averaging of classical

estimates (BACE) method. We use a set of business survey data from both Poland

and Poland’s main trading partner, Germany. We start with the assumption that

business survey indicators exhibit properties of a preemptive response to changes in

the values and relationships among key macroeconomic indicators, such as gross

domestic product (GDP) growth, inflation and unemployment. Additionally, we

analyse the impact of seasonality of the reference time series on the fit of

econometric models and on the accuracy of the forecasts generated on the basis of

these models. In the next step of the procedure, the developed econometric models

describing the Polish economy are used to generate short-term quarterly forecasts

of GDP dynamics, inflation dynamics, and the unemployment rate. Finally, the

accuracy of generated forecasts is examined by comparing them with other

economic forecasts.

The paper is arranged as follows. The following (Sect 2) briefly discusses

existing views on macroeconomic forecasting in times of economic crisis. Section 3

focuses on the data used to estimate the econometric models and on the statistical

properties of the time series used in the analysis. Section 4 describes the Bayesian

Averaging of Classical Estimates—the method in general and a modification to it

adopted in this paper. Section 5 examines the fit of obtained forecasting models and

analyses the impact of the seasonality of the time series and of the adopted approach

to modelling seasonality on the quality of the forecasts obtained. Finally, the

forecasts are compared with other predictions available on the issue date of our

forecast. Section 6 concludes.

Empirica

123

2 Macroeconomic forecasting before and after (?) the financial crisis

Macroeconomic forecasting has lost a considerable amount of public confidence in

recent years, due to a large number of inaccurate forecasts that cast doubt on the

ability of commonly used models to produce accurate forecasts. This is reflected in a

new and enhanced version of the well-known academic economics textbook by

Olivier Blanchard (2011). The author presents forecasts made by various American

economists just after the first signs of crisis in December 2007. The economists’

views with respect to short-term prospects for the U.S. economy were diverse. The

optimists among them argued in effect that nothing was wrong and that all

unfavourable economic developments could be neutralised through appropriate

decisions by the Federal Reserve Board. The pessimists, by contrast, suggested that

the decline in demand might prove so strong that it could not be overcome with

simple interest rate cuts and would eventually lead to recession in the U.S. Just nine

months later, it was clear that almost no one (with a few exceptions), at the

beginning of the crisis period, had been able to accurately predict further

developments in the U.S. economy. As Blanchard observes, ‘‘even the most

pessimistic opinions were too optimistic.’’ This view is supported by Coenen (2010)

who gave a presentation during a Workshop on Central Bank Forecasting.

According to Coenen, the models failed to recognise the size and persistence of the

shocks. Especially notable was a lack of proper recognition of developments in the

financial markets. Coenen suggested several remedies, including adoption of

weighting schemes in modelling approaches, to eliminate the worst performing

models from analysis. He also suggested that survey data should be included in

analysis, a view shared by Banbura et al. (2010) who show that survey data lead to

significant improvements in what is known as nowcasting of macroeconomic

variables. Banbura et al. (2010) take a set of variables from a survey conducted

using the methodology of the European Commission (similar to that used by the

Research Institute for Economic Development at the Warsaw School of Economics)

and show a significant improvement in data ‘‘nowcasting’’ up to the point where

official data are released.

Thus, there appears to be a need for further development of forecasting models in

times of poor economic performance. Based on standard statistical procedures, the

economic crisis in the eurozone has resulted in a shift from above-potential output

to below-potential output. What is more, this transition did not result in a significant

decline in inflation, which to a large extent was externally driven (Hucek et al.

2010). Forecasts made in uncertain economic environments associated with large

output gaps and continually high inflation are potentially more prone to bias, if

standard specifications of models are applied. As data from business and consumer

tendency surveys possess the element of common knowledge of economic agents

and therefore should be free of bias in the long run. It is also reasonable to assume

that such data tend to be relatively resistant to structural changes and quick to adapt

to changes in the economic environment. This is well stated in Benes et al. (2010):

‘‘Additional information in the high-frequency data may assume crucial importance

during a crisis or more generally during periods of high uncertainty, when the

regular relationships built into the model may cease to hold’’.

Empirica

123

Taking all the events of recent years into account, it might seem reasonable to

take the rigid approach of Roubini (2010) who state that the economic system

changes and that these changes are not incorporated into economic theory quickly

enough, while the new era requires changes in ways of thinking. Changes in the

economic environment should also have a significant impact on approaches to

economic modelling. As reported in the paper of Lee et al. (2010), developments in

the U.S. economy—where a significant decline in consumption, along with an

increase in the savings, was noted—are expected to have a significant influence on

global growth. The change in global growth patterns should induce changes in the

modelling strategies of other economies.

With these various arguments in mind, we undertake the task of modelling the

key macroeconomic variables of the Polish economy for the period 1996–2011.

Aware of the flaws of current mainstream modelling methods (such as DSGE or

SVAR), we decided to use tendency survey data, as we expect survey data will

provide additional, leading information, while initiating a novel approach to

forecasting with survey data. Our method is based on Bayesian averaging, an

approach to enhancing the process of variable selection and reducing the set of

regressors in the final model to those with the highest information content.

3 Data

To build our forecasting model, we use quarterly data covering the 1996–2011

period. The data on GDP, the consumer price index (CPI) and the unemployment

rate (UNE) are from statistics produced by Poland’s Central Statistical Office

(CSO). The unemployment rate is determined on the basis of a Labour Force Survey

also conducted at Poland’s CSO. The GDP indices come in two variants. ‘‘GDP’’

presents dynamics, calculated according to the ‘‘corresponding period of the

previous year = 100’’ principle, while ‘‘GDP_VOL’’ covers information on

quarterly seasonally unadjusted GDP values at constant 2005 prices, calculated

from the 2010 values, and using the link chain method retroactively for the

1996–2009 period and prospectively for 2011.

In forecasting models, lagged endogenous variables were used as explanatory

variables, in addition to the indicators from the business survey data. The dataset

applied in the modelling procedure comprised time series from the Research

Institute for Economic Development (RIED) at the Warsaw School of Economics

(WSE) on sentiment in the manufacturing industry, trade, construction and

households. In all fields (except for the manufacturing industry, where the survey

is conducted monthly), RIED conducts quarterly surveys, which are collected in the

first month of each quarter, i.e., in January, April, July and October.1 An additional

source of data were indices published by the Centre for European Economic

Research (ZEW), the Leibniz Institute for Economic Research at the University of

1 For the survey of the manufacturing sector, conducted on a monthly basis, data from January, April,

July and October were used.

Empirica

123

Munich (Ifo Institute), the Bureau for Investments and Economic Cycles (BIEC)

and the data on Purchasing Managers’ Index (PMI) for Polish industry.

For most of the time series from the business and consumer surveys (except for

zew_ies and PMI), values are expected to hover at around 100 because this value

represents an equal proportion of positive and negative responses to a given

question. However, in order to provide a common base for all variables, a constant

of 100 was subtracted from the Ifo, BIEC and RIED indices. From the PMI index,

we subtracted 50, because in this case this value denoted a neutral level.

With the set of time series prepared in this way, the order of integration and

seasonal properties were examined for each variable. ADF and KPSS tests were

used to determine order of integration. No time series with an order of integration

higher than 1 were identified in the database. It turned out that, in many cases, the

two tests did not provide consistent conclusions about a given series. In particular,

for the variables2 biec_wwk, biec_wrp, biec_wd, ind_q4f, ind_q5s, ind_q5f,

ind_q6s, ind_q6f, ind_q8s, hhs_q4, hhs_q5, hhs_q7, and hhs_q8, the conclusions

about the order of integration of the time series resulting from the two tests were

different. The general outcome of the analysis is that the time series for responses to

business survey questions targeted at the industrial sector are stationary I(0) time

series, while the time series for responses to business survey questions targeted at

households are integrated I(1) time series. However, the remaining time series are

also stationary. This explains why we decided against differentiating the values of

the series I(1) and instead studied the statistical properties of the residual series of

the estimated models.

We also faced the problem of seasonality in the time series of business survey

data. We treat seasonality as a consequence of the presence of different social and

climate factors. However, we do not limit our analysis to regular seasonality, but

allow it to change from one year to the next. For that reason, our investigation

covers both deterministic and stochastic seasonality. The task of testing for

deterministic seasonality in the time series used was reduced to that of estimating

the parameters of a trend model for a given series (linear or nonlinear), with binary

variables designed to detect seasonal deviations. Stochastic seasonality was assessed

using the Demetra? software package (Grudkowska 2011), but we decided to apply

a simple approach to testing the values of the regressor parameters, specifically, the

value of the time series component lagged four periods (Enders 1995).

In the time series for the variables3 zew_ies, pmi, ifo_bs, ifo_be, biec_wwk,

biec_wpi, biec_wrp, biec_wd, we did not find deterministic seasonality, but the

presence of stochastic seasonality was not rejected. In all time series of the RIED

data, deterministic seasonality was detected and the presence of stochastic

seasonality was not rejected. Because taking into account the occurrence of each

type of seasonality influences the quality of forecasts (Franses and Lof 2000), we

decided to consider seasonality in the final versions of the forecasting models.

2 The list of symbols used for the further analyses was presented in ‘‘Appendix’’.3 Full names can be found in ‘‘Appendix’’.

Empirica

123

4 Prognostic models

4.1 Modelling strategy

The main aim of our research was to estimate a model capable of forecasting the

quarterly figures of macroeconomic indicators—GDP (gross domestic product), CPI

(consumer price index) and UNE (unemployment rate), using business and

consumer survey data. We decided to avoid adopting an economic theory that

would suggest which indicators were of key importance for a given dependent

variable and should thus be selected as regressors in the model. In our previous work

(Bialowolski et al. 2010), we used a partly modified stepwise regression technique

for regressor selection. Depending on the selected stepwise algorithm, different

models were found as the best fitting ones. In this study, the selection of regressors

was based on the BACE, a method recently proposed by Sala-i-Martin et al. (2004)

and that has gained popularity among macroeconomists. In most previous research,

it has mainly been used to find the ‘‘best possible’’ growth factors. We believe it can

also be applied to the selection of a set of relevant indicators, as in the present

analysis.

Let M = {X1, X2,…, XK} be the set of K potential regressors in a single

regression equation in which Y is the dependent variable. Suppose that we are

interested in evaluating the influence of individual components of M on Y. In brief,

the idea of BACE is to first estimate all 2K - 1 possible models, with every possible

subset of M as independent variables. However, in the case of large values of K, the

estimation of all models might exceed current computing capabilities. In many

applications, therefore, instead of estimating all possible models (with each possible

subset of M), a large number of possible subsets of M is drawn and only models

involving those variables are estimated. The next step is to calculate the parameters

in the ‘‘final’’ regression (bk). These are derived as weighted averages of the

estimates of the parameters on Xk, obtained in each of the estimated models by using

a measure of ‘‘goodness of fit’’ of each model as the weight.4

This approach, based on BACE, has two main advantages. First, the number of a

priori assumptions regarding the set of independent variables is reduced as much as

possible: with a predefined set of possible independent variables Xk, the only prior

assumption made concerns the number of variables in the relevant model. Under the

BACE approach, it is possible to select the relevant regressors for the equation of Y

by assigning a probability of relevance to each of the estimated models and then

considering as relevant those elements of M included in the models whose estimated

probability of relevance is the highest. Second, the final bk’s are calculated using all

the obtained estimates—without limiting our focus to a given subset of M, which

would be based purely on a researcher’s views.

We here create three-equation models. To improve the quality of the forecasts,

we include as variables in the equations lagged values of GDP, CPI and UNE. We

also believe that there are interactions between the three macroindicators, which

compels us to include them not only as dependent, but also independent, variables.

4 The weight used in the procedure is given by Eq. 4.

Empirica

123

In order to avoid endogeneity problems, we construct a recurrent model in

accordance with the following procedure: let Y1, Y2 and Y3 be the macroindicators of

interest (GDP, CPI and UNE) ordered in a given way (one of the six possible ways).

The general form of our model is then:

Y1;t ¼ f1ðY1;t�1;M1; e1;tÞY2;t ¼ f2ðY2;t�1; Y1;t;M2; e2;tÞY3;t ¼ f3ðY3;t�1; Y1;t; Y2;t;M3; e3;tÞ;

ð1Þ

where: Yi,t—value of i-th macroindicator (GDP, CPI, UNE) from period t, i = 1, 2,

3, Yi;t—theoretical value of the i-th macroindicator (from the i-th equation of the

model), Mit—business survey indicators selected for explaining i-th macroindicator,

ei,t—error term in i-th equation, fi—linear function in i-th equation.

The model is estimated following a 2SLS-logic, which ensures that there are no

direct endogeneity problems with the Yi indicators, as these do not appear directly as

independent variables—only the theoretical values of these variables are used for

this purpose. Thus, we are able to use a regular equation-by-equation OLS

estimation of the model. Of the six possible ways of ordering the variables in a

three-equation model, we decided, based on results of our previous research

(Bialowolski 2010, 2011), to use the following order: Y1 = GDP, Y2 = UNE and

Y3 = CPI.

Our goal was to produce out-of-sample forecasts for three quarters ahead. All of

the Yi,t’s can be used to calculate Yi,t?10s, which can be treated as independent

variables in the subsequent models. As business survey indicators are not predicted

in our models and our forecasting horizon is three quarters ahead, these indicators

are lagged three periods. Thus the model (1) can be written:

GDP1;t ¼ f1ðGDP1;t�1;M1;t�3; e1;tÞUNE2;t ¼ f2ðUNE2;t�1;GDP1;t;M2;t�3; e2;tÞCPI3;t ¼ f3ðCPI3;t�1;GDP1;t;UNE2;t;M3;t�3; e3;tÞ:

ð2Þ

Relevant Mi,t-3 for i = 1, 2, 3 are selected through application of a modified

BACE algorithm.5 In the classical BACE approach, each independent variable is

potentially irrelevant and subject to elimination from the model, whereas in our

case, similarly to (Prochniak and Witkowski 2013), we retain certain variables

(lagged dependent variables and the theoretical values of the variables from

previous equations) in each model. The BACE algorithm that we use can be briefly

summarised as follows:

The number of variables in M (denoted as K) is 41, which yields a total number

of possible models of (241 – 1)3. As so large a number cannot be handled

computationally, we instead draw one million subsets of M with an average number

of 20 Xk’s per subset and an equal probability of entering each model for each

variable. This reflects our strategy of not having any economic preference for

particular Xk’s. In the next step, we estimate the model defined by (2), with a prior

5 A detailed description of Bayesian averaging algorithms can be found, e.g., in Moral-Benito (2011).

Empirica

123

assumption of the number of Xk’s in each equation of an ‘‘ideal’’ model (2). We

denote this number as �s. In our case, based our previous research, we assume that

�s = 10.

The next steps of the procedure are based on the following assumptions: let us

denote a model with the j-th drawn subset of M as Wj and the number of variables of

the subset of M in Wj as Kj. Further, assuming the independence of Xk’s and no

knowledge of which variables should be introduced into the model, using a negative

binomial, we can state that the prior probability that a given Xk is in the model is �sK,

whereas the prior probability of the relevance of Wj is given by

PðWjÞ ¼�s

K

� �Kj

1� �s

K

� �K�Kj

ð3Þ

and is equal for every Wj that shares the same Kj. Using Bayes’ rule and the Schwarz

approximation, the posterior probability of the relevance of Wj, that is, the proba-

bility ‘‘supported’’ by the data, denoted as P(Wj|D), is given by

PðWjjDÞ ¼PðWjÞn�Cj;i=2SSE

�n=2jP1;000;000

p¼1 PðWpÞn�Cp;i=2SSE�n=2p

; ð4Þ

where: n—number of observations in the sample (effectively the length of the time

series); cj,i, cp,i—the total number of independent variables in the ith equation of

models Wj and Wp, respectively, which equals Kj ? 1 for the GDP equation, Kj ? 2

for the UNE equation and Kj ? 3 for the CPI equation; SSEj, SSEp—sum of squared

residuals in models Wj and Wp, respectively.

The probabilities given by (4) can be treated as measures of ‘‘probability of

relevance’’ of a given model Wj. Such probability measures are computed for each

of the 1 million GDP equations, each of the 1 million UNE equations and each of

the 1 million CPI equations.

The final step of our adapted BACE procedure is to select the variables for the

three equations of interest. To do so, for each Xk in M, we compute its posterior(‘‘confirmed’’ by the dataset) probability of relevance for each of the three equations

separately, for a given �s:

kiðXkj�s;DÞ ¼X1;000;000

j¼1

PðWjjDÞ � Ik;j; ð5Þ

where Ik,p is an indicator function that takes a value of 1 if Xk 2 Kj and 0 otherwise.

Posterior probabilities for each of the Xk’s are used to confirm the relevance of a

given Xk in a given equation—its relevance is confirmed if its posterior probability

is higher than its prior probability. Thus, for each of the three equations, we select

those Xks for which the value of (5) exceeds �sK, which equals 0.24 in the present case.

It must also be emphasised that, although there is one common model, variables for

each equation are selected separately, and the final sets of business survey indicators

used as regressors for GDP, UNE and CPI need not be the same ones across the

different equations.

Empirica

123

Having selected the sets of regressors for each of the equations, we then consider

two further problems:

1. possible nonstationarity of the dependent variables, resulting in spurious

regressions and thus inaccurate forecasts;

2. possible violation of the OLS assumptions, in particular those concerning

properties of the error terms in each of the equations.

To evaluate the problem of nonstationarity of the dependent variables, we check

for cointegration of the estimated models. A lack of cointegration might raise

serious doubts about the quality of the obtained results. In order to eliminate the

influence of trend, we simply try to introduce it into the model. Finally, we check for

violation of the OLS assumptions, for autocorrelation of the error terms and for

collinearity among the selected Xk’s.

4.2 Estimation

The procedure described in Sect. 4.1 was run twice, separately for GDP and for

GDP_VOL. The initial M used to select particular Mi’s was the same for each

equation and for each of the two models (which differ in the measure of aggregate

of GDP used). A full list of variables in M is provided in ‘‘Appendix’’.

For each of the two types of model (2), we first selected business survey

indicators for the final model, using BACE. In Table 1, we present the posteriorprobabilities of inclusion (4) of particular variables in each of the equations. The

probabilities for the variables selected for a given equation are bolded. It should be

noted that the distribution of kiðXkj�s;DÞ for some of the equations is such that there

are a number of variables whose posterior probabilities of inclusion exceed their

prior probabilities. In such cases, we reduce the final set of independent variables in

the i-th equation to an initial set of the ten variables with the highest kiðXkj�s;DÞ.However, among the set of variables selected in this way, near multicollinearity is

present in some cases. To address this issue, we sequentially drop variables that are

strongly collinear (that is, variables whose variance inflation factor in a given

equation exceeds 10). The posterior probabilities for such variables are shaded in

the appropriate columns of Table 1.

As noted in the previous section, we ran selected validation tests and procedures.

Specifically, we ran the ADF tests for nonstationarity in the residuals, finding that it

is not present. As the dependent variables are mostly nonstationary, we concluded

that there is cointegration in the tested equations and thus that the modelled

variables remain in a balanced state in the long term. Similarly, using both Durbin

and LM tests for autocorrelation, we concluded that there is no autocorrelation up to

order 4.

With application of the BACE procedure for two model specifications (1—GDP,

UNE, CPI; 2—GDP_VOL, UNE, CPI), we established the sets of regressors in

individual equations. In the following step, we estimated the parameters of the

models with the previously identified regressors, additionally taking into account

seasonal components in three variants: deterministic (introducing a set of quarterly

dummy seasonal variables), stochastic (imposing a MA (4) process on the error

Empirica

123

Table 1 Posterior probabilities of inclusion of particular variables for equations in the GDP and

GDP_VOL models

Regressor Xk Model for GDP Model for GDP_VOL

GDP UNE CPI GDP_VOL UNE CPI

zew_ies 0.529973 0.473289 0.570007 0.204261 0.289795 0.172322

Pmi 0.143643 0.120608 0.144214 0.235723 0.343614 0.186498

ifo_bs 0.244707 0.046347 0.064009 0.087284 0.533400 0.197818

ifo_be 0.111710 0.147197 0.125835 0.362892 0.193616 0.094349

biec_wwk 0.092843 0.477494 0.036627 0.999998 0.999703 0.291972

biec_wpi 0.045370 0.754221 0.049681 0.377101 0.723594 0.168876

biec_wrp 0.186897 0.156539 0.284151 0.077910 0.150351 0.327932

biec_wd 0.324782 0.049047 0.084054 0.273071 0.136324 0.165020

ind_q1s 0.247212 0.973989 0.215227 0.877355 0.292578 0.274490

ind_q1f 0.179209 0.468517 0.25861 0.284972 0.307676 0.205343

ind_q2 s 0.142753 0.059925 0.236697 0.213152 0.260536 0.238870

ind_q2f 0.447050 0.085075 0.115459 0.115784 0.316939 0.110308

ind_q3s 0.097127 0.204787 0.260067 0.255725 0.208179 0.327371

ind_q3f 0.071420 0.179672 0.268869 0.435574 0.123372 0.235761

ind_q4s 0.200863 0.109203 0.137070 0.127792 0.104256 0.068615

ind_q4f 0.194507 0.133494 0.068642 0.270591 0.062994 0.172786

ind_q5s 0.141234 0.170285 0.044506 0.135926 0.262133 0.205454

ind_q5f 0.341719 0.054075 0.127928 0.169695 0.112532 0.232022

ind_q6s 0.181444 0.514534 0.258962 0.058501 0.084833 0.229098

ind_q6f 0.073308 0.095002 0.267408 0.193924 0.286455 0.068994

ind_q7s 0.308216 0.980561 0.237513 0.523176 0.376295 0.187030

ind_q7f 0.335049 0.077684 0.339904 0.374239 0.301672 0.235311

ind_q8s 0.181177 0.455188 0.087633 0.282741 0.383942 0.165216

ind_q8f 0.358626 0.603194 0.073181 0.202558 0.622445 0.145072

ind_r 0.077118 0.099542 0.116935 0.163334 0.296612 0.164803

hhs_q1 0.254419 0.281131 0.038463 0.091679 0.083131 0.086198

hhs_q2 0.086899 0.239642 0.228539 0.608905 0.205476 0.215549

hhs_q3 0.043738 0.418542 0.055954 0.193661 0.292155 0.127054

hhs_q4 0.069227 0.056893 0.146240 0.315878 0.144906 0.157511

hhs_q5 0.043636 0.22991 0.270003 0.224784 0.180944 0.149217

hhs_q6 0.945867 0.137937 0.263677 0.146575 0.240750 0.220438

hhs_q7 0.038287 0.131539 0.228455 0.130678 0.164926 0.073209

hhs_q8 0.121711 0.436050 0.106511 0.703570 0.187004 0.341998

hhs_q9 0.258586 0.172663 0.302928 0.070369 0.730140 0.427353

hhs_q10 0.178904 0.157873 0.235332 0.263119 0.075909 0.178285

hhs_q11 0.162894 0.270946 0.183867 0.302393 0.215743 0.204204

hhs_q12 0.156168 0.183020 0.301902 0.068248 0.244099 0.191933

Hhs 0.212643 0.028864 0.119975 0.292017 0.154258 0.099906

Trade 0.108158 0.600264 0.217376 0.099199 0.330489 0.235338

Empirica

123

term), and both simultaneously. Tables 2, 3 and 4 show the results of the estimation

of equations for each variable: GDP, UNE, and CPI.

In none of the three models in which we attempted to account for the seasonality

of changes in GDP were variables characterising this seasonality statistically

significant. This is probably due to the fact that the reference variable is expressed

as a year-on-year growth rate.

In all three models in which we attempted to account for seasonality in changes in

UNE, variables that characterise seasonality were found to be statistically significant.

For most regressors, including seasonality does not drastically influence the value of

the estimates of the regressors’ parameters. Thus we conclude that one can hypothesise

that changes in the value of the unemployment rate are seasonal in nature. It is also

worth noting that no variables among the regressors come from the household survey.

An analysis of the seasonality of changes in CPI reveals that only the model with

stochastic seasonality contains statistically significant variables describing the seasonal

process. The introduction of stochastic seasonality results in a change of the parameter

signs only for the ind_q3f and ind_q7f variables. Additionally, in the CPI models, we

note significant variation in the constant term under different seasonal specifications.

Table 2 Estimates of parameters in GDP equations

Regressor Models

No

seasonality

Deterministic

seasonality

Stochastic

seasonality

Both types of

seasonality

GDP(-1) 0.87099 0.85162 0.91933 0.89558

zew_ies -0.00554 -0.00595 -0.00296 -0.00397

biec_wd -0.01502 -0.01823 -0.00937 -0.01967

ind_q2f -0.03630 -0.03892 -0.04256 -0.04436

ind_q5f -0.00990 -0.01095 -0.01614 -0.01532

ind_q7s -0.00133 0.01353 -0.00202 0.01520

ind_q8f 0.02008 0.01584 0.02659 0.02053

hhs_q1 -0.01207 -0.01505 -0.01538 -0.01487

hhs_q6 -0.04353 -0.04997 -0.04533 -0.05078

hhs_q9 0.01487 0.02028 0.01595 0.02100

Const 3.21615 3.97296 3.06701 3.89963

Own estimates. All regressors lagged by 3 periods unless specified otherwise

Table 1 continued

Regressor Xk Model for GDP Model for GDP_VOL

GDP UNE CPI GDP_VOL UNE CPI

Agri 0.146806 0.999196 0.507167 0.279091 0.964015 0.667514

Constr 0.138002 0.469658 0.079768 0.913880 0.105246 0.193564

Own estimates. Columns show posterior probabilities for particular Xk’s in the respective models.

Probabilities for variables selected for particular equations are bolded. The probabilities for variables

selected for particular equations and those dropped due to multicollinearity are bolded and italicized

Empirica

123

Two variables, zew_ies and ind_q7, were present in all the estimated equations of

the model. The first describes changes in business sentiment in Germany, which is

found to have a very significant influence on the Polish economy. In recent years,

the Polish economy has been largely driven by the growth of exports, its main

export market being Germany. The second variable (ind_q7) indicates business

executives’ views about the financial conditions of their companies.

Table 3 Estimates of parameters in UNE equations

Regressor Models

No

seasonality

Deterministic

seasonality

Stochastic

seasonality

Both types of

seasonality

UNE(-1) 0.92739 0.95250 0.93913 0.96021

GDP -0.19378 -0.18612 -0.22500 -0.21489

zew_ies -0.00325 -0.00174 -0.00136 -0.00120

biec_wpi 0.04949 0.04951 0.04399 0.04184

ind_q1s 0.04152 -0.00321 0.02138 -0.00322

ind_q7s -0.06273 -0.02539 -0.04271 -0.02769

ind_q8f 0.02755 0.02526 0.02622 0.02589

Trade 0.02128 0.01772 0.03317 0.02889

Agri -0.04430 -0.04260 -0.04535 -0.04511

Constr -0.01193 -0.00916 -0.01178 -0.00311

Const 1.55463 1.08792 1.58558 1.01892

Own estimates. All regressors lagged by 3 periods unless specified otherwise

Table 4 Estimates of parameters in CPI equations

Regressor Models

No

seasonality

Deterministic

seasonality

Stochastic

seasonality

Both types of

seasonality

CPI(-1) 0.83816 0.85233 0.86764 0.84039

GDP 0.36635 0.36457 0.41948 0.28196

UNE 0.09501 0.05464 0.05638 0.05672

zew_ies -0.00284 -0.00330 -0.00422 -0.00165

ind_q3f -0.00396 0.00832 0.02927 0.01366

ind_q7f 0.01330 0.00230 -0.03800 -0.01461

hhs_q5 0.01982 0.01486 0.00583 0.01105

hhs_q6 0.00889 0.00943 0.03026 -0.00105

hhs_q9 0.01442 0.01119 0.01599 0.02219

hhs_q12 0.03610 0.03047 0.03885 0.03729

Agri -0.02242 -0.03466 -0.03189 -0.02299

Const -2.83184 -2.66463 -3.62635 -1.36974

Own estimates. All regressors lagged by 3 periods unless specified otherwise

Empirica

123

5 Forecasting

For forecasting purposes, we decided to discard the model that included the value of

gross domestic product at constant prices (GDP_VOL). The estimates for this model

were not satisfactory and forecasts generated on the basis of this model would very

likely be highly unstable. We also estimated a forecasting model called the final

model (Final), which maintains the previously identified sets of regressors while

incorporating information about the significance of seasonal components. In the

GDP equation, seasonality components were ignored; in the UNE equation, both

types of seasonality were modelled; while in the CPI equation, only stochastic

seasonality was taken into account.

Before conducting an out-of-sample forecast of GDP, UNE and CPI, we

compared the models in terms of how well they fit the data. We decided to check

their performance using RMSEs for individual quarters of 2009–2011 and

2010–2011, treating the theoretical values of the endogenous variables as ex post

forecasts. We used the period 2009–2011, in order to check the model

performance during a rapid transition between high growth rates and stagnation.

The period 2010–2011 represents a period of moderate growth for the Polish

economy.

The RMSE values are expressed in the same units as the endogenous variables,

which in our case are percentage points. The values given in Table 5 show that

taking seasonality into account slightly improves the data fit, making it less probable

that a generated forecast will exhibit a major error. In the final model, the error

decreases markedly only for the UNE variable. At the same time, it can be seen that

the mean errors for the 2010–2011 period are generally lower than those for the

2009–2011 period. This reflects the fact that significant change in economic

conditions, that had the strongest impact on the economy at the end of 2008 and in

2009, significantly reduced the forecasting ability of our model.

Table 5 Root mean square errors in 2009–2011 and 2010–2011

Regressand Models

No

seasonality

Deterministic

seasonality

Stochastic

seasonality

Both types of

seasonality

Final

Quarters 2009–2011

GDP 0.74 0.68 0.71 0.66 0.74

UNE 0.62 0.42 0.38 0.37 0.30

CPI 0.54 0.54 0.39 0.39 0.42

Quarters 2010–2011

GDP 0.54 0.52 0.57 0.56 0.54

UNE 0.59 0.42 0.25 0.30 0.20

CPI 0.53 0.58 0.37 0.39 0.42

Own estimates

Empirica

123

6 Concluding remarks

This paper is a follow-up to our previous research conducted in 2010 and 2011. In

this study, we constructed a prognostic model of three key macroeconomic

indicators: GDP growth, the unemployment rate and the consumer price index. We

used the Bayesian averaging of classical estimates method to build our model,

making necessary modifications to it, as needed, to serve the purposes of the study.

The model was modified to enable us to predict the values of the endogenous

variables without making any assumptions about the values of the regressors in the

forecast period. The modifications also ensured a specific sequence of relationships

between endogenous variables. Another change from our former approach is that

here we considered the possibility of seasonality in both the exogenous and

endogenous variables. The results of our study are promising. They show that it is

possible to build models that take into account both deterministic and stochastic

seasonality and that this procedure improves the goodness-of-fit of such models. It

also turns out that our forecasts are almost identical to those generated by the DSGE

model of the National Bank of Poland and similar to the 2012 projections of the

Gdansk Institute for Market Economics. It might be interesting to examine to what

extent applying the general Bayesian averaging technique to DSGE models might

yield even better results than those obtained with the atheoretical models considered

here, although such a process could be extremely time-consuming. Given that even

with relatively simple linear models, estimating hundreds of thousands of equations

takes considerable time, one might expect such computations to take far longer

when a more complex structure or estimation method is applied, as with DSGE

models. Nevertheless, this appears to be an interesting area for future research.

There is still a need for further development of this project. We would like to

examine the impact of the progressive aging of past data on the quality of the

estimated models and on the accuracy of their forecasts. We plan to test the

prognostic usefulness of business activity indicators other than those already

included in the model.

Acknowledgment The authors would like to thank the Research Institute of Economic Development at

the Warsaw School of Economics and especially prof. Elzbieta Adamowicz for support with the project.

An earlier version of this paper was published in Polish in Prace i Materiały IRG SGH.

Appendix: The list of symbols adopted for variables for the estimationprocedure

zew_ies ZEW indicator of economic sentiment

ifo_bs Ifo business situation indicator

ifo_be Ifo business expectations indicator

biec_wwk BIEC leading index

biec_wpi BIEC future inflation index

biec_wrp BIEC future unemployment rate index

biec_wd BIEC well-being index

Empirica

123

pmi Purchasing Managers’ index (PMI) for Polish industry

ind_xxx Balance of responses to question ‘xxx’ from a business sentiment

survey in industry RIED

hhs_xxx Balance of responses to question ‘xxx’ from a consumer sentiment

survey RIED

trade Business sentiment indicator RIED in trade

agri Business sentiment indicator RIED in agriculture

cons Business sentiment indicator RIED in construction

See Tables 6 and 7.

Table 6 Questions from the business sentiment survey in industry

Symbol Question (ind_xxs—current state, ind_xxf—projection)

ind_q1 Production

ind_q2 Total orders

ind_q3 Export orders

ind_q4 Stock of finished products

ind_q5 Prices of goods produced by enterprise

ind_q6 Employment

ind_q7 Financial standing

ind_q8 Poland’s macroeconomic performance

Business sentiment survey in industry, Research Institute for Economic Development, Warsaw School of

Economics

Table 7 Questions from the consumer sentiment survey

Symbol Question

hhs_q1 Assessment of household financial status, compared with the situation 12 months earlier

hhs_q2 Projected household financial status in the next 12 months

hhs_q3 Performance of the Polish economy in the last 12 months

hhs_q4 Projected performance of the Polish economy in the next 12 months

hhs_q5 Comparison of maintenance costs now and 12 months earlier

hhs_q6 Projection for the inflation rate in the next 12 months

hhs_q7 Projection for the unemployment rate in the next 12 months

hhs_q8 An advantage to make major purchases at the present time

hhs_q9 Projected spending on durable consumer goods over the next 12 months in relation to the level

reported in the last 12 months

hhs_q10 Assessment of savings and the climate for saving in the context of the country’s

macroeconomic performance

hhs_q11 Projected household’s saving in the next 12 months

hhs_q12 Financial position of the household

Survey of households, Research Institute for Economic Development, Warsaw School of Economics

Empirica

123

References

Adamowicz E (ed) (2008) Koniunktura gospodarcza—20 lat doswiadczen Instytutu Rozwoju Gospod-

arczego Szkoły Głownej Handlowej, Oficyna Wydawnicza Szkoły Głownej Handlowej, Warszawa

Banbura M, Giannone D, Reichlin L (2010) Nowcasting, European Central Bank. Working paper series,

No. 1275, December 2010

Benes J, Clinton K, Johnson M, Laxton D, Matheson T (2010) Structural models in real-time. IMF

Working Paper, WP/10/56

Bialowolski P, Kuszewski T, Witkowski B (2010) Business survey data in forecasting macroeconomic

indicators with combined forecasts. Contemp Econ 4(4):41–58

Bialowolski P, Drozdowicz-Biec M (eds) Lada K, Pater R, Zwiernik P, _Zochowski D (2007) Forecasting

with composite coincident and leading indexes and the CLIMA model. The case of Poland, Oficyna

Wydawnicza SGH, Warszawa

Bialowolski P, Drozdowicz-Biec M, Lada K, Pater R, Zwiernik P, _Zochowski D (2008) Forecasting with

composite coincident and leading indexes and the CLIMA model 29th CIRET Conference, Santiago

de Chile

Bialowolski P, Kuszewski T, Witkowski B (2011) Prognozy podstawowych wskaznikow mak-

roekonomicznych z u _zyciem danych z testow koniunktury, Modelowanie i prognozowaniegospodarki narodowej. Prace i materiały Wydziału Zarzadzania Uniwersytetu Gdanskiego, No.

4/8, pp. 49–64

Blanchard O (2011) Macroeconomics, Fifth Expanded Edition, published by Pearson Education Inc.,

publishing as Prentice Hall

Coenen G (2010) Central Bank forecasting during the financial crisis. Presentation during Workshop on

Central Bank Forecasting, 14–15 October 2010

Drabarek A (2006) Intuicja. Poznanie bezposrednie, Wydawnictwo Wy _zszej Szkoły Handlu i Prawa im.

Łazarskiego R, Warszawa

Enders W (1995) Applied econometric time series. Wiley, New York

Franses PH, Lof M (2000) On forecasting cointegrated seasonal time series. Econometric Institute Report.

Erasmus University, Rotterdam

Grudkowska S (2011) Demetra?. User manual. National Bank of Poland, Warsaw

Hucek J, Relovsky B, Siroka J (2010) The impact of the global economic and financial crisis on the

potential GDP. http://www.nbs.sk/_img/Documents/PUBLIK/MU/potential_output_ENG.pdf, vis-

ited at 18.03.2012

Lee J, Rabanal P, Sandri D (2010) U.S. Consumption after the 2008 Crisis, International Monetary Fund.

Authorized for distribution by Olivier Blanchard

Moral-Benito E (2011) Model averaging in economics. Working paper

Prochniak M, Witkowski B (2013) Time stability of the beta convergence among EU countries: Bayesian

model averaging perspective. Econ Model 30:322–333

Roubini N (2010) Crisis economics: a crash course in the future of finance. The Penguin Press, Shiller

Sala-i-Martin X, Doppelhofer G, Miller R (2004) Determinants of long-term growth: a Bayesian

averaging of classical estimates (BACE) approach. Am Econ Rev 94:813–835

Tyszka T (ed) (2004) Psychologia ekonomiczna, Gdanskie Wydawnictwo Psychologiczne, Gdansk

Empirica

123

Recommended