252
Multiple regression In the previous section, we have covered a regression with one explanatory variable. In this section, we will cover the regression model with more than one explanatory variables. We will also cover the analysis of variance in terms of the Chow test. We will also cover the adjusted R 2 and how it affects the inclusion or exclusion of independent variables. The R 2 is affected by adding independent variables. We will also cover the ANOVA table single factor or analysis of variance. It is used to show the decomposition of the variance explained by the independent variables and the error term in the regression equation. Let’s take as an example a financial model related to the returns of a share price in relation to market returns and price earnings ratio expressed in percentages. The dependent variable is share price returns expressed in percentage and the independent variables are market price returns and price earnings ratio expressed in percentages. The mathematical equation will be as follows: y i = Where: y i = share price returns. It is the dependent variable. 1

Introduction to Econometrics 1

Embed Size (px)

Citation preview

Page 1: Introduction to Econometrics 1

Multiple regression

In the previous section, we have covered a regression with one explanatory variable. In this section, we will cover the regression model with more than one explanatory variables. We will also cover the analysis of variance in terms of the Chow test. We will also cover the adjusted R2 and how it affects the inclusion or exclusion of independent variables. The R2 is affected by adding independent variables. We will also cover the ANOVA table single factor or analysis of variance. It is used to show the decomposition of the variance explained by the independent variables and the error term in the regression equation.

Let’s take as an example a financial model related to the returns of a share price in relation to market returns and price earnings ratio expressed in percentages.

The dependent variable is share price returns expressed in percentage and the independent variables are market price returns and price earnings ratio expressed in percentages.

The mathematical equation will be as follows:

yi = Where: yi = share price returns. It is the dependent variable.

1

Page 2: Introduction to Econometrics 1

A numerical example will be very helpful to understand the calculations that are involved in a multiple regression analysis with two independent variables. In this example, we use the difference of the numerical values from their means displayed in the following columns . Please consider the example that we have covered in simple regression section by adding an additional independent variable. The dependent variable is annual profits and the independent variables are expenditures for R&D and marketing expenses.

Year Annual profits(£000) (y)

Expenditures for R&D(£000) (X1)

Marketing expenses(£000) (X2)

2003 31 5 42002 40 11 102001 30 4 82000 34 5 71999 25 3 51998 20 2 3

A convenient way to solve the multiple regression problem is to construct the following Table. Please sum each variable. Then, find the mean of the dependent and independent variables. Then, subtract from each value from its mean.

Year(n=6)

Annual profits(£000) (y)

Expenditures for R&D(£000) (X1)

MarketingExpenses (£000) (X2)

y x1 x2

2003 31 5 4 0 0 -2.1666672002 40 11 10 10 6 3.8333332001 30 4 8 -1 -1 1.8333332000 34 5 7 3 0 0.8333331999 25 3 5 -6 -2 -1.1666671998 20 2 3 -11 -3 -3.166667

∑y = 180 ∑x1 = 30

x1y x2y x1x2

2

Page 3: Introduction to Econometrics 1

0 -2.16667 0 0 4.6960 38.33333 23 36 14.69

0 0 -1.83333 1 3.360 3.333333 0 0 0.69

10 5.833333 2.333333 4 1.3630 31.66667 9.5 9 10.03

∑x1y =100 77 33 ∑ = 50

Please calculate , 1 and 2 . I have included the required equations.

First of all, we calculate b1 and b2.

443.1652941ˆ

1

1089174133003850ˆ

2

3

Page 4: Introduction to Econometrics 1

844.0652550ˆ

2

Then, we substitute the numerical values of 1 and b2 in the following equation to find the intercept.

= 30 – ( x 5) – ( x 6.166667) = 30 – 7.22 – 5.20 = 17.58.

The regression equation may now be written as:

I have included the Excel output that you will get by running a multiple regression. Please use the Data analysis pack in Tools. Then, select regression. In the Y range, input the numerical values of the dependent variable. In the X range, input the numerical values of both independent variables. Select labels and confidence levels. Then, select the cell that your output will be displayed. Select by clicking on the residuals and residuals plots. This will show the error term of your multiple regressions as the difference between actual and predicted values.

4

Page 5: Introduction to Econometrics 1

SUMMARY OUTPUT

Regression StatisticsMultiple R 0.929919713R Square 0.864750673Adjusted R Square 0.774584455Standard Error 3.303045922Observations 6

ANOVA

  df SS MS FSignificance

FRegression 2 209.2697 104.6348 9.590628 0.049739643Residual 3 32.73034 10.91011Total 5 242      

Confidence Intervals

  CoefficientsStandard

Error t Stat P-value Lower 95% Upper 95%Lower 95.0%

Upper 95.0%

Intercept 17.58426966 3.760573 4.675954 0.018476 5.616435321 29.552104 5.616435 29.5521

x1 1.443820225 0.763074 1.892111 0.154831-

0.984622726 3.8722632 -0.98462 3.872263

x2 0.842696629 0.914227 0.921759 0.424636-

2.066783462 3.7521767 -2.06678 3.752177

RESIDUAL OUTPUT

Observation Predicted Y Residuals1 28.1741573 2.8258432 41.89325843 -1.893263 30.1011236 -0.101124 30.70224719 3.2977535 26.12921348 -1.129216 23 -3

5

Page 6: Introduction to Econometrics 1

We will solve the above example by using equations related to the sum of squares of deviations of the variables from their means.The dependent variable is annual profits and the independent variables are expenditures for R&D and marketing expenses. It is required to calculate the regression equation, the residual sum of squares, the coefficient of multiple determination and the correlation coefficient. In addition, calculate the standard errors and the 95% confidence interval of the intercept and the coefficients . Finally, calculate the t-statistics for the intercept and the coefficients.

Year Annual profits(£000) (y)

Expenditures for R&D(£000) (x1)

Marketing expenses(£000) (x2)

2003 31 5 42002 40 11 102001 30 4 82000 34 5 71999 25 3 51998 20 2 3

The equations related to the sum of squares of deviations of the variables from their means are as follows:

The equations for , 1 and 2 are as follows:

6

Page 7: Introduction to Econometrics 1

The first step is to construct the Table wit the relevant calculations to facilitate you to calculate the coefficients , and the intercept .

Year(n=6)

Annual profits(£000) (y)

Expenditures for R&D(£000) (x1)

MarketingExpenses (£000) (x2)

2003 31 5 4 0 0 -2.1666672002 40 11 10 10 6 3.8333332001 30 4 8 -1 -1 1.8333332000 34 5 7 3 0 0.8333331999 25 3 5 -6 -2 -1.1666671998 20 2 3 -11 -3 -3.166667

∑y = 180 ∑x1 = 30

y2 x12 x2

2 x1x2 x1y x2y961 25 16 20 155 124

1600 121 100 110 440 400900 16 64 32 120 240

1156 25 49 35 170 238625 9 25 15 75 125400 4 9 6 40 60

5642 200 26321

8100

0118

7

We calculate first 1 and 2 .

The equations related to the sum of squares of deviations of the variables from their means are as follows:

7

Page 8: Introduction to Econometrics 1

The equations for , 1 and 2 are as follows:

Finally, the intercept will be calculated as follows:

Thus, the regression equation is:

The residual sum of squares is calculated from the following equation:

The coefficient of multiple determination is calculated from the following equation:

The correlation coefficient is as follows:

(to 4.d.p.).

8

Page 9: Introduction to Econometrics 1

The mathematical formulas of the estimated variance of the error term as estimation of , the variances, standard errors and the covariance of the coeficients and the intercept are as follows:

Once we know the coeficients and the standard errors, then, we can calculate the confidence intervals. The sample size is 6 and the degrees of freedom are n-3 or 6-3 = 3 egrees of freedom. From the t-distribution and with 5% significance level the value is 3.182.

The confidence intervals with 95% confidence level for the intercept and the coefficients are as follows:

9

Page 10: Introduction to Econometrics 1

The t-statistics of the intercept and the coefficients are calculated as follows:

t-statistic(

t-statistic

Please complete the following calculation to become familiar with the t-statistic.

t-statistic =

I have attached the Excel output of the multiple regression equation. Please check the calculations of the R square, the residual, the intercept and the coefficients.

SUMMARY OUTPUT

10

Page 11: Introduction to Econometrics 1

Regression StatisticsMultiple R 0.929919713R Square 0.864750673Adjusted R Square 0.774584455Standard Error 3.303045922Observations 6

ANOVA

  df SS MS FSignificance

FRegression 2 209.2697 104.6348 9.590628 0.049739643Residual 3 32.73034 10.91011Total 5 242      

Confidence Intervals

  CoefficientsStandard

Error t Stat P-value Lower 95% Upper 95%Lower 95.0%

Upper 95.0%

Intercept 17.58426966 3.760573 4.675954 0.018476 5.616435321 29.552104 5.616435 29.5521

x1 1.443820225 0.763074 1.892111 0.154831-

0.984622726 3.8722632 -0.98462 3.872263

x2 0.842696629 0.914227 0.921759 0.424636-

2.066783462 3.7521767 -2.06678 3.752177

I have attached another example with the following dataset :

Share Market PE3.526787 8.73209 0.499819922-4.34533 -5.19815 -4.21807425.222709 6.21865 0.877518989-4.99619 -5.5393 -0.745571156

11

Page 12: Introduction to Econometrics 1

-3.04336 7.69808 -0.317399673-2.375422 -4.99735 2.0611907912.651303 5.42777 2.629939417-0.68924 -1.5424 -0.8809008470.205664 1.4639 0.2530117572.4783 3.6528 -0.920953642

0.237407 -0.1494 -1.1487126780.329728 0.16688 0.372533199-0.26869 -0.1444 -1.7140759240.064769 0.097873 4.630805383-0.5873 -0.09911 -2.530835538

0.329225 -0.08344 -1.038471775-0.11849 0.122767 0.5309048290.011541 -0.45767 -1.857415164-0.18757 -0.53046 1.227869802-0.38752 -0.11118 0.642577585-0.26835 -0.28947 1.8390492010.262798 -0.17676 -0.954329890.355054 -1.15686 -2.503369578-1.34302 -0.5771 0.877111656-0.77964 0.578182 -1.55602547-0.04649 -0.05331 0.0405596920.098381 -0.23054 -0.522472998-0.09585 -0.66625 2.93017109-0.0059 -0.50071 -0.497160825-0.05415 -0.53128 -1.885502172

The stochastic equation in Econometrics will incorporate the error term and it will be as follows:

yi= +

I will include the steps on how to perform a multiple regression in Excel.

1. Plot or insert the data in Excel.2. Press tools…..then select data analysis……then select regression…..3. Then in input Y range select the label and the data of the dependent variable.

In our case, it is the share price returns. 4. Then, in input X range select the labels and the data of the two independent

variables. In our case, it is the market price returns and the price earnings ratio.

5. Select the box labels and confidence levels. Select the required confidence levels. For example, 90%, 95% or 99%.

6. Press output range box and select the cell that your output will be displayed.7. Finally, select the box residuals in order that the residuals table and data will

be displayed.

12

Page 13: Introduction to Econometrics 1

The regression Excel output will be as follows:

SUMMARY OUTPUT

Regression StatisticsMultiple R 0.696338968R Square 0.484887959Adjusted R Square 0.446731511Standard Error 1.482906946Observations 30

13

Page 14: Introduction to Econometrics 1

ANOVA

  df SS MS FSignificance

FRegression 2 55.88963329 27.94481665 12.70789 0.000129025Residual 27 59.37335129 2.199013011Total 29 115.2629846      

  Coefficients Standard Error t Stat P-value Lower 95%Upper 95%

Intercept -0.260766997 0.273942189 -0.951905212 0.349589 -0.82284956 0.3013156Market returns 0.405779907 0.087380092 4.643848503 7.93E-05 0.226490889 0.5850689P/E ratio 0.131419379 0.154479732 0.850722473 0.402405 -0.18554664 0.4483854

The intercept and the coefficients of the variables in relation to the t - statistics are as follows:

(-0.95) (4.64) (0.85)

The residual will be as follows:

RESIDUAL OUTPUT         

ObservationPredicted Share

returns Residuals1 3.348225697 0.1785613032 -2.924408515 -1.4209214853 2.377959224 2.8447497764 -2.606486136 -2.3897038645 2.821246723 -5.864606723

14

Page 15: Introduction to Econometrics 1

6 -2.017710802 -0.3577111987 2.287338016 0.3639649848 -1.002409369 0.3131693699 0.366504857 -0.160840857

10 1.100434692 1.37786530811 -0.472353622 0.70976062212 -0.144092364 0.47382036413 -0.54462441 0.2759344114 0.387525469 -0.32275646915 -0.633584679 0.04628467916 -0.431100589 0.76032558917 -0.141179432 0.02268943218 -0.690580635 0.70212163519 -0.314651119 0.12708111920 -0.22143446 -0.1660855421 -0.136541402 -0.13180859822 -0.457910095 0.72070809523 -1.059188817 1.41424281724 -0.379673112 -0.96334688825 -0.23064426 -0.5489957426 -0.277068794 0.23057879427 -0.422978574 0.52135957428 -0.146036594 0.05018659429 -0.529281622 0.52338162230 -0.724141271 0.669991271

The multiple regression output in EViews 6

Before loading and transferring your data in EViews 6, you have to insert the numerical values or the data in Excel. Do not use long titles for each time – series. Use abbreviations. For example, share for the first time – series, market for the second time – series and PE, price earnings ratio for the third time - series. Name the sheet of the Excel file and delete the other sheets. For example, the name of the sheet is reg2. Once the Excel file is ready, you close it and you open the statistical package EViews 6. You press file and then you select new worksheet. Then, in workfile structure type, select unstructured / undated. Insert the number of observations. In our case, the number is 30. You will get and untitled worksheet. Then press file, then, import, then, read text - lotus – excel. Select the Excel file and press OK. Excel spreadsheet import

15

Page 16: Introduction to Econometrics 1

screen will open. In the box, upper left data cell, select A2. This is the cell that your first observation starts. In Excel 5 + sheet name write reg2. This is the name of your filename. In the box name of series, please write share then press space bar and then market and then PE. These are the names of the variables.

Once you done these steps, you will be able to see the time series transferred from Excel to EViews 6. Press file, then, save as. Then, write the filename reg2 and save it with the extension wf.

You click on the time series for example share to do the following tests. You press view and then you select descriptive statistics and tests……… histogram and stats or correlogram or unit root test. The same thing you do for the file market and PE. You open it and then you press view.

To run a regression, you press quick, and then, estimate equation. In the specification box, you write:

share c market PE and you press OK.

Then, your output will be displayed. From the top menu you have options to do forecast and check the residuals or error term for further tests.

The dependent variable is share price returns and the independent variables are market price returns and

Dependent Variable: SHAREMethod: Least SquaresDate: 02/08/15 Time: 11:12Sample: 1 30Included observations: 30

Variable Coefficient Std. Error t-Statistic Prob.  

C -0.260767 0.273942 -0.951905 0.3496MARKET 0.405780 0.087380 4.643849 0.0001

PE 0.131419 0.154480 0.850722 0.4024

R-squared 0.484888    Mean dependent var -0.127295Adjusted R-squared 0.446732    S.D. dependent var 1.993636

16

Page 17: Introduction to Econometrics 1

S.E. of regression 1.482907    Akaike info criterion 3.720525Sum squared resid 59.37335    Schwarz criterion 3.860645Log likelihood -52.80788    Hannan-Quinn criter. 3.765351F-statistic 12.70789    Durbin-Watson stat 1.765635Prob(F-statistic) 0.000129

The intercept and the coefficients of the variables in relation to the t - statistics are as follows:

(-0.95) (4.64) (0.85)

There are five basic assumptions of the linear and multiple regression models to be valid. The coefficients have to be BLUE. They must have the smallest possible variance and be consistent to achieve best fit regression analysis. The multiple regression model that we will cover is as follows:

Our aim is to show that the coefficients are unbiased and have minimum variance in the multiple regression equation. In addition, the explanatory variables are not collinear to satisfy the fifth assumption, namely, that the independent variables are uncorrelated with the error term, the dependent variable and between them. For example, if advertising and revenues are higly correlated, it will be difficult to separate the effects of the variables under study. The coefficients have to be BLUE.

The acronyms of BLUE stands for the following words:

B: BestL: LinearU: UnbiasedE: Estimator

We would like that the coefficients are BLUE by comparing the hypothesis testing of the F-statistic. Please check the regression example that we covered to understand the relation of the F-statistic with the coefficients.

The first assumption is that the error term is normally distributed.

17

Page 18: Introduction to Econometrics 1

The second assumption is that the mean of the error term is zero.

The third assumption is that the variance of the error term is the same in each time period for all values of the independent variables. This is a homoskedastic situation.

The fourth assumption is that the numerical value of the error term is uncorrelated with its value in other time period.

The fifth assumption is that the independent variables are uncorrelated with the error term, the dependent variable and between them.

If the above assumptions are violated, then, you should consider diagnosing the problems related to multicollinearity, heteroskedasticity, autocorrelation, and errors in variables.

Thus, the assumptions related to the error term as follows:

E( )=0, cov(x1, , cov(x2,

When expressing the residual or error term in terms of the multiple regression equation, we have the following equation:

Multicollinearity

Multicollinearity is a violation of the fifth assumption. It is a case when the independent variables are highly correlated with the dependent variables. The estimated coefficients may be statistically insignificant although R2 is very significant. In addition, the standard errors could be very high or the t-ratios very low. The confidence intervals for the parameters of interest are thus very wide. When the explanatory variables are highly intercorrelated, it becomes difficult to disentangle the separate effects of each of the explanatory variables on the explained variable. It can be overcome or reduced by collecting more data or dropping one of the highly collinear variables.

Heteroskedasticity

Heteroskedasticity is a violation of the third assumption. The assumption is that the variance of the error term is the same in each time period for all values of the independent variables. Once that we have heteroskedasticity, then, the coefficients are not BLUE. We have biased estimates and larger variance in our data set. The problem is detected by observing the high values of the residuals in relation to the estimated data. A possible solution to this problem is to use a log function for both the dependent and independent variables. In the Econometrics book that I will prepare in the future, I will explain in detail through different examples how to perform the tests.Other heteroskedasticity tests are the ARCH test, Glejser’s test, Breusch and Pagan’s test, White’s test and detection of the residuals in the likelihood ratio test.

18

Page 19: Introduction to Econometrics 1

Autocorrelation

One of the assumptions of multiple regressions is that the value which the error term assumes in one period is uncorrelated to its value in any other time period. This ensures that the average value of the dependent variable depends only on the independent variable and not on the error term. The Durbin-Watson statistic tests for first-order autocorrelation. If there is autocorrelation, it leads to biased standard errors and thus to incorrect statistical tests. Autocorrelation is tested by using the test of the Durbin-Watson statistic.

Errors in variables

It refers to measurement errors related to the independent variables. For example, the data set is not consistent and there are gaps in the measurement history. A solution to this problem is to replace the independent variable with another one that is not correlated with the error term.

Exercise

You are given the following data for a sample size of n = 28.

It is required to calculate the following:

19

Page 20: Introduction to Econometrics 1

a) Calculate the intercept

In addition, please calculate their standard errors. b) Estimate the regression equation.

Solution

The equations for , 1 and 2 are as follows:

Thus, the regression equation is as follows:

Multiple regression with more than 2 independent variables

I have attached an example of 30 observations. It is related to a multiple regression with 3 independent variables. The dependent variable is the discount and the independent variables are the the market effect, the size effect, and the book to market effect. The data were gathered from datastream. It is required to calculate the multiple regression equation.

The multiple regression has the following format:

where: yi is the discount. is the intercept. is the market effect. is the size effect.

20

Page 21: Introduction to Econometrics 1

is the book-to-market effect.

dis markt s bm-2.36697 -9.12939 -1.70722 1.927814-0.51194 0.577933 0.669331 2.8669240.254743 2.162936 0.020789 0.225917-2.48828 -4.14315 -2.2856 -4.685693.491507 12.83122 -2.13122 3.176416-0.63164 -1.75343 0.887992 -0.751224.183572 2.886911 -1.57409 -0.188150.316764 -8.61904 -1.03613 2.5486413.375268 1.049722 -4.61812 2.9391050.938469 -1.5605 -1.5808 -2.33380.071534 5.700149 1.721926 4.103659.627071 10.81373 1.635702 0.7578495.921342 10.37399 1.888181 2.964885-0.85242 6.860824 0.805653 -6.209395.53793 5.806177 3.93546 -0.37606

2.480958 6.140132 -2.02608 -0.26034-0.54454 1.165931 1.246596 -0.059190.565246 -2.2651 -1.97628 -0.169530.608492 2.957873 2.345294 1.576884.826277 6.006215 1.126188 1.1268593.465707 2.439138 1.841884 -1.545847.663349 7.959142 2.43412 -0.045667.751392 5.265374 -1.00022 2.38889112.2131 21.38698 0.325796 -1.36015

2.025977 -1.55935 5.17537 1.816525-4.81799 -4.10934 0.590867 -0.737670.505699 -1.50866 -1.73348 0.08419.726398 11.7856 -4.2814 1.041869-3.51059 -2.34779 -0.2577 0.463281

Solution

By plotting the data in Excel and using the data analysis toolbox, you will get the following output.

SUMMARY OUTPUT

Regression StatisticsMultiple R 0.273516R Square 0.074811

Standard Error 0.498851Observations 29

ANOVA

  df SS MS FSignificance

F

21

Page 22: Introduction to Econometrics 1

Regression 3 0.503055 0.167685 0.673834 0.576168Residual 25 6.221306 0.248852Total 28 6.724362      

  CoefficientsStandard

Error t Stat P-value Lower 95%Upper 95%

Intercept -0.18839 0.103061 -1.82797 0.079513 -0.40065 0.023866markt 0.018152 0.01424 1.274672 0.214146 -0.01118 0.04748s 0.020127 0.042147 0.477546 0.637122 -0.06668 0.106931bm 0.000836 0.041498 0.020146 0.984087 -0.08463 0.086304

t-statistics (-1.83) (1.27) (0.48) (0.02)

Exercise

You are required to calculate the confidence interval of the prediction with 95% confidence level of the multiple regression equation. The sample size is 6. Please consider the following regression:

You are given the following data:

The predicted

22

Page 23: Introduction to Econometrics 1

The standard error of the prediction is 0.245 and the degrees of freedom are n-3 or 6-3 = 3. From the table of the t-distribution the value is 3.182 for the 5% significance level.

The 5% significance level of the prediction is as follows:

Please complete the calculations for the upper and lower confidence level.

F-statistic calculation based on the multiple regression

It is very important to understand the analysis of variance based on the coefficients of the multiple regression. It is required to calculate the F-statistic based on the following sample size which consists of 6 observations.

Year(n=6)

Annual profits(£000) (y)

Expenditures for R&D(£000) (x1)

MarketingExpenses (£000) (x2)

2003 31 5 4 0 0 -2.1666672002 40 11 10 10 6 3.8333332001 30 4 8 -1 -1 1.8333332000 34 5 7 3 0 0.833333

23

Page 24: Introduction to Econometrics 1

1999 25 3 5 -6 -2 -1.1666671998 20 2 3 -11 -3 -3.166667

∑y = 180 ∑x1 = 30

y2 x12 x2

2 x1x2 x1y x2y961 25 16 20 155 124

1600 121 100 110 440 400900 16 64 32 120 240

1156 25 49 35 170 238625 9 25 15 75 125400 4 9 6 40 60

5642 200 26321

8100

0118

7

The mathematical formula for the F-statistic is as follows:

I have attached the Excel output with the F-statistic calculation.

SUMMARY OUTPUT

Regression StatisticsMultiple R 0.929919713R Square 0.864750673Adjusted R Square 0.774584455Standard Error 3.303045922Observations 6

ANOVA

  df SS MS FSignificance

FRegression 2 209.2697 104.6348 9.590628 0.049739643Residual 3 32.73034 10.91011

24

Page 25: Introduction to Econometrics 1

Total 5 242      Confidence Intervals

  CoefficientsStandard

Error t Stat P-value Lower 95% Upper 95%Lower 95.0%

Upper 95.0%

Intercept 17.58426966 3.760573 4.675954 0.018476 5.616435321 29.552104 5.616435 29.5521

x1 1.443820225 0.763074 1.892111 0.154831-

0.984622726 3.8722632 -0.98462 3.872263

x2 0.842696629 0.914227 0.921759 0.424636-

2.066783462 3.7521767 -2.06678 3.752177

Analysis of variance or Chow test

The predictive test for stability is based on the fact that the prediction errors have zero mean. You nedd two datasets to estimate this test. The first sample size n1 is used to estimate the regreesion equation and then used to get the predictions for the second sample size n2. The F-test mathematical equation that is used in this test is as follows:

Where: SRSS is the total of the sum residual of squares from both regressions. RSS1 is the residual sum of squares of the first sample. RSS2 is the residual sum of squares of the second sample. n1 is the sample size of the first sample. n2 is the sample size of the second sample. The degrees of freedom are n2 and (n1-k-1).

25

Page 26: Introduction to Econometrics 1

Example

You are given the following data for different time periods. The sample sizes are n1= 20 and n2 = (20-2-1) = 17. Pease calculate the F-test and compare it with the F-value from the F-tables. Based on the result accept ot reject the hypothesis of stability.

From 2000 – to 2005: RSS1 = 0.1141From 2007 - to 2014: RSS2 = 0.0432 RSS from both regressions = 0.3761

Solution

The hypotheses are as follows:

H0: There is stability and the variances are constant.H1: There is no stability and the variances are not constant.

Plug the numbers in the above F-test equation.

Then, please look at the end of the book for the F-tables to find the F-value for 5% significance level. The degrees of freedom are 20 and 17. 20 referes to the vertical column and 17 refers to the horizontal column. The F-value is 2.23. Therefore, at the 5% significance level we cannot reject the hypothesis H0 of stability.

I have attached the stability Chow Forecast test in EViews 6 for different datasets. Let me explain how you do it. You run first the regressions for the different samples. You can check at the bottom of the test the RSS1 and RSS2 or the sum squared residuals for different regressions. Then, you click on view and you select stability tests. Please focus on the first two lines of the test that are bold. Check the F-statistic with the related p-values. Check if it is significant or not and acccordingly accept or reject the stability effect. Please remember that the test is applicable for diffierent time periods to show the stability of the variances in different time span. In this case the F-statistic is not significant at the 5 % significance value and the sample evidence suggests to reject the null hypothesis of stability.

Chow Forecast Test: Forecast from 30 to 30

F-statistic 0.095234    Prob. F(1,27) 0.7600Log likelihood ratio 0.105632    Prob. Chi-Square(1) 0.7452

Test Equation:Dependent Variable: SHAREMethod: Least Squares

26

Page 27: Introduction to Econometrics 1

Date: 04/24/16 Time: 11:56Sample: 1 29Included observations: 29

Variable Coefficient Std. Error t-Statistic Prob.  

C -0.300281 0.280689 -1.069799 0.2942MARKET 0.424144 0.086175 4.921896 0.0000

R-squared 0.472914    Mean dependent var -0.129817Adjusted R-squared 0.453393    S.D. dependent var 2.028876S.E. of regression 1.500007    Akaike info criterion 3.715289Sum squared resid 60.75056    Schwarz criterion 3.809585Log likelihood -51.87168    Hannan-Quinn criter. 3.744821F-statistic 24.22506    Durbin-Watson stat 1.874055Prob(F-statistic) 0.000038

Exercise of Chow breakpoint dates

Please consider the following dataset. The dependent variable is fund returns and the independent variables are size, mrket returns, and book to market returns. We want to apply Chow breakpoint dates to see the stability effect in different time periods or observations.

Fund return Size Market returnBook to market return

8.826227396 9.960064 6.770160683 -0.9656252914.3672068 5.355746 2.101215141 0.414592015

-3.744606009 -0.83212 -2.256078026 -0.4427797463.653337534 0.455159 3.415008092 0.489551486

-2.953694894 2.587989 -0.948914314 -1.8014539-2.422063841 -5.26719 -1.376494263 -0.724588286-2.739912699 -2.81245 -2.109216247 0.033230007-9.315254996 0.08267 -7.732750354 -0.682711797-14.94682585 1.892262 -15.8749113 -1.064875722-6.339207193 -5.17364 -5.148485371 -1.253981349

27

Page 28: Introduction to Econometrics 1

0.642977938 1.008631 0.24888198 0.937226066-3.606672395 -1.95412 -3.445206217 0.227324759-10.95272614 0.025582 -11.00011018 0.5205531780.400824294 -1.79843 0.244664806 0.6193559031.869571463 -2.02135 3.38517113 0.241543165

-3.897889112 0.929765 -0.924758847 0.07628458-11.29111047 1.537612 -8.775426604 -1.60504611-6.661286528 -1.18146 -7.103029456 2.763327169-4.448910028 0.796062 2.350319644 -0.904642691-6.364596029 0.908112 2.623642306 0.922935303

0.67082299 0.109951 -1.092979559 0.969071134-0.306823529 -1.66881 -0.566832617 1.797172793-4.701484083 -3.97912 -5.525479768 -2.189439024-7.227822193 -0.54654 -6.09870943 0.572655503-6.819388756 1.794597 -7.128801365 0.5947627363.798680926 2.22648 1.900347699 -0.241326231

-6.407574151 2.90064 -4.765574598 -0.100919561-6.800821645 4.582849 2.256769855 1.151969143

0.78018847 1.950273 -3.014224796 0.1502333773.281922722 -1.35779 -5.225298329 1.348764503

-5.902120198 -5.19986 -4.372702903 -0.9663620032.735700817 -0.28909 -3.009988946 -0.1402873421.127754174 -2.51951 -5.27213103 1.354648138

-6.979359505 1.359845 -9.017072966 -0.4995759880.132010282 -2.71113 2.694147467 -0.502174863

-1.783030396 -0.52075 -4.799965131 -0.896890172-6.070046292 2.410862 -7.305311366 -1.3709710933.932558537 0.709116 2.718674687 1.149926139

-5.498280124 -1.54017 -6.271749703 1.148277-1.026654953 -4.29409 2.547264128 0.063203037

1.29505961 -3.60993 3.746687032 -1.431831064-5.210391155 -7.10815 -1.127785203 -2.82488946-4.169418282 -6.16405 0.629884119 3.649972499-2.417722014 4.466414 -7.612368855 0.904322589-5.836022668 -5.06333 -6.631707976 0.327308887-4.34147211 8.042842 -10.15385517 -0.557336145

-6.013067376 -2.31934 -6.779492991 0.4645981982.351678237 -2.94247 5.807435413 0.362367951

-6.051033528 -6.47283 -5.191925454 5.5202052545.507247639 -0.0779 4.307946209 -4.1095313645.568041295 3.477585 2.141515594 -1.4359075664.050537123 0.95435 3.458736009 0.127594979

2.61824883 7.26415 -2.328445821 1.925220354-2.663025875 -7.75868 0.405165127 0.37703302-2.58735089 -4.16073 -3.001386449 -0.460596869

-18.49933968 -5.2051 -13.01239192 1.756621858-1.023099967 0.476781 -2.350246928 -0.998376504-6.992559532 -10.2425 -17.09413351 3.549554357-7.949112283 -0.11357 -9.429005018 0.456761373-10.99642206 -4.59335 -14.96744007 3.643166785-12.55287795 1.871977 -8.705122947 3.860200829-8.469066505 5.660192 5.107106143 -4.0571251568.947897286 1.23615 11.5829977 -1.2650910291.108790499 3.339655 3.200897507 -4.584911994

28

Page 29: Introduction to Econometrics 1

-2.478781079 3.986295 -4.792938648 -0.70736197413.26262082 0.448869 12.2606341 -0.140015364

-1.943407791 5.521951 -7.683346632 -0.8938766086.884125763 4.200894 6.296174259 -4.774779412-9.07257895 -0.40545 -14.81734321 1.2947414234.508494188 -7.34258 7.062232477 6.6638861413.41288662 2.428614 -0.922061916 3.83377619126.59266267 3.27669 15.66395515 -0.218511978

-9.770788482 12.44936 -19.02650795 3.44646497222.02580384 5.658625 -9.789377382 3.025288068

-13.12702576 -2.89127 17.84848758 -8.354330581-13.94608983 -2.56212 -2.867582535 -1.480902962-5.411771556 -1.62255 4.237393994 -5.2776033585.387498221 5.286432 2.683161051 -0.6522882951.306410511 2.040671 -4.275937147 -0.7888977615.44616019 -0.13068 6.778060323 0.880962704

-9.068237655 1.879668 -6.645082878 -6.81811859-4.03622483 -5.39266 2.160831737 -2.368982491

-16.29317457 1.90084 -5.304864768 -6.458821572.001570939 -2.21752 -1.30755068 0.9191446884.294160506 3.951971 3.005900556 -1.65017838

-4.825510415 1.283857 -5.165205523 0.046750836-10.30923811 -3.81126 -4.275694375 -3.43372624310.33376783 0.268692 13.31939629 -0.7390212745.352167827 3.495569 2.454867193 2.780105608

-9.732250156 -4.4211 -17.02418316 -0.967065568-8.973101425 -2.86788 -9.983114132 1.6771428151.301330571 2.530577 -0.088339962 -1.633333621

-14.93474615 -10.2433 -7.379219909 -0.114463999-20.25512925 1.787003 -16.96223063 1.31924314-11.48327474 10.58094 -10.26966727 -4.053618976-1.983759531 -1.11783 -4.146557016 0.0952221350.042296285 -1.77422 2.555558476 0.948573591

-3.732611732 -2.45459 0.106627672 3.3044426254.83643739 1.177549 1.868378125 -0.300813688

0.166514103 3.920495 -2.201628006 0.163661169-5.167810897 0.998025 -5.176270858 -3.21846229-10.32433827 -2.3444 -9.114406255 -1.525630134-10.17605333 2.571829 -10.6757764 -38.07453099-4.421339244 -4.67418 0.468311538 -2.324806591-9.258995579 -2.60953 -7.92730825 3.342569585-1.051830185 -5.61631 5.552752453 -2.6005188913.761018401 4.65114 2.817569926 -2.871721699

Solution

First of all, run the regression equation in EViews 6 as follows:

fr c s mr bm

Where: fr is fund returns. s is size and is is the first independent variable. mr is market return and it is the second independent variable.

29

Page 30: Introduction to Econometrics 1

bm is book to market return and it is the third independent variable.

Click view and select the stability tests and then the first one Chow breakpoint test.The total number of observations are 107.

The hypotheses are as follows:

H0: There is stability and the variances are constant.H1: There is no stability and the variances are not constant.

I have included the output of the Chow breakpoint test for 20, 40 60, 80 and 100 observations.

Chow Breakpoint Test: 20 40 60 80 100 Varying regressors: All equation variablesEquation Sample: 3 107

F-statistic 2.500424 Prob. F(20,83) 0.0020Log likelihood ratio 50.45822 Prob. Chi-Square(20) 0.0002Wald Statistic  101.7974 Prob. Chi-Square(20) 0.0000

The F-statistic, the log likelihood ratio and the Wald statistic p – values are all significant at the 5% significance level. The sample evidence suggests not to reject the null hypothesis of stability. In other words, the variances for different time periods are constant.

I have also attached a residual plot to facilitate your learning.

-30

-20

-10

0

10

20

30

-30

-20

-10

0

10

20

30

10 20 30 40 50 60 70 80 90 100

Residual Actual Fitted

30

Page 31: Introduction to Econometrics 1

Exercise

You are given the following two datasets.

Dataset 1 Dataset 2

Sx1 = 48 Sx1 = 47Sx2 = 35 Sx2 = 34Sx1x2 = 32 Sx1x2 = 33Sx1y =100 Sx1y =102Sx2y =78 Sx2y =80Syy = 240 Syy = 244n =10 n =15

It is required to calculate the regression equation, the residual sum of squares, the coefficient of multiple determination, and the correlation coefficient. In addition, calculate the standard errors and the 95% confidence interval of the intercept and the

31

Page 32: Introduction to Econometrics 1

coefficients . Finally, calculate the t-statistics for the intercept and the coefficients for each dataset.

We compute the first dataset.

The equations for , 1 and 2 are as follows:

Finally, the intercept will be calculated as follows:

Thus, the regression equation is:

The residual sum of squares is calculated from the following equation:

The coefficient of multiple determination is calculated from the following equation:

32

Page 33: Introduction to Econometrics 1

The correlation coefficient is as follows:

The mathematical formulas of the estimated variance of the error term as estimation of , the variances, standard errors and the covariance of the coeficients and the intercept are as follows:

33

Page 34: Introduction to Econometrics 1

The confidence intervals with 95% confidence level for the intercept and the coefficients are as follows:

The t-statistics of the intercept and the coefficients are calculated as follows:

t-statistic(

t-statistic

Please complete the following calculation to become familiar with the t-statistic.

t-statistic =

34

Page 35: Introduction to Econometrics 1

Please compute the second dataset.

The equations for , 1 and 2 are as follows:

Finally, the intercept will be calculated as follows:

Thus, the regression equation is:

The residual sum of squares is calculated from the following equation:

The coefficient of multiple determination is calculated from the following equation:

The correlation coefficient is as follows:

The mathematical formulas of the estimated variance of the error term as estimation of , the variances, standard errors and the covariance of the coeficients and the intercept are as follows:

35

Page 36: Introduction to Econometrics 1

The confidence intervals with 95% confidence level for the intercept and the coefficients are as follows:

The t-statistics of the intercept and the coefficients are calculated as follows:

t-statistic(

36

Page 37: Introduction to Econometrics 1

t-statistic

Please complete the following calculation to become familiar with the t-statistic.

t-statistic =

Exercise

You are given the following multiple regression equation from a sample of n = 20 observations. It was estimated based on the ordinary least squares method.

t-statistics (1.2) (2.1) (3.2) (2.4)

In parentheses, I have included the t-statistics. The R2 = 0.892.

You are required to comment what will happen to R2 if we drop or add an explanatory variable?

Solution

37

Page 38: Introduction to Econometrics 1

If an explanatory variable falls, then the R2 will fall. It is important to check the t-statistic if it is significant at the 5% or 1% significance level to determine if R 2 will rise.

Exercise

Based on the following summary output, please, calculate the R square of the multiple regression.

SUMMARY OUTPUT

Regression StatisticsMultiple R 0.929919713R Square 0.864750673Adjusted R Square 0.774584455Standard Error 3.303045922Observations 6

ANOVA  Df SS MS F Significance

38

Page 39: Introduction to Econometrics 1

FRegression 2 209.2697 104.6348 9.590628 0.049739643Residual 3 32.73034 10.91011Total 5 242      

Confidence Intervals

  CoefficientsStandard

Error t Stat P-value Lower 95% Upper 95%Lower 95.0%

Upper 95.0%

Intercept 17.58426966 3.760573 4.675954 0.018476 5.616435321 29.552104 5.616435 29.5521

x1 1.443820225 0.763074 1.892111 0.154831-

0.984622726 3.8722632 -0.98462 3.872263

x2 0.842696629 0.914227 0.921759 0.424636-

2.066783462 3.7521767 -2.06678 3.752177

Solution

You are given the following mathematical equation:

R2 =

Another formula to use is as follows:

R2 =

Exercise

It is given the following multiple regression equation. The sample size is 20. The dependent variable is share prices and the independent variables are stock index and the numbers of buyers.

SE (123.43) (0.768) (7.342)

t-statistics (0.7976) (19.98) (4.66)

p (0.0000) (0.0000) (0.0000)

R2 = 0.885 F = 110.324

39

Page 40: Introduction to Econometrics 1

It is required the following:

(a) Please interpret the regression equation output.(b) Test the hypotheses that the coefficient = 0 under the null hypothesis.

Compute the t value and then decide whether to accept or reject the null hypothesis.

Solution

(a) The share prices are positively related to the stock index and the number of buyers. If the stock index increase, the average price of the share price will increase by about 15.345 dollars. Following the same way of thinking, by keeping the stock index constant, the average share price increase by 34.23 dollars justified by the increase of the number of buyers. The R2 number of 0.768 means that the two explanatory variables account for 76.8% of the variation in the share prices. All the p-values are significant.

(b) Please state the hypotheses as follows:

Under the null hypothesis, the stock index has no effect on the share price. Under the alternative hypothesis it affects the share prices.

The t-value is as follows:

The degrees of freedom are n-3. The sample size is 20. Therfore, 20 – 3 = 17

Do we reject or accep the null hypothesis with 5% significance level?

Solution

The degrees of freedom are n – 3 or 20 – 3 = 17. With 95% confidence level, the t- critical value of a two-tail t-test is 2.110. The calculated t-statistic is 19.98 and is higher than the critical value of 2.110. Therefore, the suggested evidence suggests that we can reject the null hypothesis with 95% confidence level. The econometrician would assume that the true value of the parameter is not equal to 0.

You should compare the value of the test statistic from the sample with the critical value and reject the null hypothesis if the test statistic exceeds the critical value. The value of the t-statistic 19.98 exceeds the critical value ± 2.110. Therefore, the sample evidence suggests to reject the null hypothesis.

40

Page 41: Introduction to Econometrics 1

Exercise

Please consider the following dataset and ANOVA table single factor. Compute te F - ratio. ANOVA is the acronyms of analysis of variance. It is used to show the decomposition of the variance explained by the independent variables and the error term in the regression equation. The mathematical formula for the F-statistic is as follows:

Where: d.f is the degrees of freedom.

The dependent variable is share return and the independent variables are risk free rate and size. The null hypothesis is that and = 0. The degrees of freedom are k-1 in the numerator and n – k in the denominator. Thus, we have 3 – 1 = 2 degrees of freedom in the numerator and 72 – 3 = 69 in the denominator.

ObservationsShare return

Risk free rate Size

1 3.526787 2.755836 0.823441

41

Page 42: Introduction to Econometrics 1

2 -4.34533 2.725726 4.458782

3 5.222709 3.092531 4.232887

4 -4.99619 3.089416 2.137064

5 -3.04336 2.802173 1.658494

6 -2.375422 2.409178 0.20628

7 2.651303 2.220845 0.028879

8 -0.68924 2.522834 0.774512

9 0.205664 2.669247 0.259913

10 2.4783 2.969892 0.325758

11 0.237407 1.994892 0.008251

12 0.329728 1.861559 0.008251

13 -0.26869 2.469416 0.929169

14 0.064769 2.503892 0.633557

15 -0.5873 2.265607 0.711829

16 0.329225 2.606003 0.008251

17 -0.11849 2.639092 2.19223

18 0.011541 2.572749 1.91165

19 -0.18757 2.540823 0.008251

20 -0.38752 2.259086 0.008251

21 -0.26835 2.608419 0.148522

22 0.262798 2.558915 0.191428

23 0.355054 3.042134 0.145221

24 -1.34302 2.168168 0.167979

25 -0.77964 3.415726 0.609765

26 -0.04649 2.244892 0.864844

27 0.098381 3.257392 0.072066

28 -0.09585 2.592576 0.960441

29 -0.0059 2.533267 0.00825130 -0.05415 2.678596 0.00825131 -0.00384 2.595078 0.24341132 -0.00799 3.029892 0.11551733 -0.0538 2.502834 0.40678534 -0.00541 2.801892 0.18714635 -0.0178 2.985979 0.55309536 -0.00335 2.873892 0.45271137 -0.03195 2.738983 0.25866138 -0.05187 2.188128 0.74260939 -0.00569 2.984753 0.14239940 -0.0009 2.774178 7.44718541 -0.15083 2.867511 0.88980242 -0.00748 3.143092 0.12376843 -0.78198 3.162392 0.20139644 -0.01982 2.054598 0.79945545 -0.04005 3.108503 0.4456446 -0.002 2.633226 0.272711

42

Page 43: Introduction to Econometrics 1

47 -0.00483 2.47726 0.15064248 -0.05627 2.887051 1.91840749 -0.01541 2.370292 0.7508650 -0.00233 2.802692 0.7508651 -0.00399 2.830845 0.7508652 -0.00422 1.625607 0.06806953 -0.00349 1.647749 0.16961754 -0.0038 2.855726 0.49380155 -7.53E-05 3.073577 0.7508656 -0.00448 2.516488 1.52307557 -0.01442 2.503017 0.33665858 -0.04197 2.515186 0.88538359 -0.05222 2.581392 0.14122860 -0.01357 2.804629 0.08251261 -0.01134 2.831559 0.08251262 -0.00305 2.935229 0.08251263 -0.0007 2.551392 0.08251264 -0.01616 2.944589 0.08251265 -0.00178 3.145347 0.08251266 -0.0375 4.196281 1.68386767 -0.04155 4.112035 0.66009768 -0.23705 4.090387 0.42163769 -9.67E-17 2.403577 2.98867270 -0.0583 2.703371 0.38615771 -0.04783 2.461197 2.88058172 -0.04177 2.928787 1.489131

Please find attached the ANOVA table single factor.

Anova: Single Factor

SUMMARY

Groups Count Sum Average Variance

Share return 72 -5.72171 -0.07947 1.633991

Risk free 72 195.3111 2.712654 0.212762

Size 72 57.48029 0.798337 1.469171

ANOVASource of Variation SS df MS F P-value F crit

Between Groups 293.5463 2 146.7731 132.78943.61E-

38 3.038267Within Groups 235.4306 213 1.105308

43

Page 44: Introduction to Econometrics 1

Total 528.9769 215        

F-ratio =

As the variance explained by the independent variables becomes larger relative to the unexplained variance, then, we will get a large number of the F-ratio. It indicates to us that we can reject the null hypothesis of no effect. The two independent variables affect the dependent variable. The F - ratio value exceeds the F critical value and therefore, we reject the null hypothesis that the explanatory or independent variables are equal to zero. The F ratio test is a measure of the overall significance of the estimated regression line. It is also related to R2 through the following equation:

Exercise

Please consider the following regression equation:

Where: y is the dependent variable and represents the hedge fund price. x1 is the independent variable and represents the hedge fund index. x2 is the second independent variable and represents the hurdle rate. It is required the following:

(a) Interpret the regression results(b) At the 5% significance level, test that the hurdle rate has no effect on the

hedge fund price.(c) Test the joint hypotheses through a t-test statistic that the two explanatory

variables affect substantially the hedge fund price. State the null and alternative hypotheses. The sample size is 25.

44

Page 45: Introduction to Econometrics 1

Exercise

Please consider the following dataset. Construct an ANOVA table and calculate the F-ratio statistic. Then, compare the F-ratio with the critical value and test the joint hypotheses of the independent variables related to significance at the 5 % significance value. The samle size is n = 20.

ObservationsShare return

Risk free rate Size

1 3.526787 2.755836 0.823441

2 -4.34533 2.725726 4.458782

3 5.222709 3.092531 4.232887

4 -4.99619 3.089416 2.137064

5 -3.04336 2.802173 1.658494

6 -2.375422 2.409178 0.20628

7 2.651303 2.220845 0.028879

8 -0.68924 2.522834 0.774512

9 0.205664 2.669247 0.259913

10 2.4783 2.969892 0.325758

11 0.237407 1.994892 0.008251

12 0.329728 1.861559 0.008251

13 -0.26869 2.469416 0.929169

14 0.064769 2.503892 0.633557

45

Page 46: Introduction to Econometrics 1

15 -0.5873 2.265607 0.711829

16 0.329225 2.606003 0.008251

17 -0.11849 2.639092 2.19223

18 0.011541 2.572749 1.91165

19 -0.18757 2.540823 0.008251

20 -0.38752 2.259086 0.008251

Exercise

Please consider the following dataset.

y x1 x2

1 2 33 4 45 7 57 8 79 10 810 2 913 11 1014 12 1616 14 18

(a) Estimate the regression equation.(b) From the summary output outline the t-statistics and the standard errors.(c) Estimate the 95% confidence intervals for .(d) Check the R2 and the F-statistic and compare it with F critical region.(e) Check the value of the TSS, ESS and RSS and comment on the variability.

46

Page 47: Introduction to Econometrics 1

Exercise

You are given the following data and mathematical equations based on a sample of 30 observations.

Please calculate , 1 and 2 . I have included the required equations.

47

Page 48: Introduction to Econometrics 1

(a) It is required to calculate the multiple regression coefficients.(b) Estimate their standard errors.(c) Estimate their t – statistics.(d) Estimate 95% confidence intervals for .(e) Show in Excel the ANOVA table and interpret the F ratio by comparing it with the F- critical region. Compare the variability of the explained sum of squares versus the residual sum of squares.

Exercise

Please find attached the regression results based on a sample of 25 open-ended mutual funds that are traded in the United Kingdom.

1 + 12.45x2 +

SE (0.32) (5.32) (6.21)

R2 = 0.48 F = 110.23 p-value = (0.0000)

Where: y is the dependent variable and is related to the price of the funds. x1 is the independent variable and it is related to the mutual funds index. x2 is the management fees that are charged by the funds.

(a) Calculate the t-statistics.(b) Interepret the regression results.(c) At the 95% confidence level, thest the hypothesis that the management fees

has no effect on the price of the funds. What test are going to use?(d) Are the coefficients of the two variables significant? What the R2 is telling us.

48

Page 49: Introduction to Econometrics 1

Exercise

Please consider the personal consumption in the UK in relation to disposable after-tax income and mortgage rates charged by banks.

Where: y is the personal consumption. x1 is the disposable after-tax income. x2 is the mortgage rate.

(a) Calculate the marginal propensity to consume, (MPC). Please check my book introduction to mathematical economics for clarification concerning the marginal propensity to consume.

(b) Please check and explain the signs of the coefficients.Would you expect a negative sign for mortgage rates.

(c) Calculate the standard error of each coefficient.(d) Is the coefficient significantly different from zero?

49

Page 50: Introduction to Econometrics 1

Exercise

You are given the following ANOVA table with missing data of a three variable regression results.

Source of variation Sum of squares (SS)

d.f. Mean sum of squares (MSS)

Explained sum of squares due to regression, (ESS)

20.456 2

Residual sum of squares, (RSS)Total variation, (TSS)

25.123 20

It is required the following:

(a) Please complete the table.(b) Compute the value of RSS.(c) What are the d.f of the ESS and RSS. The RSS has n-3 degrees of freedom.(d) Calculte the F-ratio.

50

Page 51: Introduction to Econometrics 1

Exercise

Please consider the following table that represents the fees of a closed-end mutual fund in relation to the net asset value nominated in billions pounds.

Fees nominated in percentages Net asset value in billions pounds0.98 200.88 300.76 400.65 500.62 600.58 700.59 800.42 900.48 1000.47 1100.37 1200.34 1400.23 1500.22 160

(a) Draw a scatterplot to show the relationship between fees and net asset value.(b) Estimate the regression equation and interpret the results.(c) Compute the t-statistics.(d) Are the coefficients of the fees related to the net asset value? Which test are

you going to use?

51

Page 52: Introduction to Econometrics 1

(e) Transform your variables to natural logarithmic and compare the R2 of the linear and log linear regressions.Draw your conclusion based on the findings?

(f) Use the p - values from the regression output to conclude if the t-statistics are significant. Check your results based on the 5 % significance level. p-values below the 5% are significant.

(g) Check the adjusted R2 and the standard errors of the logarithmic regression.

Exercise

Please consider the Cobb-Douglas production function where the dependent variable is output and the independent variables are labour input and capital input. I have attached the table with the relevant data. The output is nominated in millions. The labour input is nominated in thousands of persons and the capital is nominated in millions USD.

Year Output Labour , (x1) Capital, (x2)2000 1056780 2678 20455402001 1105780 3456 22456202002 1294780 4567 23689302003 1378940 5623 24678402004 1489620 6278 30878502005 1589230 6589 31567602006 1683490 7654 33678702007 1734890 7894 38907802008 1892340 8765 40357902009 1923450 8124 45789202010 2098340 9534 50987402011 2209870 10345 54678602012 2381340 11567 55678702013 2487650 12567 6087790

It is required to calculate the following:

52

Page 53: Introduction to Econometrics 1

(a) Estimate the regression equation. The dependent variable is output and the independent variables are labour and capital. Interpret the slope coefficients of labour and capital.

(b) Draw a scattergraph to see the relationship between the variables.(c) Test the hypotheses that and = 0.(d) Calculate the t-statistics.(e) Is the coeeficient of labour statistically different from zero?(f) Did you expect negative or positive signs of the coefficients of the

independent variables?

Example

I have attached a detailed example of multiple regressions that was performed during different time periods based on different sectors of investment trusts.There is also references based on the literature review of my PhD thesis. You can see the intercept and the coefficients values and their related t-statistics. In addition, the table include the adjusted R2 and the Durbin Watson statistic and we will cover it in the subsequent chapter related to autocorrelation. The aesterix denote the level of significance at the 5% or 1% significance level. The example is related to Fama and French’s three factor model of UK excess NAV return. The dependent variable is UK excess NAV return and the independent variables are market, size, and book – to – market.

You are required to consider carefully the following table and spot the level of significance and how it affects the adjusted R2 of the whole multipl regression model. Please write the regression equation for each period. I have included interpretation of the following table.

Table 1: Fama and French’s three factor model of UK excess NAV return

We use 12, 36, 60 and 109 observations by applying a rolling methodology. The sample includes 16 sectors of UK investment trusts with total number of 120 funds. We use two style indices to measure the size effect, and the book-to-market effect. Specifically, the UK size effect is measured as the difference between the return on the FTSE Smaller Companies index and the return on the FTSE 100 index. The FTSE 100 is used as a proxy for the return on large companies. The UK book to maket effect is measured as the difference between the return on the FTSE 350 Growth index and the return on the FTSE 350 Value index.

53

Page 54: Introduction to Econometrics 1

AITC Category Coefficients 1Y 3Y 5Y 9YGlobal Growth Adj R2 0.89 0.71 0.67 0.48

3.87( 3.30)**

0.72 ( 1.27)

1.91(3.01)**

-0.56(-0.89)

Market 1.68(9.70)**

0.81(9.74)**

0.65(4.45)**

0.69(5.85)**

Size 1.29(2.99)**

0.73(1.91)*

0.33(3.25)**

0.49(3.30)**

Book-to-market -1.85(-1.99)*

0.71(2.11)*

-0.05(-0.22)

0.39(1.58)

D/W 2.10 2.11 1.96 1.95Global Growth & Income

Adj R2 0.87 0.71 0.83 0.55

1.60(2.95)**

1.18(2.02)*

0.37(0.97)

-0.46(-1.15)

Market 0.95(8.60)**

0.82(10.89)**

0.86(17.78)**

0.66(6.92)**

Size 0.66(4.22)**

-0.00(-0.02)

-0.05(-1.18)

-0.02(-0.17)

Book-to-market 0.72(1.55)

-0.28(-0.52)

-0.11(-0.48)

-0.18(-0.77)

D/W 2.10 2.13 1.98 2.04Global Smaller Companies

Adj R2 0.89 0.74 0.67 0.49

3.54(3.24)**

-0.42(-0.33)

1.91 (3.01)**

-0.45(-0.70)

Market 1.64(10.34)**

1.13(10.06)**

0.65(4.45)**

0.71(5.77)**

Size 1.25(2.99)**

0.20(0.99)

0.33(3.25)**

0.49(3.30)**

Book-to-market -1.73(-1.91)*

-0.89(-1.78)*

-0.05(-0.22)

0.39(1.57)

D/W 2.03 1.95 1.96 1.97UK Growth Adj R2 0.90 0.84 0.80 0.78

2.27(1.60)

0.81 (1.50)

0.53(1.29)

0.12(0.34)

Market 0.82(6.64)**

0.69(13.15)**

0.82(15.56)**

0.82(13.54)**

Size 0.20(2.36)**

0.68(5.96)**

0.02(2.18)*

0.83(9.65)**

Book-to-market -0.18(-2.10)*

-0.63(-1.73)*

-0.40(-1.26)

0.06(0.23)

D/W 2.00 1.98 2.05 2.01UK Growth & Income Adj R2 0.84 0.81 0.77 0.76

1.88(1.08)

1.23 (2.02)*

0.22(0.46)

0.06(0.16)

Market 0.74(4.29)**

0.68(11.00)**

0.86(13.55)**

0.77 (9.24)**

Size 0.23(2.25)*

0.70(5.70)**

-0.11(-1.03)

0.82 (9.48)**

Book-to-market -0.13(-1.09)

-0.67(-2.43)**

-0.63(-1.83)*

0.06(0.22)

D/W 2.07 1.95 2.10 1.93

54

Page 55: Introduction to Econometrics 1

UK Smaller Companies

Adj R2 0.89 0.84 0.69 0.79

2.43(1.65)*

0.85(1.58)

1.97(2.43)**

0.13(0.38)

Market 0.84(5.92)**

0.68(13.44)**

0.54(4.76)**

0.83(13.94)**

Size 0.20(2.32)*

0.70(6.42)**

0.60(5.21)**

0.84(9.54)**

Book-to-market -0.19(-2.06)*

-0.67(-1.86)*

-0.10(-0.84)

0.06(0.23)

D/W 2.04 1.97 1.99 2.06North America Adj R2 0.53 0.76 0.79 0.69

-0.93(-0.89)

0.99(1.53)

0.48 (1.05)

0.64(1.77)*

Market 0.47(3.52)**

0.92(10.11)**

0.95 (14.82)**

0.77(9.53)**

Size -0.61(-1.86)*

0.67(2.33)*

0.62(2.98)**

0.43(4.37)**

Book-to-market 1.51(-1.80)*

-0.21(-1.30)

-0.09(-0.98)

0.19(0.97)

D/W 2.05 2.09 1.99 2.06North America Smaller Companies

Adj R2 0.79 0.84 0.61 0.69

1.98(1.37)

0.82(1.44)

0.28(0.25)

0.63(1.73)*

Market 0.58(3.16)**

0.86(2.97)**

0.69(6.23)**

0.71(9.56)**

Size 0.93(3.34)**

0.49(3.46)**

0.40(1.73)*

0.43(4.40)**

Book-to-market -1.30(-1.79)*

0.25(0.79)

-0.43(-1.76)*

0.18(0.94)

D/W 1.95 1.92 1.97 2.07Far East (Including Japan)

Adj R2 0.68 0.78 0.75 0.53

-1.45(-0.66)

0.93(1.38)

0.87(1.60)

-0.17(-0.32)

Market 0.63(2.07)**

0.96(10.05)**

0.91(12.96)**

0.82(11.06)**

Size 0.12(0.34)

0.24(1.92)*

0.14(1.08)

0.27(2.03)*

Book-to-market -0.03(-0.04)

-0.16(-1.03)

-0.37(-1.21)

0.24(1.09)

D/W 1.96 2.15 2.08 1.97Far East (Excluding Japan)

Adj R2 0.63 0.44 0.24 0.25

-1.75(-0.49)

0.85(0.68)

0.85(0.81)

-0.09(-0.11)

Market 0.29(1.35)

0.79(4.28)**

0.57(2.54)**

0.39(4.01)**

Size -0.13(-0.17)

-0.13(-0.42)

-0.26(-0.74)

0.08(0.63)

Book-to-market 2.10(1.86)*

0.84(1.19)

-0.12(-0.20)

0.49(2.31)*

D/W 2.04 2.02 1.92 1.92Japan Adj R2 0.91 0.74 0.77 0.53

55

Page 56: Introduction to Econometrics 1

4.40(5.00)**

-0.11(-0.08)

-0.14(-0.24)

-0.18(-0.33)

Market 1.70(10.85)**

1.16(9.78)**

1.13(16.43)**

0.81

(19.99)**Size 0.83

(2.04)*0.24

(0.86)0.51

(3.05)**0.28

(2.07)*Book-to-market -0.76

(-2.82)**-0.84

(-1.72)*-0.42

(-1.21)0.53

(1.55)

D/W 1.93 2.03 1.96 1.98Japanese Smaller Companies

Adj R2 0.80 0.80 0.65 0.53

3.54 (1.99)*

0.35(0.32)

0.40(0.48)

-0.18(-0.33)

Market 9.90(9.50)**

1.15(12.57)**

0.98(8.01)**

0.81(19.99)**

Size 5.40(6.00)**

0.84(3.96)**

0.59(2.82)**

0.28(2.07)*

Book-to-market -1.01(-1.50)

-0.51(-0.82)

-0.37(-0.77)

0.26(1.78)*

D/W 1.95 1.92 1.96 1.98Europe Adj R2 0.37 0.78 0.75 0.41

0.84(1.36)

0.68(1.19)

0.67(1.33)

-0.03(-0.07)

Market 0.60(2.85)**

0.73(10.05)**

0.79(11.05)**

0.67(5.23)**

Size 0.32(0.75)

0.16(1.52)

0.04(0.50)

0.16(0.93)

Book-to-market -0.25(-0.37)

-0.03(-0.12)

-0.02(-0.08)

-0.02(-0.08)

D/W 1.94 1.99 1.97 2.08European Smaller Companies

Adj R2 0.89 0.90 0.88 0.71

0.69(0.88)

-0.11(-0.28)

0.48(1.52)

-0.48(-1.28)

Market 0.90(12.45)**

0.86(21.13)**

0.84(16.27)**

0.84(10.13)**

Size 0.44(2.30)*

0.32(3.19)**

0.23(2.66)**

0.49(3.69)**

Book-to-market 0.50(1.07)

0.40(1.92)*

0.02(0.11)

0.30(-1.36)

D/W 2.02 2.05 2.02 2.06Country Specialists Far- East

Adj R2 0.83 0.87 0.80 0.53

1.65(2.45)**

0.32(0.52)

1.16(1.82)*

-0.17(-0.30)

Market 0.93(7.91)**

0.99(13.50)**

0.94(13.62)**

0.82(6.62)**

Size 0.64(3.58)**

0.32(3.45)**

-0.02(2.31)*

0.27(2.11)*

Book-to-market 0.45(0.78)

0.25(-4.18)**

-0.07(-0.91)

0.24(1.04)

56

Page 57: Introduction to Econometrics 1

D/W 2.01 2.09 2.06 1.97Sector Specialists Property

Adj R2 0.67 0.58 0.30 0.69

3.17(1.50)

0.22(0.20)

0.99(0.94)

0.60(1.64)*

Market 0.90(4.39)**

0.70(6.03)**

0.51(3.23)**

0.70(9.53)**

Size 1.03(1.53)

0.46(2.00)*

0.35(2.04)*

0.42(4.28)**

Book-to-market -1.95(-3.40)**

-0.40(-1.58)

-0.23(-1.09)

0.19(0.99)

D/W 2.05 2.01 2.09 2.04Total observations 12 36 60 109Source: author calculation* represents t –value that is statistically significant at 5% significance level** represents t-value that is statistically significant at 1% significance level

According to table 1 , the adjusted R squared for most AITC categories is higher than 0.50. In addition, there is no evidence of first order autocorrelation in the above regressions as on average the Durbin/Watson statistic (D/W) for each sector is close to 2.00. For example, the European Smaller Companies category has an adjusted R squared value of 0.89 for the first 12 months and 0.88 for the first 60 months. Similarly, UK Growth has an adjusted R squared value of 0.90 for the first 12 months and 0.84 for the first 36 months.

As indicated above, is used to measure the ability of managers to outperform the base index. A positive and statistically significant indicates a skilled fund manager whose decisions add value to the fund. Rhodes (2000) argued, that “persistent performance shows that some fund managers are able to outperform their peers. This implies that the fund managers must either have access to information that is insider or not widespread or make use of information in a speedier way than other managers. As markets become more efficient it will be more difficult for any fund manager to outperform the market continuously”(Rhodes,2000,p.7). On the other hand, negative

values or statistically insignificant values represent inferior or neutral performance of the manager. In other words, a negative indicates a poorly performing manager whose decisions affect negatively the value of the fund. According to Table 44, the results are mixed. Ten out of the sixteen sectors display an that is positive and statistically significant at the 1% and 5% level (in one-tailed tests). In more detail, Global Growth and Income shows a positive and statistically significant for the first year at the 1% level and at the 5% significance level for the first three years. Japan displays a significant for the first year at the 1% significance level. Global Smaller Companies show a positive and statistically significant at the 1% level the first five years. Finally, Japanese Smaller Companies and Country Specialist Far East display a positive and statistically significant for the first 12 months at the 1% and 5% levels respectively. On the other hand, the rest of the sectors show a mixed picture of positive and negative alphas that are not statistically significant at the 5% or 1% levels. Furthermore, only two alphas are significant for the full nine years at the 5% level in a one-tailed test. So while there is some evidence for managerial performance persistence in the short-term, there is little evidence for persistence in the long-run.

Most of the sectors show significant t-statistics for both the market return and size. The t-statistic for the UK size effect measured as the difference between the return on

57

Page 58: Introduction to Econometrics 1

the FTSE Smaller Companies index and the return on the FTSE 100 index is statistically significant at the 1% and 5% level. On the other hand, the UK book-to-market effect measured as the difference between the return on the FTSE 350 Growth index and the return on the FTSE 350 Value index is negatively and positively significant for some of the sectors and statistically insignificant for most of the sectors. Thus, consistent with Pontiff (1997), we find that the book-to-market factor does not seem to have any significant explanatory power. Through his cross-sectional regression, Pontiff found that the book-to-market effect only influenced funds with low premiums and discounts.

Example from my PhD thesis entitled as a multifactor model of investment trust discounts. A comparative study of UK investment trusts and US closed-end funds.

Econometrics refers to the application of economic theory to the economic variables or financial theory to financial data through statistical techniques in order to test hypotheses and estimate their coefficients. I will refer to the literature review of factor selection of a multifactor risk model and expectations concerning their coefficients. It is an excellent example to help you understand how research methods are used in postgraduate and research degrees.

The most common approach for choosing factors that affect stock returns are firm specific attributes. A well known model is the three factor model of Fama and French (1993). Fama and French group common stocks according to their size and their ratios of book to market value.

From the standpoint of both the academic researcher and practitioner, it is crucial to be able to identify which factors best capture stock return variation and specifically the discount variation. As a result, there has been a proliferation of research that attempts to identify the various factors. The importance of market risk, book-to-market risk and small firm risk in explaining cross-sectional differences in stock returns is discussed in Fama and French (1993). On the other hand, Carhart (1997) adds a fourth factor to these three risk measures, which is based on the results of Jegadeesh and Titman (1993), that mutual funds are affected by momentum, namely that past winners will continue to be next winners the following year and past losers will continue to be losers. In other words, a firm’s past return helps to predict

58

Page 59: Introduction to Econometrics 1

future returns (Chan, Jegadeesh and Lakonishok,1996; Chopra, Lakonishok and Ritter, 1992; DeBondt and Thaler,1985; Rosenberg, Reid and Lanstein, 1984). We found some evidence for this managerial performance persistence in our empirical studies. Momentum creates the suspicion that management performance might influence the discount return. Momentum is defined as the difference between funds NAV return and the sector’s average NAV returns. Past performance is measured over the full history of the funds. We test the momentum anomaly documented in Carhart’s four-factor model.

We add two more factors to these four risk measures. The first one is based on the results of Lee, Shleifer and Thaler (1991) that closed-end funds are subject to systematic “investor sentiment risk”. In more detail, one possible explanation for variations in the discount is based on the existence of two categories of investors. The first category includes rational investors, namely arbitrageurs, who make rational decisions in accordance with their preferences. The second category includes the so-called noise traders. These investors do not act fully rationally, and their investment decisions are considered as unpredictable. In some periods they overestimate expected returns relative to the rational expectations and in other periods they underestimate them. Therefore, prices of securities are a function of both categories.

The interaction of these two categories of investors may help to explain the variation of the discount. For arbitrageurs to buy funds that are characterized by a constant discount would be costly and ,therefore, not always profitable. In more detail, if investment trusts trade at a discount and then at a premium to NAV, an apparent arbitrage profit can be realised by selling the fund’s share above the risk free rate. However, if the discount stays relatively constant over the investment horizon, the arbitrageurs make no profit. According to the descriptive statistics presented in chapter 5, there is a wide range between the lower and upper bounds of the discount in each AITC and CEFA category, which suggests that there is active interaction between both parties. This active interaction can be measured quantitatively by using as proxies the retail flows of each sector or by using an investor sentiment index.

The retail flow measure was first used by Malkiel (1977) to measure general investor sentiment regarding investment companies. According to him “when individuals were showing little enthusiasm for open-end funds, it is probable that they showed a similar lack of enthusiasm for closed- end funds”(Malkiel, 1977, p.856). The relationship between changes in fund discounts and net redemptions of open-end funds can be seen in Figure 1

Figure 1 : Relationship between changes in fund discounts and net redemptions

P SC P

PA PA So

Change in D0 D0

59

Page 60: Introduction to Econometrics 1

Discount D1 D1

Q Sales Flow Q

Closed-end funds Open-end funds

Source: Malkiel (1977), the valuation of closed-end investment company shares

In Figure 1, according to Malkiel (1977), “ S0 is the supply curve of open-end funds, which is infinitely elastic because open-end funds will sell as many new shares or redeem outstanding ones at net asset value (PA). The supply curve for the closed-end funds (Sc) is perfectly inelastic as there is a fixed supply of shares. Shifts in the demand curve for open-end shares will result in changes in the quantity of shares outstanding or net sales” (Malkiel, 1977, p.856). Shifts in the demand curve for closed-end funds will result in changes in fund discounts or premiums. In a perfect market, demand shifts will not change the level of the discount but due to the interactions of noise and rational traders, we expect net sales of open-end funds to be associated with changes in the discount. Malkiel (1977) found that an increase in net sales of open-end funds was associated with an increase in the average discount, but the coefficients were not significant. The problem with his study was that he only used 24 US closed-end funds and not all companies could be included in his cross-sectional regression because of missing data. So the explanatory power of his model was weak.

On the other hand, Gemmill and Thomas (2002) conducted a more detailed analysis to find out whether investor sentiment can explain fluctuations in the discount. They found that monthly flows of retail investment into particular sectors were closely related to changes in sector discounts. According to their findings investor sentiment not only affects the discount in the short-term but may also influence its level over periods of several years. They investigated 11 UK sectors which consisted of 158 closed-end equity funds.

The second additional factor is management expenses. Malkiel (1977) defines management expenses as a percentage of fund assets. The problem is that the ratio of expenses to cash flow is not constant over the life of a fund. As the long-term expense to cash flow ratio is not observable, it can be proxied by current monthly expenses as a percentage of fund assets. In this chapter, we define expenses as the monthly management charge (or staff costs), auditors and custody fees, directors fees and marketing costs as a percentage of fund assets. We test two contradictory hypotheses. The first one is Barclay, Holderness and Pontiff’s (1995) view that a possible explanation of the discount is that as managers try to protect their private benefits, expenses increase over the long term and have an effect on the discount. The second one is Malkiel (1977) who claims that an expense ratio will not be significant due to the difficulty of measuring it accurately. The ratio of expenses to cash flow is not constant over the life of a fund.

The market factor is the difference between the return on the relevant index and one-month risk-free rate. The UK size effect is measured as the difference between the return on the FTSE Smaller Companies index and the return on the FTSE 100 index. The FTSE 100 is used as a proxy for the return on large companies. The size effect is

60

Page 61: Introduction to Econometrics 1

used to test the anomaly documented in Fama and French’s three-factor model. The UK book-to-market effect is measured as the difference between the return on the FTSE 350 Growth index and the return on the FTSE 350 Value index. The US size effect is measured as the difference between the return on S&P Small Cap and the return on the S&P 500 Composite as it is used as a proxy for the return on large companies. The US book-to-market effect is measured as the difference between the return on the S&P 500/Barra Growth index and the return on the S&P 500/Barra Value index.

From the theory, In terms of signs of the independent variables, we expect to find positive values for the coefficients of size, market and sentiment and negative values are expected for the book-to-market effect and momentum. In more detail, size effect was documented in Fama and French’s model in terms that small firms outperform big companies and therefore we expect a positive effect. Market effect has been significant in return explanation based on the CAPM. Sentiment is expected to have a positive value results based on the hypothesis of Lee, Shleifer and Thaler (1991) that closed-end funds are subject to systematic “investor sentiment risk”. In more detail, one possible explanation for variations in the discount is based on the existence of two categories of investors. The first category includes rational investors, namely arbitrageurs, who make rational decisions in accordance with their preferences. The second category includes the so-called noise traders. Book-to-market effect is expected to have a negative value based on the hypothesis of Pontiff (1997) who found that the book-to market effect is negative and insignificant and affects funds with low premiums and dicounts. Momentum is expected to have a positive sign during the short-term according to the hypothesis of Jegadeesh and Titman (1993). In more detail, Jegadeesh and Titman (1993) show that buying past winners and selling past losers generates significant profits when returns are measured over three to twelve-month periods. This hypothesis is tested later by using deciles and trying to test the significance between rankings. In contrast, momentumt is expected to have a negative sign over the long term as we assume that information both private and public were incorporated in the prices of funds. Expenses are expected to have positive or negative values as we test two contradictory hypotheses. The first one is Barclay, Holderness and Pontiff’s (1995) view that a possible explanation of the discount is that as managers try to protect their private benefits, expenses increase over the long term and have an effect on the discount. The second one is Malkiel (1977) who claims that an expense ratio will not be significant due to the difficulty of measuring it accurately. The ratio of expenses to cash flow is not constant over the life of a fund.

Fama and French (1993) show the importance of market risk, small firm and book-to-market risk in explaining cross-sectional differences in stock returns. The famous model of Fama and French (1993) was constructed and implemented on various portfolios of shares to explain various anomalies in financial markets in terms of size, book/market ratio etc. This leads to the three factor model. Equation (1) enables us to test the significance of these three factors in relation to the excess discount return. We use the excess discount return because changes in the discount can be interpreted as returns, and the discount was shown to be non-stationary.

(1)

61

Page 62: Introduction to Econometrics 1

where: is the excess discount return for each sector at time t

is the excess market return

is the size factor (small minus big), i.e the difference

between the return on a portfolio of small stocks and the return on a

portfolio of large stocks.

is the book to market factor (high minus low), i.e the

difference between the return on a portfolio of high-book-to-market

stocks and the return on a portfolio of low-book-to- market stocks

is the distrurbance term

We test the significance of the estimated coefficients using t-statistics and F-tests. We also use the Durbin/Watson statistic to check for autocorrelation. The estimation results are shown in Table 2 for the UK and Table 3 for the US.

Table 2 Fama and French’s three-factor model of the UK excess discount return

The regressions by sector are estimated over the period January 1990 to January 2003 by using monthly observations. The sample covers 90 funds for UK investment trusts. The t-statistic for each factor is displayed in parentheses. In addition, we provide tests of autocorrelation.

Factor 1 Factor 2 Factor 3Fund Name by AITC category

D/W F-Ratio AdjustedR2

Market(

Size Book to market

Global Growth 2.07 30.43 0.36 0.26 (7.22)**

0.16 (4.41)**

-0.36 (-2.65)**

Global Growth & Income

2.01 37.52 0.41 0.34 (3.19)**

0.11 (2.30)*

-0.36 (-2.52)**

Global Smaller Companies

1.94 47.17 0.47 0.41 (7.24)**

0.07(1.78)*

-0.40 (-3.23)**

UK Growth 2.05 40.40 0.43 0.28 (3.01)**

0.15 (3.52)**

-0.41 (-3.05)**

UK Growth & Income

1.97 62.85 0.54 0.44 (9.92)**

0.11 (3.03)**

-0.33 (-2.64)**

UK Smaller Companies

1.97 49.72 0.49 0.41 (5.54)**

0.14 (3.32)**

-0.25 (-1.77)*

North America 2.06 35.43 0.40 0.46 (8.08)**

0.10 (1.79)*

-0.31 (-1.88)*

North America Smaller Companies

2.07 25.75 0.32 0.26 (2.24)*

0.13 (2.54)**

-0.64 (-3.43)**

Far East (Including 2.06 45.77 0.46 0.39 0.14 -0.21

62

Page 63: Introduction to Econometrics 1

Japan) (4.05)** (3.18)** (-1.51)Far East (Excluding

Japan)2.02 24.33 0.31 0.27

(7.31)**0.14

(3.07)**-0.24

(-1.41)Japanese Smaller

Companies2.03 34.57 0.39 0.30

(3.23)**0.11

(2.19)*-0.43

(-2.78)*Japan 1.95 46.12 0.47 0.40

(5.58)**0.15

(3.52)**-0.27

(-1.92)*Europe 2.03 49.67 0.49 0.36

(3.47)**0.12

(2.55)*-0.40

(-2.84)**Country Specialists

– Far East2.02 38.17 0.42 0.39

(5.89)**0.15

(3.37)**-0.45

(-2.41)**Sector Specialists –

Property2.06 30.78 0.37 0.25

(7.28)**0.17

(4.70)**-0.36

(-2.65)**European Smaller

Companies2.05 34.67 0.39 0.30

(3.23)**0.11

(2.19)*-0.42

(-2.73)**Average 2.02 0.42

Source : author calculation

* Statistically significant at 5% level

** Statistically significant at 1 % level

From Table 2, we see that on average, three-factor model can explain 42% of the variations in the UK excess discount return by taking into consideration market, size, and the book-to-market effect. As expected the coefficients of the market and size effect are positive and statistically significant for all sectors, while the coefficients of the book-to-market effect are negative and significant for most of the 16 sectors. Thus we find that the book-to-market factor does have an explanatory power. It has a significant negative influence on excess discount return. Our results contradict Pontiff (1997), who found that the book-to-market effect is insignificant and affects funds with low premiums and discounts.

On the other hand, there is no evidence of autocorrelation in the times series of 158 monthly observations from January 1990 to January 2003. On average, the D/W statistic is 2.02. The F-ratio is used to test the overall significance of the regression test. Overall, all the sectors show a significant F-ratio which implies that the model as a whole has a significant degree of explanatory power. As an example, Europe has an F-ratio of 49.67 which exceeds the tabulated value of 3.04 at the 5% significance level with degrees of freedom of 2 and 155.

To summarise the UK results, the UK book-to-market effect, measured as the difference between the return on the FTSE 350 Growth index and the return on the FTSE 350 Value index has a negative and statistically significant influence for most of the AITC sectors. As an example, UK Growth and North America Smaller Companies display significant t-statistics of -3.05 and -3.43. The coefficient for the UK size effect, measured as the difference between the return on the FTSE Smaller Companies index and the return on the FTSE 100 index, is statistically significant at the 1% and 5% level for most of the sectors. For example, Global Growth and UK Growth and Income show significant t-statistics of 4.41 and 3.03. Finally, the market effect is positive and statistically significant for all sectors. Global Growth and UK

63

Page 64: Introduction to Econometrics 1

Growth and Income display significant t-statistics of 7.22 and 9.92. Far East (excluding Japan) and Japan show t-statistics of 7.31 and 5.58.

Table 3 Fama and French’s three factor model of the US excess discount return

The regressions by sector are estimated over the period January 1990 to January 2003 by using monthly observations. The sample covers 30 funds for US closed-end funds. The t-statistic of each factor is displayed in parentheses. In addition, we provide tests of autocorrelation.

Factor 1 Factor 2 Factor 3Fund Name by CEFA category

D/W F-Ratio AdjustedR2

Market(

Size Book to market

Equity Income 1.99 84.25 0.62 0.62 (11.73)**

0.02(0.55)

0.00(0.01)

Global Equity 1.96 67.45 0.56 0.58 (13.20)**

0.14 (2.50)**

0.01(0.92)

Growth and Income 1.96 67.00 0.48 0.55 (7.53)**

0.09(0.82)

0.06(0.29)

Growth Domestic 1.90 117.14 0.69 0.67 (13.2)**

0.22(2.31)*

0.60 (3.86)**

Average 1.95 0.59 Source : author calculation

* Statistically significant at 5% level

** Statistically significant at 1 % level

From Table 3, we see that the three-factor model can explain 59 % of the variation of the US excess discount returns by taking into consideration market, size, and the book-to-market effect. As expected, coefficients are statistically significant for the market and size effect for most of the sectors. There is no evidence of autocorrelation in the time series of 158 observations from January 1990 to January 2003. On average

64

Page 65: Introduction to Econometrics 1

the Durbin/Watson statistic is 1.95. Overall, all the sectors show a significant F-ratio, implying that the model as a whole has explanatory power. As an example, the Growth and Income sector has an F-ratio of 67.00 which exceeds the tabulated value of 3.04 at the 5% significance level with degrees of freedom of 2 and 155.

To summarise the US results, the US book-to-market effect is statistically insignificant for most of the CEFA sectors except the Growth Domestic sector which has an unexpected positive sign. One possible explanation is that this sector is sensitive to the book to-market index as it is dominated from growth funds and behaves in an irregular way. The t-statistics of the US size effect are statistically significant at the 1% significance level for Global Equity 2.50 and the Growth Domestic sector 2.31. On the other hand, the market effect is highly statistically significant for all sectors. Equity Income, Global Equity, and Growth and Income display significant t-statistics of 11.73, 13.20, and 7.53.

Carhart’s(1997) four-factor model

Carhart (1997) constructed his four-factor model using Fama and French’s (1993) three-factor model plus an additional factor capturing Jegadeesh and Titman’s (1993) one-year momentum anomaly. This was motivated by the three-factor model’s inability to explain cross-sectional variation in portfolio returns (Fama and French (1993)). Chan, Jegadeesh and Lakonishok (1996) suggest that the momentum anomaly is market inefficiency due to slow reaction to information. However, the effect is robust to time periods (Jegadeesh and Titman (1993), Asness, Liew, and Stevens (1996)).

Equation (2) enables us to test the significance of the four factors for each sector in relation to the excess discount return.

where: is the excess discount return for each sector at time t

is the excess market return

is the size factor (small minus big), i.e the difference between

the return on a portfolio of small stocks and the return on a portfolio of

large stocks.

is the book to market factor (high minus low), i.e the

the difference between the return on a portfolio of high book-to-market

stocks and the return on a portfolio of low book-to- market stocks

65

Page 66: Introduction to Econometrics 1

(RNAVfd,t – RNAVse,t) is the momentum factor measured as the

difference between the fund and sector return

is the distrurbance term

We test the significance of the estimated coefficients using t-statistics and F-tests. We also use the Durbin/Watson statistic to check for autocorrelation. The estimation results are shown in Table 4 for the UK and Table 5 for the US.

Table 4 Carhart’s four-factor model of UK excess discount return

The regressions by sector are estimated over the period January 1990 to January 2003 by using monthly observations. The sample covers 90 funds for UK investment trusts. The t-statistic of each factor is displayed in parentheses. In addition, we provide tests of autocorrelation.

Factor 1 Factor 2 Factor 3 Factor 4Fund Name by AITC category

D/W F-Ratio AdjustedR2

Market(

Size Book to market

Momentum(RNAVfd,t

– RNAVse,t)

Global Growth 2.05 24.23 0.37 0.28 (3.03)**

0.15 (3.22)**

-0.35 (-2.50)**

-0.56(-1.18)

Global Growth & Income

2.10 47.02 0.54 0.48 (9.88)**

0.07 (2.03)*

-0.32 (-2.42)**

0.79(1.23)

Global Smaller Companies

1.98 39.61 0.50 0.45 (9.56)**

0.05(1.51)

-0.38 (-2.85)**

0.12(0.39)

UK Growth 2.08 32.70 0.45 0.33 (2.58)**

0.14 (2.75)**

-0.39 (-2.99)**

-1.24(-1.71)*

UK Growth & Income

2.00 50.80 0.56 0.44 (10.58)**

0.09 (2.84)**

-0.24 (-2.01)*

-3.01 (-2.64)**

UK Smaller Companies

2.05 30.77 0.43 0.47 (8.29)**

0.11 (2.03)*

-0.31 (-1.87)*

-0.57 (-1.93)*

North America 1.96 27.92 0.41 0.47 (8.25)**

0.09(1.64)

-0.27 (-1.58)

-0.56(-1.84)*

North America Smaller Companies

2.02 32.11 0.45 0.32 (9.23)**

0.12 (3.35)**

-0.41 (-2.89)**

-0.08(-0.39)

Far East (Including Japan)

2.07 32.15 0.44 0.37 (9.71)**

0.15 (3.74)**

-0.22(-1.50)

-1.13(-0.91)

Far East (Excluding Japan)

2.11 29.68 0.43 0.39 (3.12)**

0.09 (1.75)*

-0.22 (-1.77)*

-1.21 (-2.72)**

Japanese Smaller Companies

2.10 19.93 0.33 0.32 (6.23)**

0.18 (3.74)**

-0.33 (-1.88)*

-0.71(-1.83)*

Japan 2.05 24.23 0.37 0.28 (7.54)**

0.15 (4.17)**

-0.33 (-2.42)**

-0.51(-1.51)

Europe 2.00 32.65 0.45 0.33 (2.77)**

0.12 (2.62)**

-0.41 (-2.76)**

-0.08(-0.42)

Country Specialists 2.04 21.17 0.34 0.33 0.16 -0.38 -0.68

66

Page 67: Introduction to Econometrics 1

- Far East (2.60)** (2.93)** (-2.09)* (-1.82)*Sector Specialists –

Property1.95 22.09 0.35 0.41

(3.44)**0.04

(0.55)-0.26

(-1.51)-0.88

(-1.77)*European Smaller

Companies2.03 31.57 0.37 0.28

(2.63)**0.11

(2.20)*-0.42

(-2.67)**-0.31(1.20)

Average 2.04 0.42Source : author calculation

* Statistically significant at 5 % level

** Statistically significant at 1% level

Table 4 shows that on average, the four-factor model can explain 42% of the variation in the UK excess discount return by taking into consideration market, size, the book-to-market factor and momentum. As expected the coefficients for the UK are positive and statistically significant for the market and the size effect. In addition, the coefficients are negative and statistically significant for the book-to-market effect for some of the sectors. The explanatory power is no better than the three-factor model. However, momentum is only negative and statistically significant (at the 5% level in one-tailed tests) in five out of the sixteen sectors. Thus, we have very limited evidence that managerial performance has any influence on the excess discount return in the UK in the long-run. There is no evidence of autocorrelation. On average the Durbin/Watson statistic is 2.04. Overall, all the sectors show a significant F-ratio. As an example, the North America Smaller Companies sector has an F-ratio of 32.11 which exceeds the tabulated value of 2.65 at the 5% significance level with degrees of freedom of 3 and 154.

To summarise the UK results, the book-to-market effect is negative and statistically significant for some of the AITC sectors and insignificant for most of the sectors. As an example, UK Growth and Global Growth and Income display t-statistics of -2.99 and -2.42. The UK size effect is positive and statistically significant at the 1% and 5% level for most of the sectors. For example, Global Growth and Far East including Japan show significant t-statistics of 3.22 and 3.74. Also, the market effect is positive and highly statistically significant for all sectors at the 1% level. For example, UK Growth and Income and UK Smaller Companies display t-statistics of 10.58 and 8.29. Japan and Sector Specialist Property shows t-statistics for the market effect of 7.54 and 3.44.

Table 5 Carhart’s four -factor model of the US excess discount return

The regressions by sector are estimated over the period January 1990 to January 2003 by using monthly observations. The sample covers 30 funds for US closed-end funds. The t-statistic of each factor is displayed in parentheses. In addition, we provide tests of autocorrelation .

Factor 1 Factor 2 Factor 3 Factor 4Fund Name by CEFA category

D/W F-ratio AdjustedR2

Market(

Size Book to market

Momentum(RNAVfd,t – RNAVse,t)

Equity Income

1.93 83.32 0.68 0.65 (12.41)**

0.44 (2.94)**

0.08(1.05)

0.09(0.24)

67

Page 68: Introduction to Econometrics 1

Global Equity 2.02 52.00 0.57 0.61 (8.01)**

0.46 (2.81)**

0.10(0.93)

0.06(0.13)

Growth and Income

1.92 77.23 0.66 0.55 (12.20)**

0.48 (2.92)**

0.16(-2.01)*

0.03(0.11)

Growth Domestic

1.97 89.41 0.70 0.68 (14.29)**

0.53 (3.32)**

0.12(1.40)

-0.00(-0.04)

Average 1.96 0.65

Source : author calculation

* Statistically significant at 5% significance level

** Statistically significant at 1 % significance level

Table 5 shows that, on average, the four-factor model can explain 65% of the variation in the US excess discount return by taking into consideration the market effect, size, the book-to-market effect and momentum. Thus, there is a small improvement compared with the three-factor model. The coefficients for the US are statistically significant for the market and the size effect for all sectors. There is no autocorrelation. On average the Durbin/Watson statistic is 1.96. Overall, all the sectors show a significant F-ratio. As an example, Growth Domestic sector has an F-ratio of 89.41 which exceeds the tabulated value of 2.65 at the 5% significance level with degrees of freedom of 3 and 154.

However, the US book-to-market effect is statistically insignificant for most of the CEFA sectors. As an example, Global Equity has an insignificant t-statistic of 0.93. The t-statistic for the US size effect is statistically significant at the 1% significance level for Global Equity 2.81 and Growth Domestic sector 3.32. Finally, the market effect is highly statistically significant for all sectors. Equity Income, Global Equity and Growth and Income display significant t-statistics of 12.41, 8.01, and 12.20. However, the momentum variable is insignificant in all four sectors.

Guirguis six-factor model (I)

In this section we extend the four-factor model of Carhart (1997) by adding sentiment and expenses, as defined earlier. The purpose is to strengthen the four factor model and finally verify which factors are statistically significant and able to explain the fluctuations and persistence of the excess discount return over the longer term.

Equation (3) enables us to test the significance of the six factors in relation to excess discount return.

where: is the excess discount return for each sector at time t

is the excess market return

68

Page 69: Introduction to Econometrics 1

is the size factor (small minus big), i.e the difference between

the return on a portfolio of small stocks and the return on a portfolio of

large stocks.

is the book to market factor (high minus low), ie the

difference between the return on a portfolio of high book-to-market

stocks and the return on a portfolio of low book-to- market stocks.

(RNAVfd,t – RNAVse,t) is the momentum factor

(Flowt – Flowt -1) is the difference of retail flows by sector used as a

proxy for investor sentiment

(expt) is the expense factor.

is the distrurbance term

We test the significance of the estimated coefficients using t-statistics and F-tests. We also use the Durbin/Watson statistic to check for autocorrelation. The estimated results are shown in Table 6 for the UK and Table 7 for the US.

Table 6 Guirguis six -factor model of the UK excess discount return.

Equation (3) measures the sensitivity of the excess discount related to the market, size, book- to-market factor, momentum, sentiment and expenses. The regressions by sector are estimated over the period January 1990 to January 2003 by using monthly observations. The sample covers 90 funds for UK investment trusts. The coefficient and t-statistic of each factor is displayed in parenthesis. In addition, we provide tests of autocorrelation.

Factor 1 Factor 2 Factor 3 Factor 4 Factor 5 Factor 6Fund

Name byAITC

category

D/W F/ratio Adj R2 Market Size Book to market

Momentum

(RNAVfd,t

– RNAVse,t)

Sentiment(Flowt - Flowt -1)

Expense(expt)

Global Growth

2.12 17.25 0.39 0.28 (3.98) **

0.13 (2.48) **

-0.01(-0.57)

-0.26(-1.25)

1.49 (3.20)**

0.59(1.38)

Global Growth &

Income

1.93 50.24 0.66 0.43 (10.49) **

0.12 (3.50) **

-0.14(-1.23)

0.33(0.66)

0.87 (3.54)**

-0.25(-1.06)

Global Smaller

Companies

1.96 42.83 0.62 0.40 (7.91)**

0.11 (3.16)**

-0.09(-0.68)

0.60(-1.96)*

0.82 (7.50)**

-0.04(-0.14)

UK Growth

2.04 60.65 0.70 0.58 (12.90)**

0.06 (2.11) *

-0.15(-1.41)

-2.76 (-6.89)**

0.88 (3.18)**

0.08(0.46)

69

Page 70: Introduction to Econometrics 1

UK Growth &

Income

1.97 57.98 0.69 0.45 (11.31)**

0.10 (3.22)**

-0.14(-1.31)

-2.43 (-2.54)**

0.89 (6.60)**

0.68(1.84)*

UK Smaller

Companies

1.99 45.06 0.63 0.40 (9.90)**

0.09 (2.83)**

-0.11(-0.90)

-0.22(-0.49)

1.02 (5.81)**

-0.04(-0.09)

North America

2.02 72.44 0.73 0.42 (9.42)**

0.13 (3.52)**

-0.14(-1.07)

0.02(0.16)

0.98 (13.41)**

0.03(0.06)

North America Smaller

Companies

1.97 70.10 0.73 0.43 (10.61)**

0.11 (3.82)**

-0.17(-1.42)

-1.02(-1.47)

0.97 (11.60)**

-0.11(-0.55)

Far East (Including

Japan)

2.05 65.31 0.71 0.45 (11.27)**

0.13 (4.18)**

-0.18(-1.42)

-1.44(-1.53)

0.68 (6.52)**

-0.49(-2.45)**

Far East (Excluding

Japan)

2.06 62.34 0.70 0.37 (4.78)**

0.12 (3.11)**

-0.15(-1.30)

-1.20 (-3.57)**

0.92 (7.97)**

-0.55(-1.73)*

Japanese Smaller

Companies

1.97 87.79 0.77 0.44 (10.13)**

0.10 (3.07)**

-0.15(-1.33)

0.15(1.05)

0.93(12.98)**

1.13 (2.29)**

Japan 1.94 92.07 0.78 0.46 (11.41)**

0.12 (3.98)**

-0.19 (-1.82)*

0.17(1.42)

0.94(14.58)**

-0.59(-2.71)*

Europe 2.02 49.04 0.65 0.49 (11.26)**

0.10 (3.12)**

-0.34 (-2.81)**

0.15(0.87)

0.32 (3.89)**

-0.34(-0.98)

Country Specialists – Far East

1.97 41.03 0.61 0.39(9.67)**

0.12 (3.33)**

-0.17(-1.23)

-0.27(-0.91)

0.78(8.89)**

-0.21(-0.64)

Sector Specialists – Property

1.97 95.97 0.79 0.43 (9.82)**

0.11 (3.66)**

-0.20 (-1.80)*

-0.15(-0.58)

1.02 (14.78)**

0.14(0.87)

European Smaller

Companies

1.96 53.41 0.61 0.39 (9.67)**

0.12(3.33)**

-0.17(-1.23)

-0.27(-0.91)

0.78 (8.89)**

-0.21(-0.64)

Average 2.00 0.67Source : author calculation

* Statistically significant at 5 % significance level

** Statistically significant at 1% significance level

Table 6 shows that, on average, the six-factor model can explain 67% of the variation in the excess discount return by taking into consideration market, size, book-to- market effect, momentum, sentiment and expenses. The coefficients for the UK are statistically significant for the market, the size effect and sentiment. Book-to-market effect, momentum and expenses show t-statistics that are insignificant or negatively significant. In more detail, momentum is negative and statistically significant, (at the 1% and 5% level in one tailed tests), in four out of the sixteen sectors. Thus, we have limited evidence of managerial performance persistence over the long-run. In addition, our results contradict the hypothesis proposed by Barclay, Holderness and Pontiff’s (1995) view that a possible explanation of the discount is that as managers try to protect their private benefits, expenses increase over the long term and have an effect on the discount. There is no autocorrelation in the times series of our 158 monthly observations from January 1990 to January 2003. On average the Durbin/Watson statistic is 2.00. Overall, all the sectors show a significant F-ratio. As

70

Page 71: Introduction to Econometrics 1

an example, Japan sector has an F-ratio of 92.07 which exceeds the tabular value of F =2.27 at the 5% significance level with degrees of freedom of 5 and 152.

To summarise the UK results, the book-to-market effect and expenses are statistically insignificant for most of the AITC sectors. For example, Global smaller companies, UK smaller companies and North America smaller companies have insignificant t-statistics for the book-to-market effect of -0.68, -0.90, and -1.42. In addition, the same sectors have insignificant t-statistics for expenses of -0.14, -0.09, and -0.55. The t-statistics for the UK size effect and sentiment are statistically significant at the 1% level for most of the sectors. For example, North America Smaller companies, Japanese Smaller companies, Japan and Country specialist Far East have significant t-statistics for the size effect of 3.82, 3.07, 3.98 and 3.33. The same sectors display a statistically significant sentiment effect of 11.60, 12.98, 14.58, and 8.89.

As in the other models, the market effect is positive and highly statistically significant for all sectors. UK Growth, Far East Including Japan, and European Smaller Companies display significant t-statistics of 12.90, 11.27 and 9.67.

Table 7 Guirguis six-factor model (I) of the US excess discount return.

The regressions by sector are estimated over the period January 1990 to January 2003 by using monthly observations. The sample covers 30 funds for US closed-end funds. The t-statistic of each factor is displayed in parentheses. In addition, we provide tests of autocorrelation.

Factor 1 Factor 2 Factor 3 Factor 4 Factor 5 Factor 6Fund Name

byCEFA

category

D/W F/Ratio Adj R2

Market Size Book to market

Momentum(RNAVfd,t – RNAVse,t)

Sentiment(Flowt - Flowt -1)

Expense(expt)

Equity Income

1.98 43.14 0.62 0.40 (7.95)**

0.11 (3.26)**

-0.09(-0.69)

0.60(1.98)*

0.83 (7.53)**

-0.05(-0.20)

Global Equity

2.00 77.15 0.75 0.43 (9.70)**

0.11 (3.32)**

-0.17(-1.44)

-0.17(-0.61)

0.92 (8.80)**

0.18(0.91)

Growth and Income

2.00 56.53 0.68 0.44 (11.19)**

0.10 (3.38)**

-0.20 (-2.48)**

-2.60 (-3.04)**

0.81 (6.01)**

0.61(1.60)

Growth Domestic

2.02 46.28 0.64 0.40 (9.86)**

0.09 (2.77)**

-0.09(-0.72)

-0.22(-0.50)

1.02 (6.06)**

0.05(0.11)

Average 2.00 0.66Source : author calculation

* Statistically significant at 5% significance level

** Statistically significant at 1 % significance level

On average, for the US the six-factor model can explain 66 % of the variation in the excess discount return by taking into consideration market, size, book-to- market effect, momentum, sentiment and expenses. The coefficients for US are statistically

71

Page 72: Introduction to Econometrics 1

significant for the market and the size effect for all sectors. There is no autocorrelation as on average the Durbin/Watson statistic is 2.00. Overall, all the sectors show a significant F-ratio. As an example, Global Equity sector has an F-ratio of 77.15 which exceeds the tabulated value of 2.27 at the 5% significance level with degrees of freedom of 5 and 152. The momentum variable is significant in just two of the four sectors and the expenses variable is insignificant in all sectors. The sentiment factor is highly significant in all four sectors. In more detail, Equity Income, Global Equity, Growth and Income and Growth Domestic category have t-statistics of 7.53, 8.80, 6.01 and 6.06.

The US book-to-market effect is statistically insignificant for most of the CEFA sectors. As an example, Global Equity display an insignificant t-statistic of -1.44. The t-statistic for the US size effect is statistically significant at the 1% significance level for all sectors. Global Equity, Equity Income and Growth and Income have t-statistic of 3.32, 3.26 and 3.28. Similarily, the market effect is highly statistically significant for all sectors. Equity Income, Global Equity and Growth and Income have t-statistics of 7.95, 9.70, and 11.19.

Guirguis six-factor model (II)

In this section we extend the four-factor model of Carhart (1997) by changing the sentiment factor from retail flow to investor sentiment index. The only behavioural data available was for the US market. Investors are said to be confident when the news about the future is good and stock prices are relatively high. According to Yale Management School, the sample of US individual investors from 1989 to 1998 was purchased from W.S.Ponton Inc, a list of “High-Grade Multi-Investors.” Starting in 1999, the sample was a random sample of high-income Americans purchased from Survey Sampling, Inc. We use only US investor sentiment index constructed from Yale School of Management as it was the most suitable one to represent investor sentiment on the market. It is measured in percentage of respondents who support a particular view. It is derived from the responses to a single question that has been asked consistently through time since 1989 to a consistent sample of respondents. The wordings of the question focused on how pessimistic or optimistic are investor’s towards the stock market.

Equation (4) enables us to test the significance of these six factors in relation to excess discount return.

where: is the excess discount return of each sector at time t

is the excess market return

is the size factor (small minus big), i.e the difference between

72

Page 73: Introduction to Econometrics 1

the return on a portfolio of small stocks and the return on a portfolio of

large stocks.

(Rg,t – Rv,t) is the book to market factor (high minus low), i.e the

difference between the return on a portfolio of high-book-to-market

stocks and the return on a portfolio of low-book-to- market stocks

(RNAVfd,t – RNAVse,t) is the momentum factor

(ISIt ) is the difference in percentage of respondents as a proxy of

sentiment factor.

(expt) is the expense factor.

is the distrurbance term

We test the significance of the estimated coefficient using t-statistics and F-tests. We also use Durbin/Watson statistic to check for autocorrelation.

Table 8 Guirguis six factor model (II) of US excess discount return

The regressions by sector are estimated over the period October 1996 to January 2003 by using monthly observations. The sample covers 30 funds for US Closed-end funds. The t-statistic of each factor is displayed in parentheses. In addition, we provide tests of autocorrelation.

Factor 1 Factor 2 Factor 3 Factor 4 Factor 5 Factor 6Fund Name

byCEFA

category

D/W F/ratio Adj R2 Market Size Book to market

Momentum(RNAVfd,t – RNAVse,t)

Sentiment(ISIt )

Expense(expt)

Equity Income

1.97 2.83 0.32 0.06(0.66)

-0.11(-0.98)

-0.22(-0.56)

-0.49 (-8.35)**

0.63 (2.37)**

3.75(1.73)*

Global Equity 2.07 2.38 0.26 0.25 (2.28)*

-0.13(-0.99)

0.12(0.55)

-0.20(-2.08)*

0.03(0.39)

2.37(1.44)

Growth and 2.04 3.18 0.36 0.35 0.13 -0.21 -3.10 0.29 -1.19

73

Page 74: Introduction to Econometrics 1

Income (2.65)** (1.46) (-1.09) (-1.36) (1.93)* (-0.87)Growth Domestic

2.03 3.50 0.39 0.33 (3.52)**

-0.04(-0.43)

-0.19(-1.05)

-1.26(-1.63)

0.21(1.34)

1.11(0.52)

Average 2.03 0.33Source : author calculation

* Statistically significant at 5 % significance level

** Statistically significant at 1% significance level

Table 8 shows that, on average, the six-factor model can explain 0.33 % of the variation in the excess discount return by taking into consideration market effect, size, book-to- market, momentum, sentiment and expenses. The coefficients for US market are statistically significant for 3 out of four sectors at the 1% and 5% for one-tailed tests. On the other hand, there is no autocorrelation in the times series of our 26 monthly observations from October 1996 to January 2003. On average the Durbin/Watson statistic is 2.03. Overall, all the sectors show a significant F-ratio. As an example, Equity Income sector has an F-ratio of 2.83 which exceeds the tabulated value of F 2.27 at the 5% significance level with degrees of freedom of 5 and 152. The US size, book-to-market effect, momentum and expenses are statistically insignificant for most of the sectors of CEFA. On the other hand, the results for the sentiment factor are mixed. In more detail, Equity Income, Growth and Income show a statistically significant sentiment effect of 2.37 and 1.93. A possible explanation of this mixed picture is that the sample is small and surveys were initially conducted at six-month intervals from October 1996 to April 2001. Then surveys were conducted at a monthly basis. Our sample includes only 25 observations which reduce the explanatory power of the model. Thus, a possible area of future research is to include a larger sample measured on a daily basis and try to isolate the interaction of arbitrageurs and noise traders to identify shifts in the discounts of closed end funds.

In this section we attempted to extend the three-factor of Fama and French’s (1993) model in order to explain the existence and persistence of the excess discount return. We added two more factors to these four risk measures namely market, size, book-to-market, and momentum. The first one is based on the results of Lee, Shleifer and Thaler (1991) that closed-end funds are subject to systematic “investor sentiment risk”. We measure this sentiment in two ways as the changes in retail flows and investor sentiment index. This sentiment measure excludes the funds during their first six month of trading. The reason is to eliminate the influence of premiums that correspond to underwriting fees and start-up costs.

The expense ratio can be proxied monthly as percentage of total assets. These include the annual management charge (or staff costs), auditors and custody fees, directors fees and marketing costs.

On average the six factor model can explain 67% of the variation in the excess discount return in the UK market by taking into consideration market, size, book-to-market effect, momentum, sentiment and expenses. In contrast, Fama and French (1993) three factor and four factor model can explain 42% of the variation of the

74

Page 75: Introduction to Econometrics 1

excess discount return. The coefficients of the six-factor model for UK are statistically significant for market, size effect and sentiment and statistically insignificant for book-to-market, momentum, and expenses. On the other hand, there is no autocorrelation in the times series. On average the Durbin/Watson statistic is 1.98. The six-factor model is very robust with emphasis on sentiment namely the interaction of arbitrageurs and noise traders.

On the other hand, on average the six-factor model can explain 66 % of the variation of the excess discount return in the US market by taking into consideration market effect, size, book-to-market, momentum, sentiment and expenses. In contrast, Fama and French (1993), three-factor model can explain 59% of the excess discount return variation and Carhart’s (1997) four factor model can explain 65% of the variation.

The other version of my six factor model applied in the US market can explain 33 % of the variation of the excess discount returns by changing the sentiment factor from retail flow to investor sentiment index. The only behavioural data available was for the US market from the Yale management school. The index was measured in percentage of respondents who support a particular view. Our sample includes only 25 observations which decrease the explanatory power of the model. Thus, a possible area of future research is to include a larger sample measured on a daily basis and try to isolate the interaction of arbitrageurs and noise traders to identify shifts in the discounts of closed end funds.

75

Page 76: Introduction to Econometrics 1

Heteroskedasticity

One of the assumptions of the simple and the multiple regression is that the variance of the error term, , is constant and common. This is known as the homoskedasticity assumption.

E( )2 = σ2

This assumption ensures that each observation is equally reliable, so that estimates of the regression coefficients and tests of hypotheses are not biased. If the errors do not have a common variance, we say that they are heteroskedastic. This leads to biased estimates of the standard errors, higher variance of the estimators and thus incorrect statistical tests and confidence intervals. To overcome this problem we use EViews 6 heteroskedastic tests. We will analyze various tests based mainly on the ordinary least squares, (OLS) residuals. We will also discuss the application of heteroskedasticity on the residuals of the likelihood ratio test. The presence of heteroskedasticity shows that the standard errors and the t-statistics are biased. To overcome the problem we transform the variables to natural logarithmic linear form, ln. Another alternative to eliminate the heteroskedasticity problem and get efficient estimates of the coefficients is to deflate the variables by using the reciprocal of the square root or the original value of one of the independent variable. If you have more than one independent variables that show heteroskedastic problem, then, use the square root of the mean value of the dependent variable as a transformed value. You could also use the ratio from a new variable. The problem that could arise is spurious correlation between the new variable related to the dependent and independent variables. The above methods are applied when the variance of the model is unknown. When the error variance is known, then, the transformed regression is obtained by dividing each dependent and independent variable by its own standard deviation. In EViews 6 you select the weighted least squares option because each variable is weighted or divided by the relevant dataset standard deviation.

I will give you an example of using square root reciprocal as deflators. This method is based on the square root transformation. Please consider the following ordinary least square,OLS, equation. After conducting a scatterplot, we have found that the independent variable x1 shows heteroskedastic patterns with the error term. We divide each part of the regression with the square root of x1, namely, .

76

Page 77: Introduction to Econometrics 1

Then, run the regression in Excel and obtain the estimated ordinary least squares, OLS residuals . Regress the estimated squared residuals on the independent variable x1 and the squared independent variable each time separately and choose the regression model with the highest R2.

I have included a graph to show the homoskedastic effect. There is constant or equal variance along the line between the dependent and independent variable. There are no substantial deviations.

y

* ********* * * * * * * * * ****** * * * * * 0 x

I have also included a graph to show the heteroskedastic effect. The variance is not constant and is usually revealed in cross – sectional data. It decreases and then increases along the line of the explanatory variable x.Thus, we check the squared residual values in relation to the independent or explanatory variables. To overcome the problem, we use ln linear form to regress the dependent variable in relation to the independent and check again the residual or error term.

y * * * * * * * * * * * * * * * * * * * * * * * * 0

77

Page 78: Introduction to Econometrics 1

x

To check for heteroskedasticity plot the squared residuals across each observations and against each independent variable.

For example, let’s consider the following dataset and the squared residuals plot. Then, plot in a line graph the squared residuals along the independent variable. In our case, it is income and it is expressed in thousands.

ObservationResiduals squared Incomes

1 44602.429 10312 42083.497 12143 36.062242 14144 5379.947 16485 23000.459 18216 66332.716 20147 84.647914 21458 4059.1177 22349 1745.0951 2245

10 16.634201 229811 103264.69 234512 25512.72 241113 4.1959599 246714 103464.45 251815 23159.766 283916 9754.9813 311517 6983.4371 323118 62439.153 335119 42570.8 356120 135202.23 380021 108.91533 412522 6519.9155 433123 56651.487 437824 4525.1809 624525 71088.105 8100

78

Page 79: Introduction to Econometrics 1

Squared residuals plot

020000400006000080000

100000120000140000160000

Observations

Squa

red

resi

dual

s

I have plotted the squared residuals against the independent variable incomes. Please check the variability and determine if it is constant or unequal.

Squared residuals plot

0

20000

40000

60000

80000

100000

120000

140000

160000

1031

1414

1821

2145

2245

2345

2467

2839

3231

3561

4125

4378

8100

Incomes

Squa

red

resi

dual

s

Series1Incomes

79

Page 80: Introduction to Econometrics 1

The most common heteroskedastic tests that are available in EViews 6 are the Breusch - Pagan – Godfrey test, the Harvey test, the Glejser test, the ARCH and White test. The Breusch - Pagan – Godfrey test regress the squared residuals on the original regressors. The Harvey test regresses the ln of the squared residuals on the original regressors. The Glejser test regresses the absolute residuals on the original regressors. The ARCH test regresses the squared residuals on lagged squared residuals and a constant. The White test regresses the squared residuals on the cross product of the original regressors and a constant.

First of all, you need to collect your data and define the dependent and independent variable, then transfer your data from Excel to EViews 6. Then, estimate the regression equation by selecting the method of least squares. The regression equation in EViews 6 will be as follows:

share c market

Press OK and you will get the output of the regression equation. Then go to the upper left and press view, then, select residual tests and then heteroskedasticity tests. Then, you have the option to select the following heteroskedasticy tests:

The Breusch - Pagan – Godfrey test The Harvey test The Glejser test The ARCH The White test

To sum up, the heteroskedasticity tests are related to test the residual or error term that has a constant variance. In addition to the test outptut, EViews 6 provide a residual graph. The most common heteroskedasticity tests are the ARCH and the White tests.

Let’s try together an example to facilitate your learning experience. I have attached the data that I will use in this example.

Share returns (y) Market returns (x)

3.526787 8.73209-4.34533 -5.198155.222709 6.21865-4.99619 -5.5393-3.04336 7.69808-2.375422 -4.997352.651303 5.42777-0.68924 -1.54240.205664 1.46392.4783 3.6528

0.237407 -0.1494

80

Page 81: Introduction to Econometrics 1

0.329728 0.16688-0.26869 -0.14440.064769 0.097873-0.5873 -0.09911

0.329225 -0.08344-0.11849 0.1227670.011541 -0.45767-0.18757 -0.53046-0.38752 -0.11118-0.26835 -0.289470.262798 -0.176760.355054 -1.15686-1.34302 -0.5771-0.77964 0.578182-0.04649 -0.053310.098381 -0.23054-0.09585 -0.66625-0.0059 -0.50071-0.05415 -0.53128

This is the regression output that you will get in EViews 6.

Dependent Variable: SHAREMethod: Least SquaresDate: 04/16/16 Time: 10:16Sample: 1 30Included observations: 30

Variable Coefficient Std. Error t-Statistic Prob.  

C -0.284046 0.271224 -1.047275 0.3039MARKET 0.422744 0.084654 4.993807 0.0000

R-squared 0.471080    Mean dependent var -0.127295Adjusted R-squared 0.452191    S.D. dependent var 1.993636S.E. of regression 1.475573    Akaike info criterion 3.680310Sum squared resid 60.96484    Schwarz criterion 3.773723Log likelihood -53.20465    Hannan-Quinn criter. 3.710194F-statistic 24.93811    Durbin-Watson stat 1.868822Prob(F-statistic) 0.000028

Then, press view. Select the residual tests and then the heteroskedasticity tests. In this case, I choose the Breusch - Pagan – Godfrey test. It regress the squared residuals on

81

Page 82: Introduction to Econometrics 1

the original regressors. At the bottom of the test, you can see the regressors. In our case, we have one regressor and it is the market. Thus, you estimate the error term. You square it and finally you regress it in relation to the explanatory variable, which in our case is the market. I have attached the relevant dataset to facilitate your learning experience. You will find a datset series in EViews 6 denoted as resid and it means the residual or the error term. This time series is very important to test heteroskedasticity and regresses it with the explanatory variable and a constant according to the relevant test.

Error termError term squared (

) Market returns (x)0.119395394 0.01425526 8.73209

-1.863797548 3.473741301 -5.198152.877858835 8.282071477 6.21865

-2.370438465 5.618978517 -5.5393-6.01363018 36.16374794 7.698080.021223476 0.000450436 -4.997350.640792535 0.410615073 5.427770.24684642 0.060933155 -1.5424

-0.129144582 0.016678323 1.46391.218147284 1.483882807 3.65280.584611161 0.341770209 -0.14940.543226718 0.295095267 0.166880.076400441 0.005837027 -0.14440.307440007 0.094519358 0.097873-0.26135563 0.068306766 -0.099110.648544973 0.420610581 -0.083440.11365722 0.012917964 0.122767

0.489064424 0.239184011 -0.457670.320724953 0.102864496 -0.53046

-0.056473111 0.003189212 -0.111180.138067899 0.019062745 -0.289470.621568434 0.386347318 -0.176761.128155735 1.272735363 -1.15686

-0.815008271 0.664238482 -0.5771-0.740016695 0.547624709 0.5781820.260092699 0.067648212 -0.053310.479886601 0.23029115 -0.230540.469849348 0.22075841 -0.666250.489818322 0.239921988 -0.500710.454491603 0.206562617 -0.53128

82

Page 83: Introduction to Econometrics 1

If you regress the squared error term as the dependent variable in relation to the market return as the independent variable, then, you will get the following output.

Heteroskedasticity Test: Breusch-Pagan-Godfrey

F-statistic 5.965804    Prob. F(1,28) 0.0212Obs*R-squared 5.269244    Prob. Chi-Square(1) 0.0217Scaled explained SS 24.11401    Prob. Chi-Square(1) 0.0000

Test Equation:Dependent Variable: RESID^2Method: Least SquaresDate: 04/16/16 Time: 10:17Sample: 1 30Included observations: 30

Variable Coefficient Std. Error t-Statistic Prob.  

C 1.710507 1.137900 1.503213 0.1440MARKET 0.867473 0.355158 2.442499 0.0212

R-squared 0.175641    Mean dependent var 2.032161Adjusted R-squared 0.146200    S.D. dependent var 6.699751S.E. of regression 6.190656    Akaike info criterion 6.548300Sum squared resid 1073.078    Schwarz criterion 6.641713Log likelihood -96.22450    Hannan-Quinn criter. 6.578183F-statistic 5.965804    Durbin-Watson stat 1.347827Prob(F-statistic) 0.021154

Please state the hypotheses:

H0: The error term is homosedasticH1: The error term is heteroskedastic

Please focus on the output of the first two sentences of the table that are bold. According to the above table, both the F-statistic and the Chi-squared statistics are significant. We reject the null hypothesis of homoskedasticity. Heteroskedasticity effect is present in the price change returns expressed as percentages. Specifically, the F-statistic is 5.9658 with a significant p-value of 0.02 and the probability Chi-square value is also significant with a numerical value of 0.02 and a Langrange LM statistic of 5.269. Please note the number 1 close to Prob. Chi-square(1), which mens that we have one explanatory variable.

Please feel free to e-mail me if you have more questions. Thanks for your participation and I wish you all the best in your future career in the Financial Services sector.

83

Page 84: Introduction to Econometrics 1

The Harvey test regresses the natural logarithmic ln of the squared residuals on the original regressors.I have attached the relevant dataset based on the above example. Thus, you regress the ln squared error as the dependent variable in relation to the market returns as the independent variable.

Error term Error term squared ln squared error ( ) Market returns (x)0.119395394 0.01425526 -4.250629302 8.73209

-1.863797548 3.473741301 1.245232198 -5.198152.877858835 8.282071477 2.114093115 6.21865

-2.370438465 5.618978517 1.726149889 -5.5393-6.01363018 36.16374794 3.588057179 7.698080.021223476 0.000450436 -7.705294746 -4.997350.640792535 0.410615073 -0.890099065 5.427770.24684642 0.060933155 -2.797977831 -1.5424

-0.129144582 0.016678323 -4.093645429 1.46391.218147284 1.483882807 0.39466217 3.65280.584611161 0.341770209 -1.073616671 -0.14940.543226718 0.295095267 -1.220457034 0.166880.076400441 0.005837027 -5.14353362 -0.14440.307440007 0.094519358 -2.35895062 0.097873-0.26135563 0.068306766 -2.683746461 -0.099110.648544973 0.420610581 -0.866047858 -0.083440.11365722 0.012917964 -4.349136402 0.122767

0.489064424 0.239184011 -1.430522103 -0.457670.320724953 0.102864496 -2.274342734 -0.53046

-0.056473111 0.003189212 -5.747981315 -0.111180.138067899 0.019062745 -3.960019381 -0.289470.621568434 0.386347318 -0.951018527 -0.176761.128155735 1.272735363 0.241168413 -1.15686

-0.815008271 0.664238482 -0.409114034 -0.5771-0.740016695 0.547624709 -0.602165063 0.5781820.260092699 0.067648212 -2.693434356 -0.053310.479886601 0.23029115 -1.468410902 -0.230540.469849348 0.22075841 -1.510686344 -0.666250.489818322 0.239921988 -1.427441457 -0.500710.454491603 0.206562617 -1.577151681 -0.53128

I have attached the output that you will get in EViews 6.

84

Page 85: Introduction to Econometrics 1

Heteroskedasticity Test: Harvey

F-statistic 1.264374    Prob. F(1,28) 0.2704Obs*R-squared 1.296157    Prob. Chi-Square(1) 0.2549Scaled explained SS 1.482117    Prob. Chi-Square(1) 0.2234

Test Equation:Dependent Variable: LRESID2Method: Least SquaresDate: 04/17/16 Time: 10:41Sample: 1 30Included observations: 30

Variable Coefficient Std. Error t-Statistic Prob.  

C -1.796732 0.442085 -4.064223 0.0004MARKET 0.155153 0.137982 1.124444 0.2704

R-squared 0.043205    Mean dependent var -1.739202Adjusted R-squared 0.009034    S.D. dependent var 2.416067S.E. of regression 2.405129    Akaike info criterion 4.657424Sum squared resid 161.9701    Schwarz criterion 4.750838Log likelihood -67.86137    Hannan-Quinn criter. 4.687308F-statistic 1.264374    Durbin-Watson stat 1.717438Prob(F-statistic) 0.270372

Please state the hypotheses:

H0: The error term is homosedasticH1: The error term is heteroskedastic

Please focus on the output of the first two sentences of the table that are bold. According to the above table, both the F-statistic and the Chi-squared statistics are not significant. We can not reject the null hypothesis of homoskedasticity. Heteroskedasticity effect is not present in the price change returns expressed as percentages. Specifically, the F-statistic is 1.26 with a unsignificant p-value of 0.27 and the probability Chi-square value is also not significant with a numerical value of 0.25 and a Langrange LM statistic of 1.296.

Please feel free to e-mail me if you have more questions. Thanks for your participation.

The Glejser test regresses the absolute residuals on the original regressors. I have attached the relevant dataset based on the above example. Thus, you regress the absolute error term as the dependent variable in relation to the market returns as the independent variable. In Excel, you use the function = abs() to get the absolute value.

Error termAbsolute error term (

) Market returns (x)0.119395394 0.119395394 8.73209

-1.863797548 1.863797548 -5.19815

85

Page 86: Introduction to Econometrics 1

2.877858835 2.877858835 6.21865-2.370438465 2.370438465 -5.5393

-6.01363018 6.01363018 7.698080.021223476 0.021223476 -4.997350.640792535 0.640792535 5.427770.24684642 0.24684642 -1.5424

-0.129144582 0.129144582 1.46391.218147284 1.218147284 3.65280.584611161 0.584611161 -0.14940.543226718 0.543226718 0.166880.076400441 0.076400441 -0.14440.307440007 0.307440007 0.097873-0.26135563 0.26135563 -0.099110.648544973 0.648544973 -0.083440.11365722 0.11365722 0.122767

0.489064424 0.489064424 -0.457670.320724953 0.320724953 -0.53046

-0.056473111 0.056473111 -0.111180.138067899 0.138067899 -0.289470.621568434 0.621568434 -0.176761.128155735 1.128155735 -1.15686

-0.815008271 0.815008271 -0.5771-0.740016695 0.740016695 0.5781820.260092699 0.260092699 -0.053310.479886601 0.479886601 -0.230540.469849348 0.469849348 -0.666250.489818322 0.489818322 -0.500710.454491603 0.454491603 -0.53128

I have attached the output that you will get in EViews 6.

Heteroskedasticity Test: Glejser

F-statistic 3.696155    Prob. F(1,28) 0.0648Obs*R-squared 3.498363    Prob. Chi-Square(1) 0.0614Scaled explained SS 6.036546    Prob. Chi-Square(1) 0.0140

Test Equation:Dependent Variable: ARESIDMethod: Least Squares

86

Page 87: Introduction to Econometrics 1

Date: 04/17/16 Time: 10:42Sample: 1 30Included observations: 30

Variable Coefficient Std. Error t-Statistic Prob.  

C 0.770168 0.208943 3.686019 0.0010MARKET 0.125378 0.065215 1.922539 0.0648

R-squared 0.116612    Mean dependent var 0.816658Adjusted R-squared 0.085063    S.D. dependent var 1.188406S.E. of regression 1.136738    Akaike info criterion 3.158544Sum squared resid 36.18087    Schwarz criterion 3.251957Log likelihood -45.37815    Hannan-Quinn criter. 3.188427F-statistic 3.696155    Durbin-Watson stat 1.113795Prob(F-statistic) 0.064765

Please state the hypotheses:

H0: The error term is homosedasticH1: The error term is heteroskedastic

Please focus on the output of the first two sentences of the table that are bold. According to the above table, both the F-statistic and the Chi-squared statistics are not significant. We can not reject the null hypothesis of homoskedasticity. Specifically, the F-statistic is 3.696 with a unsignificant p-value of 0.06 and the probability Chi-square value is also not significant with a numerical value of 0.06.

Please feel free to e-mail me if you have more questions. Thanks for your participation.

The ARCH test regresses the squared residuals on lagged squared residuals and a constant. I have attached the output that you will get in Eviews 6.

Heteroskedasticity Test: ARCH

F-statistic 0.383134    Prob. F(1,27) 0.5411Obs*R-squared 0.405756    Prob. Chi-Square(1) 0.5241

Test Equation:Dependent Variable: RESID^2Method: Least SquaresDate: 04/17/16 Time: 10:43Sample (adjusted): 2 30Included observations: 29 after adjustments

Variable Coefficient Std. Error t-Statistic Prob.  

87

Page 88: Introduction to Econometrics 1

C 1.853995 1.339443 1.384153 0.1776RESID^2(-1) 0.118251 0.191043 0.618978 0.5411

R-squared 0.013992    Mean dependent var 2.101744Adjusted R-squared -0.022527    S.D. dependent var 6.807298S.E. of regression 6.883546    Akaike info criterion 6.762617Sum squared resid 1279.347    Schwarz criterion 6.856913Log likelihood -96.05795    Hannan-Quinn criter. 6.792149F-statistic 0.383134    Durbin-Watson stat 2.026327Prob(F-statistic) 0.541119

Please state the hypotheses:

H0: The error term is homosedasticH1: The error term is heteroskedastic

Please focus on the output of the first two sentences of the table that are bold. According to the above table, both the F-statistic and the Chi-squared statistics are not significant. We can not reject the null hypothesis of homoskedasticity. Heteroskedasticity effect is not present in the price change returns expressed as percentages .

Please feel free to e-mail me if you have more questions. Thanks for the participation.

The White test regresses the squared residuals on the cross product of the original regressors and a constant. First of all, I have attached the simple regression equation

share c market by selecting the following option box:

The White test is very common as it provides the researcher with corrected standard errors and t-statistics. It corrects the heteroskedasticity problem and provide us with efficient and unbiased estimators.

White Heteroskedasticity-Consistent Standard Errors & Covariance

Dependent Variable: SHAREMethod: Least SquaresDate: 04/17/16 Time: 10:43Sample: 1 30Included observations: 30White Heteroskedasticity-Consistent Standard Errors &

88

Page 89: Introduction to Econometrics 1

Covariance

Variable Coefficient Std. Error t-Statistic Prob.  

C -0.284046 0.231710 -1.225872 0.2305MARKET 0.422744 0.172422 2.451802 0.0207

R-squared 0.471080    Mean dependent var -0.127295Adjusted R-squared 0.452191    S.D. dependent var 1.993636S.E. of regression 1.475573    Akaike info criterion 3.680310Sum squared resid 60.96484    Schwarz criterion 3.773723Log likelihood -53.20465    Hannan-Quinn criter. 3.710194F-statistic 24.93811    Durbin-Watson stat 1.868822Prob(F-statistic) 0.000028

I have attached the relevant dataset based on the above example. Thus, you regress the squared residual or squared error term as the dependent variable in relation to the market returns and market returns squared as the independent variables. If you have two independent variables, then, you run the auxiliary regression of the squared residuals on the original values, their squared values, and their cross – products. For example, consider the following multiple regression with two independent variables.

The regression of the squared residuals is as follows:

In our case, we are dealing with a simple regression equation with one independent variable:

Where : the dependent variable is the error term squared.

Error term squared () Market returns (x1)

Market retunrs squared ( )

0.01425526 8.73209 76.249395773.473741301 -5.19815 27.020763428.282071477 6.21865 38.671607825.618978517 -5.5393 30.6838444936.16374794 7.69808 59.260435690.000450436 -4.99735 24.973507020.410615073 5.42777 29.460687170.060933155 -1.5424 2.378997760.016678323 1.4639 2.143003211.483882807 3.6528 13.342947840.341770209 -0.1494 0.022320360.295095267 0.16688 0.027848934

89

Page 90: Introduction to Econometrics 1

0.005837027 -0.1444 0.020851360.094519358 0.097873 0.0095791240.068306766 -0.09911 0.0098227920.420610581 -0.08344 0.0069622340.012917964 0.122767 0.0150717360.239184011 -0.45767 0.2094618290.102864496 -0.53046 0.2813878120.003189212 -0.11118 0.0123609920.019062745 -0.28947 0.0837928810.386347318 -0.17676 0.0312440981.272735363 -1.15686 1.338325060.664238482 -0.5771 0.333044410.547624709 0.578182 0.3342944250.067648212 -0.05331 0.0028419560.23029115 -0.23054 0.0531486920.22075841 -0.66625 0.443889063

0.239921988 -0.50071 0.2507105040.206562617 -0.53128 0.282258438

I have attached the output that you will get in Eviews 6.

Heteroskedasticity Test: White

F-statistic 6.802671    Prob. F(2,27) 0.0041Obs*R-squared 10.05189    Prob. Chi-Square(2) 0.0066Scaled explained SS 46.00115    Prob. Chi-Square(2) 0.0000

Test Equation:Dependent Variable: RESID^2Method: Least SquaresDate: 04/17/16 Time: 10:44Sample: 1 30Included observations: 30White Heteroskedasticity-Consistent Standard Errors & Covariance

Variable Coefficient Std. Error t-Statistic Prob.  

C 0.219509 0.428405 0.512386 0.6125MARKET 0.313314 0.412730 0.759124 0.4544

MARKET^2 0.165266 0.116086 1.423648 0.1660

R-squared 0.335063    Mean dependent var 2.032161Adjusted R-squared 0.285808    S.D. dependent var 6.699751S.E. of regression 5.661950    Akaike info criterion 6.400053Sum squared resid 865.5574    Schwarz criterion 6.540173Log likelihood -93.00080    Hannan-Quinn criter. 6.444879F-statistic 6.802671    Durbin-Watson stat 1.728352Prob(F-statistic) 0.004051

90

Page 91: Introduction to Econometrics 1

Please state the hypotheses:

H0: The error term is homosedasticH1: The error term is heteroskedastic

The R2 value times the number of observations follows the x2 distribution with degrees of freedom equal to the number of the independent variables excluding the intercept. If the p-value of the calculated chi-square is below the 1% or 5% significance level, then, reject the null hypothesis of homoskedasticity or the existence of no heteroskedasticity. In contrast, if the p – value of the chi-square value is above the 5% significance level, then, do not reject the null hypothesis.

Please focus on the output of the first two sentences of the table that are bold. According to the above table, both the F-statistic and the Chi-squared statistics are significant. We reject the null hypothesis of homoskedasticity. Heteroskedasticity effect is present in the price change returns expressed as percentages. Specifically, the F-statistic is 6.80 with a significant p-value of 0.004 and the probability Chi-square value is also significant with a numerical p-value of 0.0066 .

Please feel free to e-mail me if you have more questions. Thanks for your participation.

I have also attached the graph that you will get in terms of the residual, actual and fitted once you perform the White test. You can see from the graph the variations and deviations between the actual and fitted numerical values.

91

Page 92: Introduction to Econometrics 1

-8

-6

-4

-2

0

2

4-6

-4

-2

0

2

4

6

5 10 15 20 25 30

Residual Actual Fitted

Exercise

You are given a datset that is related to household expenditures in relation to their disposable income. The dependent variable is household expenditures and the independent variable is disposable income. The dataset comprises of 25 U.K households. Please compute the regression equation and select from Excel to include the residual table. Then, check the residual values in relation to the explanatory variable for heteroskedasticity effect. If there are such effect, then, transform the values by using the natural logarithmic ln linear form and check again if heteroskedasticity persists. The effected will be revealed It as increases and decreases in the residual values are associated with increases in the independent numerical values.

Please find attached the monthly dataset expressed in thousands pounds:

Observations Expenditures (y)

Incomes (x)

1 1980 22342 3130 32313 3172 35614 1223 12145 4072 43316 6145 6245

92

Page 93: Introduction to Econometrics 1

7 3962 43788 2651 25189 1045 1031

10 3956 412511 8210 810012 3417 335113 3251 380014 1213 141415 1381 164816 2062 241117 2831 311518 2500 283919 1780 182120 2080 201421 1945 214522 2013 224523 1834 234524 2104 229825 2276 2467

Solution

Please insert your data in Excel and run the linear regression by using the data analysis toolbox and selecting residual output.

The output in Excel will be as follows:

SUMMARY OUTPUT

Regression StatisticsMultiple R 0.993172327R Square 0.986391271Adjusted R Square 0.985799587Standard Error 190.9462289Observations 25

ANOVA  df SS MS F Significance F

Regression 1 60782933 60782933 1667.092 5.7279E-23Residual 23 838590.6 36460.46Total 24 61621523      

93

Page 94: Introduction to Econometrics 1

  CoefficientsStandard

Error t Stat P-value Lower 95% Upper 95%Lower 95.0%

Intercept -203.1098322 83.0775 -2.44482 0.02257 -374.9684948 -31.25116954 -374.96849Incomes 1.00573905 0.024632 40.83003 5.73E-23 0.954783253 1.056694847 0.9547833

The regression equation is as follows:

R2 = 0.986

t-stat (-2.44) (40.83) RSS = 838590.6

Once you get the residual output, copy and past the numerical values of the independent variable incomes adjacent to the residuals. Then, use the chart wizard in Excel to plot your data in a line or scatter format. There is another way to do it. You can select the residual plot in the regression function in addition to residual output. This will help you to see the pattern of variation and deviations from the regression line.

RESIDUAL OUTPUT

ObservationPredicted

Expenditures ResidualsIncomes (x)

9 833.8071284 211.1929 10314 1017.857375 205.1426 1214

14 1219.005185 -6.00518 141415 1454.348122 -73.3481 164819 1628.340978 151.659 182120 1822.448615 257.5514 201421 1954.20043 -9.20043 2145

1 2043.711206 -63.7112 223422 2054.774335 -41.7743 224524 2108.078505 -4.0785 229823 2155.34824 -321.348 234516 2221.727017 -159.727 2411

94

Page 95: Introduction to Econometrics 1

25 2278.048404 -2.0484 24678 2329.341096 321.6589 2518

18 2652.183331 -152.183 283917 2929.767309 -98.7673 3115

2 3046.433038 83.56696 323112 3167.121724 249.8783 3351

3 3378.326925 -206.327 356113 3618.698558 -367.699 380010 3945.563749 10.43625 4125

5 4152.745993 -80.746 43317 4200.015729 -238.016 43786 6077.730535 67.26946 6245

11 7943.376473 266.6235 8100

I have attached the layout of the chart that you will get in Excel. In EViews 6 , you get automatically the chart of the residual output with the variations.

Residual plot

-500-400-300-200-100

0100200300400

0 5 10 15 20 25 30

Incomes

Resi

dual

s

As you can see from the chart there is strong evidence of heteroscedasticity. The residual or error term is not constant. To solve this problem, we transform the original data in ln linear format as follows:

ln Expenditures ln Incomes7.590852124 7.7115498.048788284 8.0805478.062117583 8.1777977.109062136 7.1016768.311889558 8.3735548.723394022 8.739536

95

Page 96: Introduction to Econometrics 1

8.284504227 8.3843477.882692206 7.831226.951772164 6.9382848.282988693 8.3248219.013108202 8.9996198.136518252 8.117014

8.08671792 8.2427567.100851909 7.2541787.230563153 7.4073187.631431665 7.7877977.948385285 8.0439847.824046011 7.9512077.484368643 7.5071417.640123173 7.6078787.573017256 7.6708957.607381426 7.7164617.514254653 7.7600417.651595574 7.7397947.730174795 7.810758

Then, we do the same thing as we did above. We run again the regression equation by using ln numbers and the residual output and plot.

SUMMARY OUTPUT

Regression StatisticsMultiple R 0.988294R Square 0.976724Adjusted R Square 0.975712Standard Error 0.07666Observations 25

ANOVA

  df SS MS FSignificance

FRegression 1 5.672052 5.672052459 965.1566 2.76E-20Residual 23 0.135167 0.005876821Total 24 5.807219      

  CoefficientsStandard

Error t Stat P-value Lower 95%Upper 95%

Lower 95.0%

Intercept -0.19467 0.258333-

0.753551944 0.458763 -0.72907 0.339735 -0.72907ln Incomes 1.015243 0.032679 31.06697001 2.76E-20 0.947641 1.082845 0.947641

The regression equation is as follows:

R2 = 0.9767

96

Page 97: Introduction to Econometrics 1

t-stat (0.75) (31.07) RSS = 0.135

Could you please see the differences with the previous regression and the following chart. Everything is different in terms of R2, residual sum of squares, coefficient, intercept, standard errors and t-statistics. So, please be very careful with the heteroskedastic problem as you will get biased estimators that will distrort the interpretation of the results.

RESIDUAL OUTPUT

ObservationPredicted ln

Expenditures Residuals ln Incomes1 7.634427 -0.04358 6.9382844842 8.00905 0.039738 7.1016759723 8.107782 -0.04566 7.2541778464 7.015258 0.093804 7.407317715 8.306523 0.005366 7.507141086 8.678084 0.04531 7.6078780737 8.317481 -0.03298 7.6708948318 7.755923 0.126769 7.711548989 6.849376 0.102396 7.7164608

10 8.257048 0.025941 7.73979445811 8.942132 0.070976 7.76004068112 8.046073 0.090445 7.78779687813 8.173732 -0.08701 7.81075811714 7.170085 -0.06923 7.83122021515 7.325559 -0.095 7.95120715616 7.711838 -0.08041 8.04398443117 7.97193 -0.02354 8.08054696618 7.877739 -0.05369 8.11701408819 7.426904 0.057465 8.17779668320 7.529176 0.110947 8.24275634621 7.593154 -0.02014 8.32482129922 7.639414 -0.03203 8.373553741

97

Page 98: Introduction to Econometrics 1

23 7.683658 -0.1694 8.38434727824 7.663103 -0.01151 8.73953642325 7.735149 -0.00497 8.999619341

According to the following chart, you can see that the residual or error term in relation to the independent variable becomes homoskedastic. Thus, transforming the data to ln has helped us to solve the problem of heteroskedasticity.

Residual plot

-0.2

-0.15

-0.1

-0.05

0

0.05

0.1

0.15

0 2 4 6 8 10

Ln Incomes

Resi

dual

s

98

Page 99: Introduction to Econometrics 1

The likelihood ratio test and analysis of heteroskedasticity on the residuals

Please find attached the following dataset of n = 30. The dependent variable is share returns and the independent variables are the risk free rate, the size, the market returns, the age, the hurdle rate and the management performance. Run the logit regression equation in EViews 6 and then check for heteroskedasticity by running the regression equation of the squared residuals in relation to the independent variable, the squared independent variable and their cross products. The likelihood ratio test is applicable for large sample test.

Solution

Dependent Variable: share returnsMethod: ML - Binary Logit Date: 04/22/16 Time: 17:05Sample: 2 73Included observations: 72Convergence achieved after 7 iterationsCovariance matrix computed using second derivatives

Variable Coefficient Std. Error z-Statistic Prob.  

C -0.397262 3.719469 -0.106806 0.9149Rf 0.607784 1.340246 0.453487 0.6502

Size -0.195838 0.633367 -0.309202 0.7572Mr -2.947545 1.216692 -2.422590 0.0154

Age -0.038684 0.133300 -0.290200 0.7717Hr 0.089281 0.161129 0.554100 0.5795Mp -0.048030 0.046253 -1.038402 0.2991

McFadden R-squared 0.424735    Mean dependent var 0.069444S.D. dependent var 0.255992    S.E. of regression 0.218670

99

Page 100: Introduction to Econometrics 1

Akaike info criterion 0.484607    Sum squared resid 3.108085Schwarz criterion 0.705950    Log likelihood -10.44587Hannan-Quinn criter. 0.572725    Restr. log likelihood -18.15837LR statistic 15.42500    Avg. log likelihood -0.145081Prob(LR statistic) 0.017197

Obs with Dep=0 67     Total obs 72Obs with Dep=1 5

I have attached the table with the relevant data. Based on the following data check for White tests heteroskedasticity. You will regress the squared residuals on all the explanatory variables, their squares and their cross products. Therefore, you will regress the following.

will be regressed on x1 , x2, x3, x4, x5, x6, and so on

To sum up, the dependent variable is the squared residuals and the independent variables are x1, x2, x3, x4, x5, x6, their squared values and their cross products.

If you have difficulty please e-mail me.

Residuals Risk free (x1)

Size (x2) Market return (x3)

Age (x4) Hurdle rate (x5)

Mgt performance (x6)

-0.02516 2.755836 0.823441 5.397865001 40 10.34 1-0.00662 2.725726 4.458782 6.857003219 25 5.677 1.5-0.02722 3.092531 4.232887 7.989673776 86 10.35 0.67-0.03513 3.089416 2.137064 3.379482658 36 2.345 1-0.00133 2.802173 1.658494 -0.45807375 50 3.13 1.75-0.01652 2.409178 0.20628 7.497715958 13 4.31 1.75-0.00979 2.220845 0.028879 -0.92491506 14 10.45 1.5-0.00098 2.522834 0.774512 -8.40360195 57 0 1.5-0.00015 2.669247 0.259913 1.42834889 72 2.144 2.25-0.00063 2.969892 0.325758 14.91365445 71 5.678 2.45-0.00521 1.994892 0.008251 7.948571731 3 8.99 2.31

100

Page 101: Introduction to Econometrics 1

-0.00434 1.861559 0.008251 6.81028783 3 2.76 2.31-0.03533 2.469416 0.929169 5.783098584 10 15 1.67-0.02719 2.503892 0.633557 -0.67634733 2 1.23 1.23-0.02316 2.265607 0.711829 6.876247514 10 1.34 1.6-0.19276 2.606003 0.008251 6.928020847 6 2.34 1.3-0.00763 2.639092 2.19223 -2.23667513 48 3.456 1.670.914445 2.572749 1.91165 8.918700935 17 4.311 1.32-0.36925 2.540823 0.008251 -1.6115003 32 2.67 0.32

-0.1796 2.259086 0.008251 2.446540443 27 3.21 0.480.529751 2.608419 0.148522 -1.40503816 24 7.89 0.130.318986 2.558915 0.191428 7.779562137 22 2.134 0.450.319484 3.042134 0.145221 5.438933357 24 1.34 0.67-0.01753 2.168168 0.167979 6.333734893 18 5 1.50.919461 3.415726 0.609765 -0.93701863 12 10 1.23-0.00194 2.244892 0.864844 0.084759516 24 1.33 2.34-0.00017 3.257392 0.072066 11.41224438 100 15 2.22-0.00235 2.592576 0.960441 -0.28480562 46 11 1.5-0.00275 2.533267 0.008251 0.89586715 48 14 1.5-0.00384 2.678596 0.008251 3.650721789 48 12 1.5-0.00799 2.595078 0.243411 0.080447353 24 13 1.5

-0.0538 3.029892 0.115517 7.968449474 4 13 1.5-0.00541 2.502834 0.406785 -3.33913367 24 10 1.5

-0.0178 2.801892 0.187146 1.174795116 12 15 1.5-0.00335 2.985979 0.553095 -5.23778486 36 10 1.5-0.03195 2.873892 0.452711 2.603855611 2 12 1.5-0.05187 2.738983 0.258661 6.248583233 24 15 1-0.00569 2.188128 0.742609 -7.40827572 10 10 1.5

-0.0009 2.984753 0.142399 -5.46280949 84 12 1.25-0.15083 2.774178 7.447185 12.74202837 36 20 0.23-0.00748 2.867511 0.889802 1.764284504 56 15 1-0.78198 3.143092 0.123768 0.320108901 5 11 0.33-0.01982 3.162392 0.201396 0.538832773 56 21 0.4-0.04005 2.054598 0.799455 -4.55270147 15 12 0.5

-0.002 3.108503 0.44564 -6.16328308 90 13 0.1-0.00483 2.633226 0.272711 6.60581503 90 14 0.5-0.05627 2.47726 0.150642 6.444594868 36 15 0.8-0.01541 2.887051 1.918407 3.828884296 14 16 1.5-0.00233 2.370292 0.75086 -0.03430596 14 8 2-0.00399 2.802692 0.75086 3.07357262 14 6 2-0.00422 2.830845 0.75086 3.52403273 14 5 2-0.00349 1.625607 0.068069 -3.02022138 24 4 1.5

-0.0038 1.647749 0.169617 -1.99379246 24 3 1.5-7.53E-05 2.855726 0.493801 -4.70154363 84 10 2-0.00448 3.073577 0.75086 -6.45093819 36 2 1.5-0.01442 2.516488 1.523075 5.770408932 24 3 1.5-0.04197 2.503017 0.336658 -5.22058709 4 15 1-0.05222 2.515186 0.885383 8.476452598 4 4 1.5-0.01357 2.581392 0.141228 5.945917922 24 10 1.5-0.01134 2.804629 0.082512 -3.63438734 7 17.5 1.5-0.00305 2.831559 0.082512 -3.01410765 36 11.34 1.5

-0.0007 2.935229 0.082512 5.530374864 84 11 1.5-0.01616 2.551392 0.082512 -0.04525887 3 12 1.5-0.00178 2.944589 0.082512 9.559315003 72 14 1.5

-0.0375 3.145347 0.082512 10.42954088 12 15 1.5

101

Page 102: Introduction to Econometrics 1

-0.04155 4.196281 1.683867 9.71768063 60 11 1-0.23705 4.112035 0.660097 10.7369913 24 12 1

-9.67E-17 4.090387 0.421637 -0.59326216 72 13 12-0.0583 2.403577 2.988672 10.65900268 6 9 1.2

-0.04783 2.703371 0.386157 2.721150814 10 8 1.2-0.04177 2.461197 2.880581 8.251326888 10 7 1.2-0.10652 2.928787 1.489131 3.99353016 5 6 1

Exercise

You are given two datasets of 10 observations each. You are required to calculate the Goldfeld – Quandt test to find out if the error term or residuals are homoskedastic or heteroskedastic.

The data are related to the monthly salary and expenditures of 10 employees at JP morgan. The dependent variable is annual salary and the independent variable is expenditures. The data are expressed as thousands and nominated in Pounds.

Statistician 1y x

8000 50007000 40456000 30005000 24674000 10003000 27892000 17895000 45679000 80007010 3145

Statistician 2y x

9076 90008367 8100

102

Page 103: Introduction to Econometrics 1

6456 62564000 3567

3134 27892346 17785789 37282145 20006678 57898100 7567

a) State the hypotheses of the residuals or error term.b) Calculate the regression equations from the data gathered from the two statisticians by stating the R2 , the standard errors and the mean square variance of the residuals.c) Calculate the F-ratio for both datasets for 5% significance level.d) Accept or reject the hypothesis of homoskedasticity.e) Transform the variables by using the natural logarithmic ln linear format.f) Run two regressions for each dataset.g) Calculate the F-ratio for both datasets.h) Accept or reject the hypothesis of homoskedasticity.

Solution

a) H0: The residuals are constant and therefore homoskedastic.

H1: The residuals are not constant and therefore heteroskedastic.

b) I have attached the Excel regression equation for the first dataset.

R2 = 0.616

SE (997.088) (0.2468)

SUMMARY OUTPUT

Regression StatisticsMultiple R 0.784886R Square 0.616046Adjusted R Square 0.568052Standard Error 1460.238Observations 10

103

Page 104: Introduction to Econometrics 1

ANOVA

  df SS MS FSignificance

FRegression 1 27369739.21 27369739.21 12.83582 0.00716Residual 8 17058350.792529 2132293.849066Total 9 44428090      

  Coefficients Standard Error t Stat P-value Lower 95%Upper 95%

Intercept 2434.897 997.0883717 2.442007212 0.040438 135.6056 4734.188X 0.884337 0.24683455 3.58271121 0.00716 0.315135 1.453539

c ) I have attached the Excel regression equation for the second dataset.

R2 = 0.95

SE (429.2888) (0.076117)

SUMMARY OUTPUT

Regression Statistics

Multiple R 0.975255

R Square 0.951122Adjusted R Square 0.945012

Standard Error 600.8309

Observations 10

ANOVA

  df SS MS FSignificance

FRegression 1 56197292 56197292.31 155.6721085 1.59E-06Residual 8 2887983 360997.823Total 9 59085275      

104

Page 105: Introduction to Econometrics 1

  CoefficientsStandard

Error t Stat P-value Lower 95%Upper 95%

Intercept 806.0896 429.2888 1.877732704 0.097244025 -183.853 1796.032x 0.9497 0.076117 12.47686293 1.59218E-06 0.774174 1.125225

d) The F-ratio test for the Goldfeld – Quandt test is as follows:

From the F-distribution with 5 % significance level and n-2 degrres of freedom for each dataset or 10 - 2 = 8, the F-value from the tables is 3.44 and the F-ratio test is 0.1693. The F-ratio test is significant and lies inside the critical region. The sample evidence suggests that we can not reject the homoskedasticity hypothesis.

I have also attached the White’s test for the first datatset.

Heteroskedasticity Test: White

F-statistic 0.637810    Prob. F(2,7) 0.5566Obs*R-squared 1.541420    Prob. Chi-Square(2) 0.4627Scaled explained SS 0.326289    Prob. Chi-Square(2) 0.8495

Test Equation:Dependent Variable: RESID^2Method: Least SquaresDate: 04/23/16 Time: 16:09Sample: 1 10Included observations: 10

Variable Coefficient Std. Error t-Statistic Prob.  

C 1245431. 2016142. 0.617730 0.5563X 487.0253 1007.909 0.483204 0.6437

X^2 -0.078642 0.107989 -0.728242 0.4901

R-squared 0.154142    Mean dependent var 1705835.Adjusted R-squared -0.087532    S.D. dependent var 1462453.S.E. of regression 1525116.    Akaike info criterion 31.55636Sum squared resid 1.63E+13    Schwarz criterion 31.64714Log likelihood -154.7818    Hannan-Quinn criter. 31.45678F-statistic 0.637810    Durbin-Watson stat 0.863562

105

Page 106: Introduction to Econometrics 1

Prob(F-statistic) 0.556597

Please focus on the output of the first two sentences of the table that are bold. According to the above table, both the F-statistic and the Chi-squared statistics are not significant. We cannot reject the null hypothesis of homoskedasticity.

Please make yourself comfortable with the different tests that measure heteroskedasticity.

Please complete (e), (f,) (g), (h) questions……….. please feel free to e-amil me for different levels of difficulties.

Good luck……………

Exercise of Ramsey’s test

You are given one dataset of 10 observations each. You are required to calculate the Ramsey’s test to find out if the error term or residuals are homoskedastic or heteroskedastic.

The data are related to the monthly salary and expenditures of 10 employees at JP morgan. The dependent variable is annual salary and the independent variable is expenditures. The data are expressed as thousands and nominated in Pounds.

Statistician y x

8000 50007000 40456000 30005000 24674000 10003000 27892000 17895000 45679000 80007010 3145

106

Page 107: Introduction to Econometrics 1

To estimate the Ramsey’s test you regress the estimated error term or estimated residuals from the regression equation on the estimated square and cube dependent variable namely . The dependent variable in our case is annual salary.

The regression equation and the hypotheses testing will be as follows:

The hypotheses testing are as follows:

H0: = 0 and the error term is homoskedastic.

H1: and the error term is heteroskedastic.

Solution

The first step is to calculate the estimated residuals based on the above dataset. I have included the output in Excel in terms of the estimated dependent variable and the estimated error term .

RESIDUAL OUTPUT

Observation Predicted y Residuals1 6856.58154 1143.4182 6012.0398 987.96023 5087.90772 912.09234 4616.55615 383.44385 3319.23391 680.76616 4901.31264 -1901.317 4016.97573 -2016.988 6473.66366 -1473.669 9509.59227 -509.592

10 5216.13658 1793.863

107

Page 108: Introduction to Econometrics 1

I have included the dataset that will be used to solve the above regression, which is as follows:

RESIDUAL OUTPUT                 Observation Predicted y Residuals y squared y cube

1 6856.58154 1143.418 47012710 3.22346E+112 6012.0398 987.9602 36144623 2.17303E+113 5087.90772 912.0923 25886805 1.3171E+114 4616.55615 383.4438 21312591 983907717265 3319.23391 680.7661 11017314 365690412316 4901.31264 -1901.31 24022866 1.17744E+117 4016.97573 -2016.98 16136094 648182978918 6473.66366 -1473.66 41908321 2.713E+119 9509.59227 -509.592 90432345 8.59975E+11

10 5216.13658 1793.863 27208081 1.41921E+11

I have attached the output of the above equation.

SUMMARY OUTPUT

Regression StatisticsMultiple R 0.209095R Square 0.043721Adjusted R Square -0.2295Standard Error 1526.553Observations 10

ANOVA

  df SS MS FSignificance

FRegression 2 745805 372902.5 0.160019 0.855158Residual 7 16312546 2330364Total 9 17058351      

  CoefficientsStandard

Error T Stat P-value Lower 95%Upper 95%

Intercept -1094.04 2255.252 -0.48511 0.642413 -6426.86 4238.78y squared 9.05E-05 0.000166 0.543666 0.603546 -0.0003 0.000484y cube -8.8E-09 1.57E-08 -0.55977 0.593087 -4.6E-08 2.84E-08

108

Page 109: Introduction to Econometrics 1

Conclusion

The coefficients are not significant different from zero. All the p-values are aboe the 5% significance level. The coefficients to be significance they must be below the 5% significance level. We accept H0 and the error term is homoskedastic.

Exercise

You are given the following information related to the intercept, the independent variable, the number of observations, the coefficients, the standard errors, the t-statistics, and the R2.

Variable Coefficients Standard errors, (SE) t-statisticsIntercept 0.234 0.045 5.2Independent variable

8.221 8.214 0.9996

R2 = 0.654 n = 30 2 = 17.34

After plotiing the residuals against the independent variable, it was found that the variance of the residuals increased with x.

In contrast, by transforming the variables by ln function, it was found that there was no heteroskedasticity effect. I have attached all the information related to the intercept, the independent variable, the number of observations, the coefficients, the standard errors, the t-statistics, and the R2.

Variable Coefficients Standard errors, (SE) t-statisticsIntercept 0.122 0.031 3.935Independent variable

2.221 3.214 0.691

R2 = 0.721 n = 30 = 10.56

It is required the following:

(a) Please compare both tables and comment on the changes related to the intercept, the independent variable, the number of observations, the coefficients, the standard errors, the t-statistics, and the R2.

(b) Please explain the difference between heteroskedastic and homoskedastic effect.

(c) Please comment on the changes of the variance and R2 in the regression equation by using natural logarithmic ln. Please comment on the changes of the coefficients and the standard errors. In which table are they biased and unefficient?

109

Page 110: Introduction to Econometrics 1

Exercise

I will give you an example of using square root reciprocal as deflators. This method is based on the square root transformation. Please consider the following ordinary least square,OLS, equation. After conducting a scatterplot, we have found that the independent variable x1 shows heteroskedastic patterns with the error term. We divide each part of the regression with the square root of x1, namely, .

Then, run the regression in Excel and obtain the estimated ordinary least squares regression which will have different coefficients after the transformation from the original regression equation. Check the residuals . Regress the estimated squared residuals on the independent variable x1 and the squared independent variable each time separately and choose the regression model with the highest R2. Try different heteroskedastic tests in order to accept or reject the null hypothesis of homoskedasticity. For example, use the Breusch – Pagan, the White and the Glejser test.

Let’s take an example to fully understand the application of the square root transformation. Divide each variable with the square root of the first independent variable .

110

Page 111: Introduction to Econometrics 1

y x1 x2

9076 9000 92348367 8100 87896456 6256 67894000 3567 38903134 2789 32452346 1778 25675789 3728 46782145 2000 23456678 5789 57438100 7567 8231

Run the regression. I have attached the output in EViews 6.

……………………

Dependent Variable: YMethod: Least SquaresDate: 09/02/16 Time: 16:42Sample: 1 10Included observations: 10

Variable Coefficient Std. Error t-Statistic Prob.  

C 345.4160 577.5074 0.598115 0.5686X1 0.177116 0.668650 0.264886 0.7987X2 0.786860 0.676766 1.162676 0.2831

R-squared 0.959033    Mean dependent var 5609.100Adjusted R-squared 0.947328    S.D. dependent var 2562.232S.E. of regression 588.0400    Akaike info criterion 15.83479Sum squared resid 2420537.    Schwarz criterion 15.92557Log likelihood -76.17396    Hannan-Quinn criter. 15.73521F-statistic 81.93496    Durbin-Watson stat 2.743481Prob(F-statistic) 0.000014

Construct the tables with the relevant ratios.

95.67 0.01 94.87 97.33

111

Page 112: Introduction to Econometrics 1

92.97 0.01 90 97.6681.62 0.01 79.09 85.8366.97 0.02 59.72 65.1359.34 0.02 52.81 61.4555.64 0.02 42.17 60.8894.81 0.02 61.06 76.6247.96 0.02 44.72 52.4487.77 0.01 76.09 75.4893.12 0.01 86.99 94.62

Run the regression. I have attached the output in EViews 6.

……………..

Dependent Variable: y / sqrt(x1)Method: Least SquaresDate: 09/02/16 Time: 19:17Sample: 1 10Included observations: 10

Variable Coefficient Std. Error t-Statistic Prob.  

1/sqrt(x1) 1083.989 1522.429 0.712013 0.5032x1 / sqrt(x1) 0.422835 0.782749 0.540193 0.6085x2 / sqrt(x1) 0.819956 0.697460 1.175631 0.2843

R-squared 0.825097    Mean dependent var 77.58700Adjusted R-squared 0.737646    S.D. dependent var 18.33818S.E. of regression 9.392916    Akaike info criterion 7.606963Sum squared resid 529.3613    Schwarz criterion 7.727997Log likelihood -34.03482    Hannan-Quinn criter. 7.474189F-statistic 9.434911    Durbin-Watson stat 2.490561Prob(F-statistic) 0.010907

Could you please see the difference in the coefficients, t-statistics and standard errors between the two regressions.

Test the residuals for heteroskedasticity. Compare the Breusch – Pagan- Godfrey, the White and the Glejser test. Test the null hypothesis for homoskedasticity.

To facilitate you, I have attached the Breusch – Pagan- Godfrey test. Once you have run the regression equation, then, press view---- then---- residual tests ----- heteroskedasticity ---- select the Breusch – Pagan – Godfrey test.

112

Page 113: Introduction to Econometrics 1

Heteroskedasticity Test: Breusch-Pagan-Godfrey

F-statistic 0.020362    Prob. F(3,6) 0.9956Obs*R-squared 0.100786    Prob. Chi-Square(3) 0.9917Scaled explained SS 0.039285    Prob. Chi-Square(3) 0.9980

Test Equation:Dependent Variable: RESID^2Method: Least SquaresDate: 09/02/16 Time: 19:24Sample: 1 10Included observations: 10

Variable Coefficient Std. Error t-Statistic Prob.  

C -57.40305 525.3184 -0.109273 0.9165X1 0.907957 8.338245 0.108891 0.9168X2 -0.110792 7.429710 -0.014912 0.9886

XSQURE 3761.196 16217.71 0.231919 0.8243

R-squared 0.010079    Mean dependent var 52.93613Adjusted R-squared -0.484882    S.D. dependent var 82.11204S.E. of regression 100.0582    Akaike info criterion 12.33856Sum squared resid 60069.90    Schwarz criterion 12.45959Log likelihood -57.69278    Hannan-Quinn criter. 12.20578F-statistic 0.020362    Durbin-Watson stat 2.413581Prob(F-statistic) 0.995627

Exercise

It is given the following dataset:

Share returns (y) Market returns (x)

3.526787 8.73209-4.34533 -5.198155.222709 6.21865-4.99619 -5.5393-3.04336 7.69808-2.375422 -4.997352.651303 5.42777-0.68924 -1.5424

113

Page 114: Introduction to Econometrics 1

0.205664 1.46392.4783 3.6528

0.237407 -0.14940.329728 0.16688-0.26869 -0.14440.064769 0.097873-0.5873 -0.09911

0.329225 -0.08344-0.11849 0.1227670.011541 -0.45767-0.18757 -0.53046-0.38752 -0.11118-0.26835 -0.289470.262798 -0.176760.355054 -1.15686-1.34302 -0.5771-0.77964 0.578182-0.04649 -0.053310.098381 -0.23054-0.09585 -0.66625-0.0059 -0.50071-0.05415 -0.53128

Perform and compare the following heteroskedasticy tests:

The Breusch - Pagan – Godfrey test The Harvey test The Glejser test The ARCH The White test

Which tests show heteroskedastic effect and which tests show homoskedastic effect?

To facilitate your learning, calculate the absolute and the squared values of the residuals and plot them against the independent variable which is market returns. What conclusions you draw by performing the various tests?Would you use the weighted least squares, WLS, transformation to eliminate heteroskedasticity? Please compare the R2 of the original regression with the WLS if it is applicable.

Develop a suitable model with more independent variables that could capture the deviations of share returns. Test the model with the above mentioned heteroskedastic tests. Would you transform your data to eliminate the heteroskedastic problem? Will you use a dummy variable? Please interpret and explain the investment reasoning of the intercept and the slope. Based on the White test check the t-statistics and the p – values of the error term. If there is heteroskedastic problem, click the heteroskedastic corrected standard errors and t-statistics in EViews 6 and compare your results with the original regression.

114

Page 115: Introduction to Econometrics 1

I have attached a recent article that is testing the ARCH and GARCH heteroskedasticity effect on the total returns of Jeffries Commodity Performance Index. I am trying to support you with research skills that are needed for postgrasuate studies.

An application of ARCH and GARCH models to test total returns of Jeffries Commodity Performance Index.

Abstract

Autoregressive Conditional Heteroskedastic models (ARCH), and Generalized Autoregressive Conditional Heteroskedastic models, (GARCH) take into account the non-linearity that arises in the financial time series. Well known anomalies such as the calendar effects, January effect and seasonality’s have been studied indicating the weakness of the Efficient Market Hypothesis, (EMH) and showing the direction of a non random path for price changes. In this paper, we are going to test the volatility of the Thomson Reuters Jeffries Commodity Performance Index returns,(TR/JCPI), for changing conditional variances to spot any anomaly that contradict the validity of the Efficient Market Hypothesis. The data that we will use are daily returns starting from 07/07/2008 to 10/27/2010, which total to 585 observations.

Keywords: Autoregressive Conditional Heteroskedastic models, Generalized Autoregressive Conditional Heteroskedastic models, Jeffries Commodity Performance Index total returns, Efficient Market Hypothesis.

115

Page 116: Introduction to Econometrics 1

Introduction

The study will focus on modeling the total returns of the Thomson Reuters Jeffries Commodity Performance Index by using Autoregressive Conditional Heteroskedastic models and Generalized Conditional Heteroskedastic models. Using EViews 6 the models will be tested to validate the hypotheses that will be formulated. The existence of seasonality in terms of January effect’s will shed light to determine if luck or skills allow investors to outperform the market. The analysis will be based on the Thomson Reuters Jeffries Commodity Performance Index,(TR/JCPI). According to Wikipedia, it consists of 19 commodities, as quoted on the New York Mercantile Exchange, (NYMEX), Chicago Board of Trade, (CBOT), London Metal Exchange, (LME), Chicago Mercantile Exchange, (CME) and Commodity Exchange, (COMEX) exchanges. They are divided into 4 groups, which are petroleum, liquidity assets, highly liquid assets, diverse commodities. The commodities are: Aluminum, cocoa, coffee, copper, corn, cotton, crude oil, gold, heating oil, lean hogs, live cattle, natural gas, nickel, orange juice, silver, soybeans, sugar, unleaded gas and wheat.

Models which include a changing conditional variance are called autoregressive conditional heteroskedastic models. Engle (1982), suggested this type of model for estimating time series with conditional heteroskedasticity. Since then, different types of ARCH models appeared in the literature with different specifications such as mean reversion, mean dependency and variance dependency. For example, GARCH models capture all the history of shocks in a series, TARCH and EGARCH models allow for asymmetric shock to volatility and ARCH – M models are used in financial markets where the expected return on an asset is related to expected risk.

The models will be used to check the validity of the Efficient Market Hypothesis. Fama (1970) has defined the EMH into three types of efficiency. Strong, semi – strong and weak forms. The weak forms suggest that current prices of stocks and commodities fully reflect the information implied by all historical movements. Semi – strong efficient market is when prices reflect not only all historical data, but also relevant information that is related to the investment vehicle. A market is strong form efficient if stock and commodity prices fully reflect not only publicly available information but also private information. No investor could make abnormal profits by using any information. Most of the studies that have done show that markets are efficient in the weak and semi – strong forms. Fama (1970), classified the empirical approaches to test market efficiency based on predictability of returns, event studies and private information. The weak form of efficiency is based on the random walk hypothesis.

The rest of the paper is organized as follows. Section 1 describes the methodology and the data. Section 2 is an analysis of statistical and econometric tests and Section 3 summarizes and concludes.

116

Page 117: Introduction to Econometrics 1

1.Methodology and data description

The methodologies that will be used are autoregressive volatility models and specifically autoregressive and generalized autoregressive conditionally heteroscedastic models. Heteroskedasticity is a violation of the OLS assumption that the variance error term is constant for all observations. This leads to biased estimates of the standard errors, inefficient estimates of the coefficients and incorrect confidence intervals and statistical tests. In other words, we are checking if the variance of the errors is not constant and is creating volatility clusters. In addition, we are trying to spot if the January effect has affected the time series of the commodity index by showing higher positive returns in the month of January that could explain variance persistence.

The mean equation of the ARCH (1,5) is as follows:

(1)

Where:

rt: is the expected return and in our case are logarithmic and percentage daily returns of Jeffries Commodity Index.

The conditional variance equation is estimated by regressing the squared residuals on a constant and p lags:

(2)

Where:

The null hypothesis that there is no ARCH effect of order p is H0: The alternative hypothesis is that H1:

The mean equation of the GARCH(1,1) that we will use is as follows:

(3)

Where:

117

Page 118: Introduction to Econometrics 1

yt is the dependent variable and in our case is logarithmic and percentage daily returns of Jeffries Commodity Index.

is a constant. is the error term.

The variance equation is as follows:

(4)

Where:

According to Brooks, (2002), the log –likelihood function, (LIF) of the model that will be maximised is calculated as follows:

(5)

The hypotheses that we are going to formulate and test are as follows:

The null hypothesis, H0, states that by using autoregressive volatility models, returns expressed as percentages and logarithmic of the Thomson Reuters Jeffries Commodity Performance Index returns,(TR/JCPI) are constant and similarly distributed over time.

The alternative hypothesis, H1, states that by using autoregressive volatility models, returns expressed as percentages and logarithmic of the Thomson Reuters Jeffries Commodity Performance Index returns,(TR/JCPI) are not constant and similarly distributed over time.

To test for non-normality the Jarque – Bera statistic is analysed in relation to a histogram and descriptive statistics. Then, we will run an Augmented Dickey – Fuller’s, (ADF), statistic to test for stationarity and calculate and compare the critical values. Finally, we will test for seasonality by using a t-test paired two sample of means. The sample will be divided into sub-sample of two periods 2009-2010 to test for the persistence of the January effect. If seasonality exists, we would expect to find significantly positive mean returns in January and this positive return is higher then the returns in April. We are testing the hypothesis of Arsad and Coutts, (1997), that returns in April are significantly positive than in January. Their methodology was based on a dummy variable regression model for the period 1935-1994 using FT-30 index. They analysed sub-samples of the data and found that in all of the sub-samples April displays positive mean daily returns.

The data that we will use are total daily returns starting from 07/07/2008 to 10/27/2010 of the Thomson Reuters Jeffries Commodity Performance Index returns,

118

Page 119: Introduction to Econometrics 1

(TR/JCPI), which total to 585 observations. The return financial series are calculated by taking the log difference of the index and by using the daily price change returns expressed as percentages. The reason of using these two different methods is to show logarithms and percentages in spotting volatility changes.

The logarithmic formula that we have used is:

(6)

Where: Rt is the daily return for month t, Pt is the closing daily price for month t, and Pt-1 is the previous daily closing price for month t-1.

The daily price change returns expressed as percentages is given by the formula

x 100 (7)

Where: Rt is the daily return for month t, P1 is the price of a financial asset at the end of period 1, and P0 is the price of a financial asset at the beginning of the period.

Figure 1 shows the fluctuations of the prices of the Thomson Reuters Jeffries Commodity Performance Index, (TR/JCPI) starting from 2000.

Source: Thomson Reuters Jeffries Commodity Performance Index, (TR/JCPI) website.

According to Figure 1, there was a tremendous increase of 586,845 dated 07/03/08 and then a continuous decrease that has reached a high low of 246,266. It is obvious that there was an intense negative volatility cluster that resulted that the various commodities that constitute the index to fall sharply. In contrast, during the years 2003 to 2008, there was a continuous increase, which showed an upward positive trend. Unfortunately, we have a limited dataset that restrict our analysis for the January effects.

2. Statistical, econometric tests and analysis

Figure 2 and Table 1 shows the normality tests of the total returns calculated by taking the log difference of the index.

119

Page 120: Introduction to Econometrics 1

Table 1 and Figure 2 display Jarque - Bera normality test, which is calculated by taking the log difference of the Thomson Reuters Jeffries Commodity Performance Index total returns,(TR/JCPI).

0

20

40

60

80

100

-0.06 -0.04 -0.02 0.00 0.02 0.04 0.06

Series: LRSample 2 585Observations 584

Mean -0.000974Median -0.000774Maximum 0.064717Minimum -0.074723Std. Dev. 0.017810Skewness -0.160579Kurtosis 4.624725

Jarque-Bera 66.74330Probability 0.000000

Source: Author’s calculation based on E-views software.

We state the hypotheses as follows:

H0: The log daily difference returns of the Thomson Reuters Jeffries Commodity Performance Index total returns, (TR/JCPI) are normally distributed.

H1: The log daily difference returns of the Thomson Reuters Jeffries Commodity Performance Index total returns, (TR/JCPI) are not normally distributed.

According to Table 1 and Figure 2, the Jarque – Bera statistic is 66.74, which is statistically significant at the 5% significance value. The joint test of the null hypothesis that sample skewness equals 0 and sample kurtosis equals 3 is rejected. The probability of the p- value 0.0000 is very significant, so we can reject H0 of normality. The distribution is leptokurtic and slightly negatively skewed. The mean is -0.000974 and the standard deviation is 0.017810. There is a high dispersion from the actual data.

Table 2 and Figure 3 display Jarque Bera normality test, which is calculated by taking the daily price change returns expressed as percentages of Thomson Reuters Jeffries Commodity Performance Index total returns,(TR/JCPI).

120

Page 121: Introduction to Econometrics 1

0

20

40

60

80

100

-6 -4 -2 0 2 4 6

Series: RSample 2 585Observations 584

Mean -0.081590Median -0.077364Maximum 6.685728Minimum -7.199912Std. Dev. 1.777365Skewness -0.064443Kurtosis 4.613409

Jarque-Bera 63.74602Probability 0.000000

Source: Author’s calculation based on E-views software.

We state the hypotheses as follows:

H0: The daily price change returns expressed as percentages of the Thomson Reuters Jeffries Commodity Performance Index total returns,(TR/JCPI) are normally distributed.

H1:. The daily price change returns expressed as percentages of the Thomson Reuters Jeffries Commodity Performance Index total returns,(TR/JCPI) are not normally distributed.

According to Table 2 and Figure 3, the Jarque – Bera statistic is 63.75, which is above the critical value at 5% significance value. The probability of the p- value 0.0000 is very significant, so we can reject H0 of normality. The distribution is leptokurtic, as the value 4.61 is bigger than 3 and slightly negatively skewed. The mean is -0.081590 and the standard deviation is 1.777365. There is significant dispersion from the mean of the actual data.

Tables 3 and 4 show the ADF tests of the log difference and the daily price change returns expressed as percentages of the Thomson Reuters Jeffries Commodity Performance Index total returns,(TR/JCPI).

Table 3 shows the ADF tests of the log difference of the Thomson Reuters Jeffries Commodity Performance Index total returns,(TR/JCPI).

121

Page 122: Introduction to Econometrics 1

ADF Test Statistic -10.54738 1% Critical Value* -3.4440 5% Critical Value -2.8669 10% Critical Value -2.5696

*MacKinnon critical values for rejection of hypothesis of a unit root.

Augmented Dickey-Fuller Test EquationDependent Variable: D(LR)Method: Least SquaresDate: 08/28/13 Time: 18:38Sample(adjusted): 7 585Included observations: 579 after adjusting endpoints

Variable Coefficient Std. Error t-Statistic Prob. LR(-1) -0.980186 0.092932 -10.54738 0.0000

D(LR(-1)) -0.056470 0.084169 -0.670911 0.5025D(LR(-2)) -0.040042 0.073412 -0.545442 0.5857D(LR(-3)) -0.003290 0.059801 -0.055022 0.9561D(LR(-4)) 0.038236 0.041583 0.919517 0.3582

C -0.000936 0.000746 -1.255455 0.2098R-squared 0.520784 Mean dependent var -2.57E-05Adjusted R-squared 0.516602 S.D. dependent var 0.025641S.E. of regression 0.017827 Akaike info criterion -5.205851Sum squared resid 0.182109 Schwarz criterion -5.160656Log likelihood 1513.094 F-statistic 124.5404Durbin-Watson stat 1.999997 Prob(F-statistic) 0.000000

Source: Author’s calculation based on E-views software.

For a level of significance of one per cent, the critical value of the t-statistic from Dickey-Fuller’s table is -3.4440. According to Table 3 and to the sample evidence, we can reject the null hypothesis namely the existence of a unit root with one, five and ten per cent significance level. The ADF test statistic is -10.54738, which is smaller than the critical values, (-3.4440, -2.8669, -2.5696). In other words, the log difference of the Thomson Reuters Jeffries Commodity Performance Index total returns, (TR/JCPI) are stationary series. In addition, there is no autocorrelation problem as shown from the Durbin – Watson statistic.

122

Page 123: Introduction to Econometrics 1

Table 4 shows the ADF tests of the daily price change returns expressed as percentages of the Thomson Reuters Jeffries Commodity Performance Index total returns,(TR/JCPI).

ADF Test Statistic -10.61150 1% Critical Value* -3.4440 5% Critical Value -2.8669 10% Critical Value -2.5696

*MacKinnon critical values for rejection of hypothesis of a unit root.

Augmented Dickey-Fuller Test EquationDependent Variable: D(R)Method: Least SquaresDate: 08/28/13 Time: 18:40Sample(adjusted): 7 585Included observations: 579 after adjusting endpoints

Variable Coefficient Std. Error t-Statistic Prob. R(-1) -0.988133 0.093119 -10.61150 0.0000

D(R(-1)) -0.050056 0.084265 -0.594032 0.5527D(R(-2)) -0.032258 0.073457 -0.439141 0.6607D(R(-3)) 0.002205 0.059846 0.036849 0.9706D(R(-4)) 0.040629 0.041580 0.977130 0.3289

C -0.078746 0.074288 -1.060009 0.2896R-squared 0.521579 Mean dependent var -0.002572Adjusted R-squared 0.517404 S.D. dependent var 2.561163S.E. of regression 1.779218 Akaike info criterion 4.000533Sum squared resid 1813.898 Schwarz criterion 4.045728Log likelihood -1152.154 F-statistic 124.9378Durbin-Watson stat 2.000263 Prob(F-statistic) 0.000000

Source: Author’s calculation based on E-views software.

According to Table 4, for a level of significance of one, five and ten per cent, the critical value of the t-statistic from Dickey-Fuller’s table is -3.4440, -2.8669 and -2.5696. According to Table 4, we can reject the null hypothesis namely the existence of a unit root with one, five and ten per cent significance level. The ADF test statistic is -10.61150, which is smaller than the critical values, (-3.4440, -2.8669, -2.5696). In other words, the log difference of the Thomson Reuters Jeffries Commodity Performance Index total returns,(TR/JCPI) are stationary series. The Durbin – Watson statistic is 2.00, so there is no autocorrelation.

123

Page 124: Introduction to Econometrics 1

Table 5 shows the Autoregressive Conditional Heteroskedastic test, (ARCH 1,5) of the daily price change returns expressed as percentages of the Thomson Reuters Jeffries Commodity Performance Index total returns,(TR/JCPI).

ARCH Test:F-statistic 14.29835 Probability 0.000000Obs*R-squared 64.17012 Probability 0.000000

Test Equation:Dependent Variable: RESID^2Method: Least SquaresDate: 08/24/13 Time: 14:10Sample(adjusted): 12 585Included observations: 574 after adjusting endpoints

Variable Coefficient Std. Error t-Statistic Prob. C 1.419641 0.331044 4.288379 0.0000

RESID^2(-1) -0.030329 0.041412 -0.732374 0.4642RESID^2(-2) 0.071553 0.040282 1.776300 0.0762RESID^2(-3) 0.109845 0.040139 2.736618 0.0064RESID^2(-4) 0.229664 0.040287 5.700630 0.0000RESID^2(-5) 0.161264 0.041393 3.895883 0.0001

R-squared 0.111795 Mean dependent var 3.128923Adjusted R-squared 0.103976 S.D. dependent var 6.013453S.E. of regression 5.692247 Akaike info criterion 6.326485Sum squared resid 18404.15 Schwarz criterion 6.371983Log likelihood -1809.701 F-statistic 14.29835Durbin-Watson stat 2.021232 Prob(F-statistic) 0.000000

Source: Author’s calculation based on E-views software.

The presence of ARCH(1,5) in the residuals is calculated by regressing the squared residuals on a constant and p lags. In this case, we use the daily price change returns expressed as percentages of the Thomson Reuters Jeffries Commodity Performance Index total returns,(TR/JCPI).

According to Table 5, both the F-statistic and the Lagrange Multiplier, LM- statistics are significant. ARCH effect is present in the daily price change returns expressed as percentages of the Thomson Reuters Jeffries Commodity Performance Index total returns,(TR/JCPI). Specifically, the F-statistic is 14.29835 with a significant p-value of 0.000 and the LM – statistic is 64.17012 with a significant p-value of 0.000.

124

Page 125: Introduction to Econometrics 1

Figure 4 shows the residuals graph of the daily price change returns expressed as percentages of the Thomson Reuters Jeffries Commodity Performance Index total returns,(TR/JCPI).

-8

-6

-4

-2

0

2

4

6

8

100 200 300 400 500

R Residuals

Source: Author’s calculation based on E-views software.

According to Figure 4, we can see that there is a significant positive and negative volatility clusters depicted by the illustrated arrows, which confirms the ARCH effect on the time series of Thomson Reuters Jeffries Commodity Performance Index total returns,(TR/JCPI) based on the Lagrange Multiplier, (LM). This figure shows that the mean and variance are not constant or non stationary.

Table 6 shows the Autoregressive Conditional Heteroskedastic test, (ARCH 1,5) of the log difference of the Thomson Reuters Jeffries Commodity Performance Index total returns,(TR/JCPI).

ARCH Test:F-statistic 14.54688 Probability 0.000000Obs*R-squared 65.15890 Probability 0.000000

Test Equation:Dependent Variable: RESID^2Method: Least SquaresDate: 08/24/13 Time: 14:16Sample(adjusted): 12 585Included observations: 574 after adjusting endpoints

Variable Coefficient Std. Error t-Statistic Prob. C 0.000142 3.32E-05 4.262611 0.0000

RESID^2(-1) -0.034103 0.041408 -0.823595 0.4105RESID^2(-2) 0.068032 0.040261 1.689790 0.0916RESID^2(-3) 0.117220 0.040071 2.925303 0.0036

125

Page 126: Introduction to Econometrics 1

RESID^2(-4) 0.231715 0.040267 5.754497 0.0000RESID^2(-5) 0.161789 0.041387 3.909143 0.0001

R-squared 0.113517 Mean dependent var 0.000314Adjusted R-squared 0.105714 S.D. dependent var 0.000605S.E. of regression 0.000572 Akaike info criterion -12.08552Sum squared resid 0.000186 Schwarz criterion -12.04002Log likelihood 3474.544 F-statistic 14.54688Durbin-Watson stat 2.022482 Prob(F-statistic) 0.000000

Source: Author’s calculation based on E-views software.The presence of ARCH(1,5) in the residuals is calculated by regressing the squared residuals on a constant and p lags. In this case, we use the log difference of the Thomson Reuters Jeffries Commodity Performance Index total returns,(TR/JCPI). According to Table 6, both the F-statistic and the Lagrange Multiplier, LM- statistics are significant. LM statistic is defined as the multiplication of the observation with R-squared. ARCH effect is present in the log difference of the Thomson Reuters Jeffries Commodity Performance Index total returns,(TR/JCPI). Specifically, the F-statistic is 14.54688 with a significant p-value of 0.000 and the LM – statistic is 65.15890 with a significant p-value of 0.000.

Figure 5 shows the residuals graph of the log difference of the Thomson Reuters Jeffries Commodity Performance Index total returns,(TR/JCPI).

-0.08

-0.06

-0.04

-0.02

0.00

0.02

0.04

0.06

0.08

100 200 300 400 500

LR Residuals

Source: Author’s calculation based on E-views software.

According to Figure 5, we can see that there is a significant positive and negative volatility clusters depicted by the arrows, which confirms the ARCH effect on the time series of Thomson Reuters Jeffries Commodity Performance Index total returns,(TR/JCPI) based on the Lagrange Multiplier, (LM).

Table 7 displays the GARCH(1,1) model of the log difference of the Thomson Reuters Jeffries Commodity Performance Index total returns,(TR/JCPI).

Dependent Variable: LR

126

Page 127: Introduction to Econometrics 1

Method: ML - ARCHDate: 08/31/13 Time: 16:03Sample: 2 585Included observations: 584Convergence achieved after 21 iterationsBollerslev-Wooldrige robust standard errors & covariance

Coefficient Std. Error z-Statistic Prob. C 7.15E-05 0.000559 0.127973 0.8982

Variance EquationC 7.61E-07 1.12E-06 0.680912 0.4959

ARCH(1) 0.042498 0.017616 2.412435 0.0158GARCH(1) 0.953173 0.018179 52.43310 0.0000

R-squared -0.003455 Mean dependent var -0.000974Adjusted R-squared -0.008645 S.D. dependent var 0.017810S.E. of regression 0.017886 Akaike info criterion -5.456046Sum squared resid 0.185555 Schwarz criterion -5.426115Log likelihood 1597.166 Durbin-Watson stat 2.055019

Source: Author’s calculation based on E-views software.

According to Table 7, the z-statistic, the coefficients and the p- values of the variance equation of the log difference are very statistically significant. The log likelihood is maximised at the value of 1597.166. This means that the conditional variance is highly persistent. The ARCH has a coefficient of 0.042 and a significant z – statistic of 2.41.The GARCH effect has a coefficient of 0.95 and a very significant p-value of 0.0000. There is strong evidence of variance persistence and high volatility clustering.

Table 8 shows the GARCH(1,1) model of the daily price change returns expressed as percentages of the Thomson Reuters Jeffries Commodity Performance Index total returns,(TR/JCPI).

Dependent Variable: RMethod: ML - ARCHDate: 08/31/13 Time: 16:05Sample: 2 585Included observations: 584Convergence achieved after 15 iterationsBollerslev-Wooldrige robust standard errors & covariance

Coefficient Std. Error z-Statistic Prob. C 0.017548 0.055908 0.313861 0.7536

Variance EquationC 0.007689 0.011153 0.689394 0.4906

ARCH(1) 0.042456 0.017272 2.458056 0.0140GARCH(1) 0.953188 0.017865 53.35472 0.0000

R-squared -0.003116 Mean dependent var -0.081590Adjusted R-squared -0.008305 S.D. dependent var 1.777365S.E. of regression 1.784731 Akaike info criterion 3.752371Sum squared resid 1847.453 Schwarz criterion 3.782302Log likelihood -1091.692 Durbin-Watson stat 2.059260

Source: Author’s calculation based on E-views software.

According to Table 8, the z-statistic, the coefficients and the p- values of the variance equation of the daily price change returns expressed as percentages are very statistically significant. The log likelihood is maximised at the value of -1091.692. The coefficients of the lagged squared error and lagged conditional variance are very close to unity. This means that the conditional variance is highly persistent. The

127

Page 128: Introduction to Econometrics 1

ARCH effect after applying the Bollerslev-Wooldrige robust standard errors & covariance heteroskedasticity test shows a coefficient of 0.042 and a z – statistic of 2.46. The GARCH effect has a coefficient of 0.95 and a significant p-value of 0.0000 at the 5% significance level.

Tables 9 and 10 are used to test the hypothesis of Arsad and Coutts, (1997), that returns in April are significantly positive than in January. In other words, we are trying to prove that the market follows a random walk under the weak form hypothesis. If, we found evidence of the January effect, then, luck in contrast to skills allow investors to outperform the market.

Table 9 displays a t-Test: Paired Two Sample for Means for daily percentage changes in total returns for the years 2009 – 2010.

  January AprilMean -0,34407 0,033532Variance 3,584054 2,010858Observations 39 39Pearson Correlation -0,09435  Hypothesized Mean Difference 0  df 38  t Stat -0,95465  P(T<=t) one-tail 0,172894  t Critical one-tail 1,685954  P(T<=t) two-tail 0,345788  t Critical two-tail 2,024394  

Source: Author’s calculation based on Excel software.

We state the hypotheses as follows:

H0: There is no significant difference of daily price change returns expressed as percentages of the Thomson Reuters Jeffries Commodity Performance Index total returns,(TR/JCPI) between the sample means of January and April.

H1: There is significant difference of daily price change returns expressed as percentages of the Thomson Reuters Jeffries Commodity Performance Index total returns,(TR/JCPI) between the sample means of January and April.

According to Table 9, the mean return of April is 0,034, but not significantly positive from January. In January, we have a negative mean return of -0.34. The t-statistic is -0,95 and for a two-tailed test is within the critical area of ±2,02. Thus, we accept the null hypothesis in terms that there is no significant difference of the means of daily price change returns for the months of January and April. Thus, skills and not luck allow investors to outperform the market. The limitation of this test is that the sample period was restricted for one year.Table 10 shows a t-Test: Paired Two Sample for Means of logarithmic difference total returns for the years 2009 – 2010.

  January AprilMean -0,00362 0,000303Variance 0,000366 0,000202Observations 39 39

128

Page 129: Introduction to Econometrics 1

Pearson Correlation -0,07692  Hypothesized Mean Difference 0  df 38  t Stat -0,99292  P(T<=t) one-tail 0,163515  t Critical one-tail 1,685954  P(T<=t) two-tail 0,32703  t Critical two-tail 2,024394  

Source: Author’s calculation based on Excel software.

We state the hypotheses as follows:

H0: There is no significant difference of logarithmic difference total returns expressed as percentages of the Thomson Reuters Jeffries Commodity Performance Index total returns,(TR/JCPI) between the sample means of January and April.

H1: There is significant difference of logarithmic difference total returns expressed as percentages of the Thomson Reuters Jeffries Commodity Performance Index total returns,(TR/JCPI) between the sample means of January and April.

According to Table 10, the mean return of April is 0,0003, but not significantly positive from January. In January, we have a negative mean return of -0.004. The t-statistic is -0,99 and for a two-tailed test is within the critical area of ±2,02. Thus, we cannot reject the null hypothesis in terms that there is no significant difference of the means of logarithmic daily price change returns for the months of January and April.

According to the results of the above tables, our evidence contradicts Arsad and Coutts, (1997), that returns in April are significantly positive than in January. Thus, volatility clusters are not the result of the January effect and investors use skills instead of luck to outperform the market. The limitation of this approach is that, we have used a limited dataset that did not show evidence of January effects. In addition, we did not use a dummy variable regression model.

3. Summarizes and Concludes

The author’s attempted to explain, illustrate, test and analyse the total returns of the Thomson Reuters Jeffries Commodity Performance Index by using Autoregressive Conditional Heteroskedastic models, (ARCH) and Generalized Autoregressive Conditional Heteroskedastic models, (GARCH).

We have concluded that the conditional variance is highly persistent. There is a significant positive and negative volatility clusters, which confirms the ARCH and GARCH effect’s on the time series of Thomson Reuters Jeffries Commodity Performance Index total returns,(TR/JCPI) based on the F-statistic and the Lagrange Multiplier, (LM). Thus, we reject the null hypothesis that the returns expressed as percentages and logarithmic of the commodity index are constant and similarly distributed over time.

Our evidence contradicts Arsad and Coutts, (1997), that returns in April are significantly positive than in January. Thus, volatility clusters and variance

129

Page 130: Introduction to Econometrics 1

persistence are not the result of the January effect and investors use skills instead of luck to outperform the market. If we take into consideration the US recession in addition to the autoregressive models, we conclude that the market is semi – strong efficient, as prices reflected not only all historical data, but also relevant macroeconomic information such as the US recession in 2008. It has created a large negative volatility of the commodity prices. Future research should focus on regressive relationships between the returns of the Jeffries Commodity Index and other macroeconomic indicators.

References

Alexander, C., (2003), Market Models. A Guide to Financial Data Analysis. John Wiley and Sons Ltd. ISMA Centre. The Business School For Financial Markets.

Arsad, M., and Coutts, A., (1997), Some Anomalous Evidence Regarding Market Efficiency. Journal of Financial Economics, Vol.5.

Brooks,C.,(2002), Introductory econometrics for Finance. Cambridge University Press.

Fama, E.F.,(1970), Efficient Capital Markets: A Review of Theory and Empirical Work. Journal of Finance, Vol.25, pp 383 – 417.

Wikipedia internet source. www. wikepedia.org.

I have attached a second recent article that is testing the TGARCH, EGARCH, and PARCH heteroskedasticity effect on the spot AUD/USD exchange rate volatility. I am trying to support you with research skills that are needed for postgrasuate studies. I

130

Page 131: Introduction to Econometrics 1

would like that you get the best from your studies and get recruited in the best places of the Financial Services sector around the U.K.

Application of a TGARCH, EGARCH, and PARCH models to test the spot AUD/USD exchange rate volatility.

Abstract

In this article, we have tested the volatility of the returns of the spot exchange rate of AUD/USD for changing conditional variances. Threshold generalized autoregressive conditional heteroskedastic models, (TGARCH), exponential generalized autoregressive conditional heteroskedastic models, (EGARCH) models and power autoregressive conditional heteroskedastic models, (PARCH), take into account the non-linearity that arises in financial time series. We have checked the volatility clusters for a long period of time that arises in the financial times series of returns or the fact that large and small values occur persistently in clusters. We have found that the AUD/USD spot exchange rate time series is non-normal. The natural logarithmic monthly returns of the AUD/USD are a stationary series. By applying a TGARCH (1,1) model, we have found that there is no leverage effect. By applying an EGARCH (1,1) model, we have found that the asymmetry term is not statistically significant at the 5% significance level. Negative shocks do not imply a higher next period conditional variance than positive shocks of the same sign. By applying a PARCH (1,1) model, we have found that the asymmetric term is statistically significant at the 5% significance level. Positive shocks related to good news implied a higher next period conditional variance than negative shocks related to bad news of the same extent. By comparing the three models in terms of Akaike information criterion and Schwarz criterion, we have concluded the PARCH (1,1) is the best – fit model, as it has the lowest values. There is no serial correlation in the residuals of the TGARCH (1,1), EGARCH (1,1), and PARCH (1,1) models. The residuals of the TGARCH (1,1), EGARCH (1,1) and PARCH (1,1) are homoskedastic and there is no additional ARCH effect. Finally, there are negative and positive shocks that are driving the AUD/USD exchange spot rate away from normality. The data that we have used are monthly returns starting from 01/01/1990 to 01/01/2013, which total to 276 observations. The total dataset includes 277 observations. The data was obtained from the Federal Reserve Statistical Release Department and the symbol of the series is H.10.

Keywords: Threshold generalized autoregressive conditional heteroskedastic models, (TGARCH), exponential generalized autoregressive conditional heteroskedastic models, (EGARCH) models and power autoregressive conditional heteroskedastic models, (PARCH), AUD/USD exchange spot rate.

Introduction

131

Page 132: Introduction to Econometrics 1

This article has focused on modeling the natural logarithmic returns of the AUD/USD exchange spot rate, by using TGARCH, EGARCH and PARCH models. The aim will be to spot the volatility clusters of the time series. By using E-Views 6, the models will be tested to validate the hypotheses that will be formulated.

Models, which include a changing conditional variance, are called autoregressive conditional heteroskedastic models. Engle (1982) suggested this type of model for estimating time series with conditional heteroskedasticity. Since then, different types of ARCH models appeared in the literature with different specifications such as mean reversion, mean dependency and variance dependency of TGARCH, and EGARCH. In contrast, the PARCH model measures the standard deviation rather than the conditional variance. According to EViews 6 User’s Guide II, (p.199), “the power parameter of the standard deviation can be estimated and the optional parameters are added to capture asymmetry of up to order r”. GARCH models capture all the history of shocks in a series in a changing variance environment, TGARCH and EGARCH models allow for asymmetric shock to volatility. EGARCH model was calibrated by Nelson in (1991). The advantage of the EGARCH model is that the variance will be positive even if the parameters are negative.

Diagnostic checking of the residuals of the volatility models will be done by checking the correlogram squared residuals, the ARCH LM test and the quantile – quantile, (QQ), theoretical distribution plots for normality. Specifically, the correlogram is used to check for serial correlation in the residuals. The ARCH LM test is used to check whether the standardized residuals have additional ARCH effect. Finally, the QQ plot is used to check whether the data are normally distributed. The quantile of normality and the quantile of residuals should lie on a straight line. Otherwise, there is deviation from linearity.

The rest of the paper is organized as follows. Section 1 describes the methodological issues and data descriptions. Section 2 is an analysis of statistical and econometrical tests and Section 3 summarizes and concludes.

1.Methodological issues and data descriptions

The methodologies that we have used are autoregressive volatility models. Heteroskedasticity is a violation of the OLS assumption that the variance error term is constant for all observations. This leads to biased estimates of the standard errors, inefficient estimates of the coefficients and incorrect confidence intervals and statistical tests. In other words, we are checking if the variance of the errors is not

132

Page 133: Introduction to Econometrics 1

constant and is creating volatility clusters. Autoregressive models have been studied by various researchers such as: Alexander,(2003), Brooks, (2002), Nelson, (1991), Bollerslev, (1986), Bollerslev, Chou, and Kroner, (1992), Bollerslev, Engle and Nelson, (1994), Bollerslev, Jeffrey, and Wooldridge, (1992), Ding, Granger and Engle, (1993), Engle, (1982), Engle and Bollerslev, (1986), Engle, Lilien and Robins, (1987), Glosten, Jaganathan and Runkle, (1993), Schwert, (1989), Taylor, (1986) and Zakoian, (1994).

The mean equation of the TGARCH(1,1), EGARCH(1,1), and PARCH(1,1) is as follows:

(1)

Where:yt is the dependent variable and in our case is the logarithmic monthly returns of the spot exchange rate AUD/USD.

is a constant.

The variance equation of the TGARCH (1,1) is as follows:

(2)

Where:

According to EViews 6 User’s Guide II, (p.199), the conditional variance equation of the EGARCH model is as follows:

(3)

Where:

133

Page 134: Introduction to Econometrics 1

We assume according to Nelson, (1991), that the follows a generalized error distribution, (GED). The leverage effect represented by is exponential.

The hypotheses that have been formulated and tested are as follows:

According to EViews 6 User’s Guide II, (p.199, 200), the standard deviation equation of the PARCH model is as follows:

(4)

Where:

The hypotheses that have been formulated and tested are as follows:

The return financial series are calculated by taking the natural logarithmic monthly returns of the AUD/USD spot exchange rate. The logarithmic formula that we have used is:

(5)

Where: Rt is the monthly return for month t, Pt is the closing price for month t, and Pt-1 is the previous closing price for month t-1.

Descriptive statistics will be displayed and to test for non-normality the Jarque – Bera statistic is analysed. We check for stationary of the series by applying the Augmented

134

Page 135: Introduction to Econometrics 1

Dickey – Fuller’s stationary test, (ADF). Finally, we apply different types of Autoregressive Heteroskedastic models to test for persistence of volatility clusters.

The data that we have used are monthly logarithmic returns starting from 01/01/1990 to 01/01/2013 of the AUD/USD spot exchange rate, which total to 276 observations. The data was obtained from the Federal Reserve Statistical Release Department and it is denoted by the symbol, H.10.

Figure 1 shows the fluctuations of the percentage returns of the spot rate AUD/USD.

AUD/USD

-20

-15

-10

-5

0

5

10

1990

-02

1991

-07

1992

-12

1994

-05

1995

-10

1997

-03

1998

-08

2000

-01

2001

-06

2002

-11

2004

-04

2005

-09

2007

-02

2008

-07

2009

-12

2011

-05

2012

-10

Date

Perc

enta

ge re

turn

Source: Author’s calculation based on monthly data obtained from the US Federal Reserve Statistical Release Department.

According to Figure 1, there was a continuous discount /premium fluctuations of the AUD against the USD. The greatest discount of the spot rate was recorded in October 2008 and the value was -15.89%. There is a persistent negative and positive volatility clusters that resulted to this fluctuations. In May, 2009, the spot rate value has recorded a premium of 6.85 percent. In May, 2010, the value of the spot rate has recorded a discount of -5.93 percent. As, we can see from Figure 1, large and small values occur persistently in clusters. 2. Statistical and econometrical tests

The following Figure and Table display descriptive statistics, the normality tests of the natural logarithmic returns of the AUD/USD spot rate.

Table 1 and Figure 2 display Jarque - Bera normality test and descriptive statistics, which are calculated by taking the log difference of the AUD/USD spot rate. The dataset has covered the period starting from 01/01/1990 to 01/01/2013.

135

Page 136: Introduction to Econometrics 1

0

10

20

30

40

50

-0.15 -0.10 -0.05 0.00 0.05

Series: LRSample 1990M02 2013M01Observations 276

Mean 0.001072Median 0.002359Maximum 0.071211Minimum -0.173060Std. Dev. 0.026888Skewness -1.127986Kurtosis 8.927606

Jarque-Bera 462.5981Probability 0.000000

Source: Author’s calculation based on EViews 6 software.Significant at the 5% significance level.

We state the hypotheses as follows:

H0: The natural logarithmic monthly returns of the AUD/USD spot exchange rate are normally distributed.

H1: The natural logarithmic monthly returns of the AUD/USD spot exchange rate are not normally distributed.

According to Table 1 and Figure 2, the Jarque – Bera statistic is 462.60, which is very significant at the 5% significance level. The joint test of the null hypothesis that sample skewness equals 0 and sample kurtosis equals 3 is rejected. The probability of the p- value 0.0000 is very significant, so we can reject H0 of normality. The distribution is leptokurtic and slightly negatively skewed. The AUD/USD spot rate time series is non-normal. The mean is 0.001072 and the dispersion around the mean is 0.026888.

The following Table show the ADF test of the monthly natural logarithmic returns of the AUD/USD spot exchange rate for the period 01/01/1990 to 01/01/2013.

Table 2 shows the ADF test of the natural logarithmic monthly returns of the AUD/USD spot exchange rate for the period starting from 01/01/1990 to 01/01/2013.

Null Hypothesis: LR has a unit root

136

Page 137: Introduction to Econometrics 1

Exogenous: ConstantLag Length: 0 (Automatic based on SIC, MAXLAG=15)

t-Statistic   Prob.*

Augmented Dickey-Fuller test statistic -11.64366  0.0000Test critical values: 1% level -3.453997

5% level -2.87184510% level -2.572334

*MacKinnon (1996) one-sided p-values.

Augmented Dickey-Fuller Test EquationDependent Variable: D(LR)Method: Least SquaresDate: 10/21/13 Time: 20:19Sample (adjusted): 1990M03 2013M01Included observations: 275 after adjustments

Variable Coefficient Std. Error t-Statistic Prob.  

LR(-1) -0.661475 0.056810 -11.64366 0.0000C 0.000819 0.001529 0.535537 0.5927

R-squared 0.331824    Mean dependent var 0.000115Adjusted R-squared 0.329376    S.D. dependent var 0.030931S.E. of regression 0.025330    Akaike info criterion -4.506408Sum squared resid 0.175159    Schwarz criterion -4.480104Log likelihood 621.6310    Hannan-Quinn criter. -4.495851F-statistic 135.5748    Durbin-Watson stat 1.911204Prob(F-statistic) 0.000000

Source: Author’s calculation based on EViews 6 software.Significant at the 5% significance level.

For a level of significance of one per cent, the critical value of the t-statistic from Dickey-Fuller’s table is -3.45. According to Table 2 and to the sample evidence, we can reject the null hypothesis namely the existence of a unit root with one, five and ten per cent significance level. The ADF test statistic is -11.64, which is smaller than the critical values, (-3.4749, -2.87, -2.57). In other words, the monthly natural logarithmic returns of the AUD/USD are a stationary series.

Table 3 displays the Threshold TGARCH (1,1) model of the natural logarithmic monthly returns of the AUD/USD spot rate. We have used the normal distribution. The dataset has covered the period starting from 01/01/1990 to 01/01/2013.

Dependent Variable: LRMethod: ML - ARCH (Marquardt) - Normal distributionDate: 10/21/13 Time: 20:45Sample: 1990M02 2013M01Included observations: 276

137

Page 138: Introduction to Econometrics 1

Convergence achieved after 26 iterationsPresample variance: backcast (parameter = 0.7)GARCH = C(2) + C(3)*RESID(-1)^2 + C(4)*RESID(-1)^2*(RESID(-1)<0) +        C(5)*GARCH(-1)

Variable Coefficient Std. Error z-Statistic Prob.  

C 0.000238 0.001450 0.164156 0.8696

Variance Equation

ω 4.35E-05 3.11E-05 1.401445 0.1611RESID(-1)^2 0.119815 0.086485 1.385384 0.1659

RESID(-1)^2*(RESID(-1)<0) 0.057151 0.081022 0.705375 0.4806

GARCH(-1) 0.790205 0.074457 10.61290 0.0000

R-squared -0.000965    Mean dependent var 0.001072Adjusted R-squared -0.000965    S.D. dependent var 0.026888S.E. of regression 0.026901    Akaike info criterion -4.548434Sum squared resid 0.199000    Schwarz criterion -4.482847Log likelihood 632.6838    Hannan-Quinn criter. -4.522115Durbin-Watson stat 1.317331

Source: Author’s calculation based on EViews 6 software.Significant at the 5% significance level.

According to Table 3, the asymmetric term, RESID(-1)^2*(RESID(-1)<0) has a coefficient of 0.057, a z-statistic of 0.71 and a p-probability of 0.48, which is not statistically significant at the 5 % significance level. Positive shocks related to good news have not implied a higher next period conditional variance than negative shocks related to bad news of the same extent. The ARCH effect is not significant. Only, the GARCH effect is statistically significant as the p-probability is 0.0000.

The mean and variance equation (1) and (2) will be used in combination with Table 3 to show the coefficients .

Table 4 shows the estimation of the coefficients

Coefficients Coefficients Estimated

z-statistic Probability

c 0.0002 0.16 0.874.35E-05 1.40 0.16

0.12 1.39 0.17

138

Page 139: Introduction to Econometrics 1

0.057 0.71 0.480.79 10.61* 0.00

Source: Author’s calculation based on EViews 6 software.* Significant at the 5% significance level.

According to Table 4, all the coefficients are not significant, except the coefficient that captures the GARCH effect. Specifically, the constant c of the mean equation has a coefficient of 0.0002 and it is not significantly positive at 5% significance level. The coefficient of the constant of the variance equation is not statistically significant with a z- statistic of 1.40. The , which explains the ARCH effect is not significant. The leverage effect is not significant with a z-statistic of 0.71 and a p-value of 0.48, which implies that there is no leverage effect and persistent asymmetry in the time series. Finally, the coefficient of the variance equation, which measures the GARCH effect, is positively significant at the 5% significance level. The persistent asymmetry is a strong evidence of the interactions of noise traders and arbitrageurs. The noise traders by buying in the wrong price they have created persistent fluctuations or volatility clusters of the price of the AUD/USD spot exchange rate.

Table 5 shows the residuals test in terms of the correlogram of squared residuals of the TGARCH(1,1) model of the natural logarithmic monthly returns of the AUD/USD spot rate for the period 01/01/1990 to 01/01/2013.

AC PAC Q-Stat Prob1 -0.021 0.011 26.152 0.0722 0.045 0.006 26.746 0.0843 -0.021 -0.012 26.878 0.1084 -0.051 -0.068 27.651 0.1185 0.068 0.115 29.044 0.113

139

Page 140: Introduction to Econometrics 1

6 -0.081 -0.105 31.020 0.0967 -0.053 -0.092 31.864 0.1038 -0.016 0.010 31.942 0.1289 -0.039 0.012 32.408 0.147

10 0.045 0.030 33.017 0.16211 0.005 0.001 33.024 0.19612 -0.028 -0.072 33.267 0.226

Source: Author’s calculation based on EViews 6 software.Significant at the 5% significance level.

We state the hypotheses as follows:

H0: There is no serial correlation in the residuals.

H1: There is serial correlation in the residuals.

According to Table 5, the Q-statistic associated with the p-values is not significant, which is a sign that there is no serial correlation in the residuals of the TGARCH(1,1) model.

Table 6 displays the residuals test in terms of ARCH LM test of the TGARCH(1,1) model. The aim is to check whether the standardized residuals of the natural logarithmic monthly returns of the AUD/USD spot exchange rate exhibit additional ARCH effect for the period 01/01/1990 to 01/01/2013.

Heteroskedasticity Test: ARCH

F-statistic 0.546217    Prob. F(1,273) 0.4605Obs*R-squared 0.549120    Prob. Chi-Square(1) 0.4587

Test Equation:

140

Page 141: Introduction to Econometrics 1

Dependent Variable: WGT_RESID^2Method: Least SquaresDate: 10/21/13 Time: 21:08Sample (adjusted): 1990M03 2013M01Included observations: 275 after adjustments

Variable Coefficient Std. Error t-Statistic Prob.  

C 0.950693 0.118063 8.052435 0.0000WGT_RESID^2(-1) 0.044678 0.060452 0.739065 0.4605

R-squared 0.001997    Mean dependent var 0.995508Adjusted R-squared -0.001659    S.D. dependent var 1.678502S.E. of regression 1.679894    Akaike info criterion 3.882584Sum squared resid 770.4177    Schwarz criterion 3.908888Log likelihood -531.8553    Hannan-Quinn criter. 3.893141F-statistic 0.546217    Durbin-Watson stat 2.011617Prob(F-statistic) 0.460502

Source: Author’s calculation based on EViews 6 software.Significant at the 5% significance level.

We state the hypotheses as follows:

H0: There is no ARCH effect or the residuals are homoskedastic.

H1: There is ARCH effect and the residuals are heteroskedastic.

According to Table 6, the probability of the Chi – square statistic is 0.46, which is greater than the 5% significance level. The observations multiplied by the R-squared or the LM test statistic has a value of 0.55. Therefore, we reject H1 and accept H0. In other words, the residuals of the TGARCH(1,1) are homoskedastic and there is no additional ARCH effect.

Figure 3 displays the QQ – plot of the residuals of the TGARCH(1,1) model of the natural logarithmic monthly returns of the AUD/USD spot exchange rate for the period 01/01/1990 to 01/01/2013. The purpose of this graph is to show that the residuals are normally distributed.

141

Page 142: Introduction to Econometrics 1

-.08

-.04

.00

.04

.08

.12

-.20 -.15 -.10 -.05 .00 .05 .10

Quantiles of RESID01

Qua

ntile

s of

Nor

mal

Source: Author’s calculation based on EViews 6 software.

According to Figure 3, there are negative and positive shocks that are driving the residuals of the AUD/USD exchange rate away from normality. The prerequisite that the series is normal is that the points in the QQ – plots should lie alongside the straight line, which is not the case in this Figure.

Table 7 shows the EGARCH(1,1) model of the natural logarithmic monthly returns of the AUD/USD spot exchange rate. We have used the generalized error distribution. The dataset has covered the period starting from 01/01/1990 to 01/01/2013.

142

Page 143: Introduction to Econometrics 1

Dependent Variable: LRMethod: ML - ARCH (Marquardt) - Generalized error distribution (GED)Date: 10/21/13 Time: 20:48Sample: 1990M02 2013M01Included observations: 276Convergence achieved after 42 iterationsPresample variance: backcast (parameter = 0.7)LOG(GARCH) = C(2) + C(3)*ABS(RESID(-1)/@SQRT(GARCH(-1))) + C(4)        *RESID(-1)/@SQRT(GARCH(-1)) + C(5)*LOG(GARCH(-1))

Variable Coefficient Std. Error z-Statistic Prob.  

C 0.000667 0.001455 0.458548 0.6466

Variance Equation

C(2) -0.858893 0.494951 -1.735311 0.0827C(3) 0.326125 0.095315 3.421562 0.0006C(4) -0.039778 0.056685 -0.701734 0.4828C(5) 0.918693 0.062897 14.60642 0.0000

GED PARAMETER 1.706175 0.174460 9.779723 0.0000

R-squared -0.000227    Mean dependent var 0.001072Adjusted R-squared -0.000227    S.D. dependent var 0.026888S.E. of regression 0.026891    Akaike info criterion -4.537769Sum squared resid 0.198853    Schwarz criterion -4.459065Log likelihood 632.2122    Hannan-Quinn criter. -4.506187Durbin-Watson stat 1.318303

Source: Author’s calculation based on EViews 6 software.Significant at the 5% significance level.

According to Table 7 and equation (3), the asymmetric term, RESID(-1)/@SQRT(GARCH(-1), has a coefficient of -0.040, a z-statistic of -0.70 and a p-value of 0.48, which is not statistically significant at the 5 % significance level. EGARCH represented by C(5), has a z – statistic of 14.61 and it is statistically significant at the 5% significance level. There is no evidence that positive shocks due to good news implied a higher next period conditional variance than negative shocks due to bad news of the same extent. The generalized error distribution, (GED) parameter has a coefficient of 1.71, a z-statistic of 9.78 and a p-value of 0.0000.

Table 8 shows the residuals test in terms of the correlogram of squared residuals of the EGARCH(1,1) model of the natural logarithmic monthly returns of the AUD/USD spot rate for the period 01/01/1990 to 01/01/2013.

AC PAC Q-Stat Prob1 -0.039 0.006 34.948 0.0892 0.033 0.009 35.284 0.106

143

Page 144: Introduction to Econometrics 1

3 -0.008 -0.011 35.305 0.1314 -0.037 -0.073 35.738 0.1495 -0.000 0.018 35.738 0.1816 0.004 0.036 35.742 0.2177 -0.033 -0.066 36.091 0.2438 0.030 0.047 36.378 0.2729 -0.022 -0.015 36.537 0.308

10 -0.012 -0.022 36.583 0.35011 0.029 0.066 36.853 0.38312 0.014 0.035 36.916 0.426

Source: Author’s calculation based on EViews 6 software.Significant at the 5% significance level.

We state the hypotheses as follows:

H0: There is no serial correlation in the residuals.

H1: There is serial correlation in the residuals.

According to Table 8, the Q-statistic associated with the p-values is not significant, which is a sign that there is no serial correlation in the residuals of the EGARCH(1,1) model. All the p-values are above the 5% significance level.

Table 9 displays the residuals test in terms of ARCH LM test of the EGARCH(1,1) model. The aim is to check whether the standardized residuals of the natural logarithmic monthly returns of the AUD/USD spot exchange rate exhibit additional ARCH effect for the period 01/01/1990 to 01/01/2013.

Heteroskedasticity Test: ARCH

F-statistic 0.791503    Prob. F(1,273) 0.3744

144

Page 145: Introduction to Econometrics 1

Obs*R-squared 0.794997    Prob. Chi-Square(1) 0.3726

Test Equation:Dependent Variable: WGT_RESID^2Method: Least SquaresDate: 10/21/13 Time: 21:09Sample (adjusted): 1990M03 2013M01Included observations: 275 after adjustments

Variable Coefficient Std. Error t-Statistic Prob.  

C 0.942907 0.121704 7.747522 0.0000WGT_RESID^2(-1) 0.053758 0.060425 0.889665 0.3744

R-squared 0.002891    Mean dependent var 0.996909Adjusted R-squared -0.000762    S.D. dependent var 1.748643S.E. of regression 1.749308    Akaike info criterion 3.963564Sum squared resid 835.4019    Schwarz criterion 3.989868Log likelihood -542.9901    Hannan-Quinn criter. 3.974121F-statistic 0.791503    Durbin-Watson stat 2.017043Prob(F-statistic) 0.374429

Source: Author’s calculation based on EViews 6 software.Significant at the 5% significance level.

We state the hypotheses as follows:

H0: There is no ARCH effect or the residuals are homoskedastic.

H1: There is ARCH effect and the residuals are heteroskedastic.

According to Table 9, the probability of the Chi – square statistic is 0.37, which is greater than the 5% significance level. The observations multiplied by the R-squared or the LM test statistic has a value of 0.79. Therefore, we reject H1 and accept H0. In other words, the residuals of the EGARCH(1,1) are homoskedastic and there is no additional ARCH effect.

Figure 4 displays the QQ – plot of the residuals of the EGARCH(1,1) model of the natural logarithmic monthly returns of the AUD/USD spot exchange rate for the period 01/01/1990 to 01/01/2013. The purpose of this graph is to show that the residuals are normally distributed.

145

Page 146: Introduction to Econometrics 1

-.08

-.06

-.04

-.02

.00

.02

.04

.06

.08

-.20 -.15 -.10 -.05 .00 .05 .10

Quantiles of RESID02

Qua

ntile

s of

Nor

mal

Source: Author’s calculation based on EViews 6 software.

According to Figure 4, there are negative and positive shocks that are driving the residuals of the AUD/USD exchange rate away from normality. The prerequisite that the series is normal is that the points in the QQ – plots should lie alongside the straight line, which is not the case in this Figure.

Table 10 shows the PARCH (1,1) model of the natural logarithmic monthly returns of the AUD/USD spot exchange rate. We have used the student’s t distribution. The dataset has covered the period starting from 01/01/1990 to 01/01/2013.

Dependent Variable: LRMethod: ML - ARCH (Marquardt) - Student's t distribution

146

Page 147: Introduction to Econometrics 1

Date: 10/21/13 Time: 20:55Sample: 1990M02 2013M01Included observations: 276Convergence achieved after 89 iterationsPresample variance: backcast (parameter = 0.7)@SQRT(GARCH)^C(6) = C(2) + C(3)*(ABS(RESID(-1)) - C(4)*RESID(        -1))^C(6) + C(5)*@SQRT(GARCH(-1))^C(6)

Variable Coefficient Std. Error z-Statistic Prob.  

C 0.001040 0.001426 0.729200 0.4659

Variance Equation

C(2) 1.30E-06 1.00E-05 0.129446 0.8970C(3) 0.108492 0.094904 1.143167 0.2530C(4) 0.087149 0.178676 0.487748 0.6257C(5) 0.757152 0.107071 7.071528 0.0000C(6) 2.990119 2.111413 1.416169 0.1567

T-DIST. DOF 15.21706 10.48813 1.450885 0.1468

R-squared -0.000001    Mean dependent var 0.001072Adjusted R-squared -0.000001    S.D. dependent var 0.026888S.E. of regression 0.026888    Akaike info criterion -4.548203Sum squared resid 0.198809    Schwarz criterion -4.456381Log likelihood 634.6520    Hannan-Quinn criter. -4.511357Durbin-Watson stat 1.318600

Source: Author’s calculation based on EViews 6 software.Significant at the 5% significance level.

According to Table 10 and equation (4), the constant, the ARCH and GARCH effect are not significant. The coefficient C(5) *@SQRT(GARCH(-1))^C(6), which measures the asymmetric term is statistically significant at the 5% significance level. C(5) has a coefficient of 0.76, a z-statistic of 7.07 and a p-value of 0.0000. Positive shocks related to good news implied a higher next period conditional variance than negative shocks related to bad news of the same extent. The student’s t distribution has a coefficient of 15.22, a z-statistic of 1.42 and a p-value of 0.15.

Table 11 shows the residuals test in terms of the correlogram of squared residuals of the PARCH (1,1) model of the natural logarithmic monthly returns of the AUD/USD spot rate for the period 01/01/1990 to 01/01/2013.

AC PAC Q-Stat Prob1 -0.039 0.015 31.201 0.1822 0.060 0.049 32.296 0.1843 0.011 0.011 32.332 0.220

147

Page 148: Introduction to Econometrics 1

4 -0.027 -0.076 32.564 0.2525 0.011 0.025 32.604 0.2946 -0.018 0.014 32.707 0.3357 -0.025 -0.060 32.905 0.3748 0.026 0.050 33.115 0.4139 -0.009 -0.015 33.139 0.460

10 -0.009 -0.005 33.164 0.50811 0.022 0.055 33.316 0.55012 0.017 0.032 33.405 0.593

Source: Author’s calculation based on EViews 6 software.Significant at the 5% significance level.

We state the hypotheses as follows:

H0: There is no serial correlation in the residuals.

H1: There is serial correlation in the residuals.

According to Table 11, the Q-statistic associated with the p-values is not significant, which is a sign that there is no serial correlation in the residuals of the PARCH (1,1) model. All the values are above the 5 % significance level.

Table 12 displays the residuals test in terms of ARCH LM test of the PARCH (1,1) model. The aim is to check whether the standardized residuals of the natural logarithmic monthly returns of the AUD/USD spot exchange rate exhibit additional ARCH effect for the period 01/01/1990 to 01/01/2013.

Heteroskedasticity Test: ARCH

F-statistic 0.109173    Prob. F(1,273) 0.7413

148

Page 149: Introduction to Econometrics 1

Obs*R-squared 0.109929    Prob. Chi-Square(1) 0.7402

Test Equation:Dependent Variable: WGT_RESID^2Method: Least SquaresDate: 10/21/13 Time: 21:12Sample (adjusted): 1990M03 2013M01Included observations: 275 after adjustments

Variable Coefficient Std. Error t-Statistic Prob.  

C 0.979695 0.118508 8.266904 0.0000WGT_RESID^2(-1) 0.019991 0.060504 0.330414 0.7413

R-squared 0.000400    Mean dependent var 0.999833Adjusted R-squared -0.003262    S.D. dependent var 1.682675S.E. of regression 1.685417    Akaike info criterion 3.889149Sum squared resid 775.4921    Schwarz criterion 3.915453Log likelihood -532.7580    Hannan-Quinn criter. 3.899706F-statistic 0.109173    Durbin-Watson stat 2.002023Prob(F-statistic) 0.741341

Source: Author’s calculation based on EViews 6 software.Significant at the 5% significance level.

We state the hypotheses as follows:

H0: There is no ARCH effect or the residuals are homoskedastic.

H1: There is ARCH effect and the residuals are heteroskedastic.

According to Table 12, the probability of the Chi – square statistic is 0.74, which is greater than the 5% significance level. The observations multiplied by the R-squared or the LM test statistic has a value of 0.11. Therefore, we reject H1 and accept H0. In other words, the residuals of the PARCH(1,1) are homoskedastic and there is no additional ARCH effect.

Figure 5 displays the QQ – plot of the residuals of the PARCH (1,1) model of the natural logarithmic monthly returns of the AUD/USD spot exchange rate for the period 01/01/1990 to 01/01/2013. The purpose of this graph is to show that the residuals are normally distributed.

149

Page 150: Introduction to Econometrics 1

-.08

-.06

-.04

-.02

.00

.02

.04

.06

.08

-.20 -.15 -.10 -.05 .00 .05 .10

Quantiles of RESID03

Qua

ntile

s of

Nor

mal

Source: Author’s calculation based on EViews 6 software.

According to Figure 5, there are negative and positive shocks that are driving the residuals of the AUD/USD exchange rate away from normality. The prerequisite that the series is normal is that the points in the QQ – plots should lie alongside the straight line, which is not the case in this Figure.

3. Summarizes and Concludes

150

Page 151: Introduction to Econometrics 1

In this article, we have tested volatility clustering by applying threshold generalized autoregressive conditional heteroskedastic models, (TGARCH), exponential generalized autoregressive conditional heteroskedastic models, (EGARCH) and Power Generalized Autoregressive Conditional Heteroskedastic models, (PGARCH). We have tested the volatility of the logarithmic monthly returns of the spot exchange rate of AUD/USD. The data that we have used are monthly returns starting from 01/01/1990 to 01/01/2013, which total to 276 observations. The total dataset includes 277 observations. The data was obtained from the Federal Reserve Statistical Release Department and the symbol of the series is H.10.

We have found that the AUD/USD spot exchange rate time series is non-normal. The natural logarithmic monthly returns of the AUD/USD are a stationary series. By applying a TGARCH (1,1) model, we have found that there is no leverage effect. the asymmetry term, RESID(-1)^2*(RESID(-1)<0) has a coefficient of 0.057, a z-statistic of 0.71 and a p-probability of 0.48, which is not statistically significant at the 5 % significance level. Positive shocks related to good news have not implied a higher next period conditional variance than negative shocks related to bad news of the same extent. The ARCH effect is not significant. Only, the GARCH effect is statistically significant as the p-probability is 0.0000. There is no serial correlation in the residuals of the TGARCH (1,1) model. The residuals of the TGARCH (1,1) are homoskedastic and there is no additional ARCH effect. Finally, there are negative and positive shocks that are driving the AUD/USD exchange spot rate away from normality.

By applying an EGARCH (1,1) model, the asymmetric term RESID(-1)/@SQRT(GARCH(-1), has a coefficient of -0.040, a z-statistic of -0.70 and a p-value of 0.48, which is not statistically significant at the 5 % significance level. Negative shocks do not imply a higher next period conditional variance than positive shocks of the same sign. EGARCH represented by C(5), has a z – statistic of 14.61 and it is statistically significant at the 5% significance level. The generalized error distribution, (GED) parameter has a coefficient of 1.71, a z-statistic of 9.78 and a p-value of 0.0000. There is no serial correlation in the residuals of the EGARCH (1,1) model. The residuals of the EGARCH (1,1) are homoskedastic and there is no additional ARCH effect. Finally, there are negative and positive shocks that are driving the AUD/USD exchange spot rate away from normality.

By applying a PARCH (1,1) model, the coefficient C(5) *@SQRT(GARCH(-1))^C(6), which measures the asymmetric term is statistically significant at the 5% significance level. C(5) has a coefficient of 0.76, a z-statistic of 7.07 and a p-value of 0.0000. Positive shocks related to good news implied a higher next period conditional variance than negative shocks related to bad news of the same extent. The student’s t distribution has a coefficient of 15.22, a z-statistic of 1.42 and a p-value of 0.15. The constant, the ARCH and GARCH effect are not significant.

By comparing the three models in terms of Akaike information criterion and Schwarz criterion, we have concluded the PARCH (1,1) is the best – fit model, as it has the lowest values.

References

151

Page 152: Introduction to Econometrics 1

Alexander, C., (2003), “Market Models. A Guide to Financial Data Analysis”. John Wiley and Sons Ltd. ISMA Centre. The Business School For Financial Markets.

Brooks,C..(2002), Introductory econometrics for Finance. Cambridge University Press. pp.456, 470.

Bollerslev, T.,(1986), “Generalized Autoregressive Conditional Heteroskedasticity”. Journal of Econometrics, 31, pp.307 – 327.

Bollerslev, T., Chou, R,Y., and Kroner, K.F., (1992), “ARCH Modelling in Finance: A Review of the Theory and Empirical Evidence”. Journal of Econometrics, 52, pp.5 -59.

Bollerslev, T, Engle, R.F, and Nelson, D.B., (1994), “ARCH Models”. Chapter 49 in Robert F. Engle and Daniel L.McFadden (eds.). Handbook of Econometrics, Volume 4, Amsterdam: Elsvier Science B.V.

Bollerslev, T, and Wooldridge, J,M., (1992), “Quasi – Maximum Likelihood Estimation and Inference in Dynamic Models with Time Varying Covariances”. Econometric Reviews, 11, pp.143 – 172.

Dickey, D.A and Fuller, W.A.,(1979), “Distribution of Estimators for Time Series Regressions with a Unit Root”. Journal of the American Statistical Association 74, 427-31.

Ding, Z, Granger, C.W.J and Engle, R.F., (1993), “A Long Memory Property of Stock Market Returns and a New Model”. Journal of Empirical Finance, 1, pp.83 -106.

Engle, R.F., (1982), “Autoregressive Conditional Heteroskedasticity with Estimates of the Variance of U.K. Inflation”. Econometrica, 50, pp.987 – 1008.

Engle, R.F, and Bollerslev, T., (1986), “Modelling the Persistence of Conditional Variances”. Econometric Reviews, 5, pp.1 -50.

Engle, R.F, Lilien, D.M and Robins,R.P., (1987), “Estimating Time Varying Risk Premia in the Term Structure: The ARCH-M Model”. Econometrica, 55, pp.391 - 407.

EViews 6, (2007), “ User’s Guide II ”. Quantitative micro software. pp.199-200.

Glosten, L.R.R., Jaganathan and Runkle, D., (1993), “On the Relation between the Expected Value and the Volatility of the Normal Excess Return on Stocks”. Journal of Finance, 48, pp.1779 – 1801.

Loretan,M., Mazda,A. and Subramanian,S.(2005), Indexes of the Foreign Exchange value of the Dollar. It is published in the site of the Federal Reserve Bulletin, Winter 2005.

152

Page 153: Introduction to Econometrics 1

Nelson, D.B.,(1991), “Conditional Heteroskedasticity in Asset Returns: A New Approach”. Econometrica 59(2), pp.347-70.

Schwert, W., (1989), “Stock Volatility and Crash of ‘87”. Review of Financial Studies, 3, pp.77 – 102.

Taylor, S., (1986), “Modelling Financial Time Series, New York: John Wiley and Sons.

Zakoian, J.M., (1994), “Threshold Heteroskedastic Models”. Journal of Economic Dynamics and Control, 18, pp.931 -944.

I have added a research paper to help you understand the how the Wald test restrictions are used in modeling exchange rates by applying an ARCH model.

153

Page 154: Introduction to Econometrics 1

Application of an ARCH system equations of 4 spot exchange rates in terms of CAD/USD, DKK/USD, CHF/USD and JPY/USD to check the conditional variance, covariance and correlation.

Abstract

In this article, we have tested the volatility of the natural logarithmic monthly returns of the CAD/USD, DKK/USD, CHF/USD and JPY/USD spot exchange rates. We have applied an ARCH – conditional heteroskedasticity method to model the variance, the covariance of the error terms and the correlation of the spot exchange rates. We have used three multivariate ARCH specifications in terms of the conditional constant correlation, the diagonal VECH and the diagonal BEKK. By applying the system equations 1, which represent a diagonal VECH of a covariance specification multivariate ARCH model, we have found that the log likelihood value of the system equations model is 2935.896. The average log likelihood value is 2.66. The Akaike information criterion is -21.03. The Schwarz criterion is -20.58 and the Hannan – Quinn criterion is -20.85. By applying the system equations 2, which represent a constant conditional correlation of a covariance specification multivariate ARCH model, we have found that the log likelihood value of the system equations model is 2903.890. The average log likelihood value is 2.63. The Akaike information criterion is -20.88. The Schwarz criterion is -20.59 and the Hannan – Quinn criterion is -20.77. By applying the system equations 3, which represent a diagonal BEKK of a covariance specification multivariate ARCH model, we have found that the log likelihood value of the system equations model is 2891.647. The average log likelihood value is 2.62. The Akaike information criterion is -20.84. The Schwarz criterion is -20.63 and the Hannan – Quinn criterion is -20.75. The best fit model to help the arbitrageurs to craft their investment strategy in terms of buying, selling or holding large portfolios of spot exchange rates is the system ARCH equations 1. It has the largest value of the log likelihood and average log likelihood value and the lowest error term value of the Akaike and Hannan - Quinn information criterion. The Schwarz criterion has the highest value. By applying the Wald test of the symmetry restrictions of the natural logarithmic monthly returns of the spot exchange rate returns namely the CAD/USD, DKK/USD, CHF/USD and JPY/USD, we have found that the sample evidence suggests accepting the null hypothesis. In other words, the test has rejected the symmetry restrictions of the mean coefficients. The data that we have used started from 01/01/1990 to 01/01/2013 and represent the CAD/USD, DKK/USD, CHF/USD and JPY/USD spot exchange rates, which total to 277 observations. The natural logarithmic monthly returns total to 276 observations. The data was obtained from the Federal Reserve Statistical Release Department and it is denoted by the symbol, H.10.

Keywords: System equations ARCH and GARCH terms, CAD/USD, DKK/USD, CHF/USD and JPY/USD spot exchange rate, conditional variance, covariance and correlation.

Introduction

154

Page 155: Introduction to Econometrics 1

This article has focused on modeling the volatility of the natural logarithmic monthly returns of the CAD/USD, DKK/USD, CHF/USD and JPY/USD spot exchange rates by using an ARCH system equation to check the conditional variance, covariance and correlation. We have used EViews 6 to test and validate the hypotheses that will be formulated.

System ARCH is a convenient technique to show the variance and the covariance of the error term in an autoregressive form. There are three choices of multivariate specifications such as the diagonal VECH, the diagonal BEKK and the conditional constant correlation.

The purpose of using the ARCH system equations is to help the arbitrageurs to craft their investment strategy in terms of buying, selling or holding large portfolios of spot exchange rates. This will be done by selecting the best fit model in term of the maximum value of log likelihood, and average log likelihood. In addition, we are seeking the lowest error value of the Akaike information criterion, the Schwarz criterion and the Hanna – Quinn criterion. Specifically, Arbitrageurs or rational investors would open a short position to the system of expensive currencies portfolio and simultaneously purchase the same or very similar spot exchange rate portfolio at a spot rate in another market at a lower price to hedge their risks. The effect of this selling by arbitrageurs is to bring the price of the overpriced currencies portfolio down to its fundamental value. The same principle applies for undervalued or depreciated currencies. To earn profit arbitrageurs would buy underpriced currencies and open a short position to an overpriced to hedge their risk. They prevent with this technique underpricing to be substantial or long lasting.

The rest of the paper is organized as follows. Section 1 describes the methodology and the data. Section 2 is an analysis of statistical and econometrical tests and Section 3 summarizes and concludes.

1.Methodology and data description

The methodologies that we have used are system equations of multivariate ARCH models. Autoregressive models have been studied by various researchers such as: Alexander,(2003), Brooks, (2002), Nelson, (1991), Bollerslev, (1990), Bollerslev, (1986), Bollerslev, Chou, and Kroner, (1992), Bollerslev, Engle and Nelson, (1994), Bollerslev, Jeffrey, and Wooldridge, (1992), Donald, (1991), Donald and Monahan, (1992), Engle and Kroner, (1995), Ding, Granger and Engle, (1993), Engle, (1982), Engle and Bollerslev, (1986), Engle, Lilien and Robins, (1987), Glosten, Jaganathan and Runkle, (1993), Newey and West, (1994), Schwert, (1989), Taylor, (1986) and Zakoian, (1994).

The system equations 1 represent a multivariate ARCH model. The model is a diagonal VECH of a covariance specification. The ARCH maximum likelihood is calculated through the Marquardt algorithm. The estimation command, the estimated equations, the mean equations, the variance and covariance equations of the ARCH (1) and GARCH(1) multivariate system are as follows:

Estimation Command:=====================

155

Page 156: Introduction to Econometrics 1

ARCH(DERIV=AA) @DIAGVECH C(INDEF) ARCH(1,INDEF) GARCH(1,INDEF)

Estimated Equations:=====================LNRETCAD = C(1) (1)

Where: LNRETCAD is the natural logarithmic monthly returns of the spot exchange rate CAD/USD. C(1) is the constant of the equation.

LNRETCHF = C(2) (2)

Where: LNRETCHF is the natural logarithmic monthly returns of the spot exchange rate CHF/USD. C(2) is the constant of the equation.

LNRETDKK = C(3) (3)

Where: LNRETDKK is the natural logarithmic monthly returns of the spot exchange rate DKK/USD. C(3) is the constant of the equation.

LNRETJPY = C(4) (4)

Where: LNRETJPY is the natural logarithmic monthly returns of the spot exchange rate JPY/USD. C(4) is the constant of the equation.

Substituted Coefficients (mean equations):================================LNRETCAD = -0.000198594205929 (5) LNRETCHF = -0.00403814506366 (6)

LNRETDKK = -0.00367527995757 (7)

LNRETJPY = -0.00190342907729 (8)

Variance-Covariance Representation:=====================GARCH = M + A1.*RESID(-1)*RESID(-1)' + B1.*GARCH(-1)

Variance and Covariance Equations:=====================GARCH1 = M(1,1) + A1(1,1)*RESID1(-1)^2 + B1(1,1)*GARCH1(-1) (9)

GARCH2 = M(2,2) + A1(2,2)*RESID2(-1)^2 + B1(2,2)*GARCH2(-1) (10)

GARCH3 = M(3,3) + A1(3,3)*RESID3(-1)^2 + B1(3,3)*GARCH3(-1) (11)

GARCH4 = M(4,4) + A1(4,4)*RESID4(-1)^2 + B1(4,4)*GARCH4(-1) (12)

Where: GARCH1 is the conditional variance of the CAD/USD spot exchange rate. GARCH 2 is the conditional variance of the CHF/USD spot exchange rate. GARCH 3 is the conditional variance of the DKK/USD spot exchange rate. GARCH 4 is the conditional variance of the JPY/USD spot exchange rate.

COV1_2 = M(1,2) + A1(1,2)*RESID1(-1)*RESID2(-1) + B1(1,2)*COV1_2(-1) (13)

COV1_3 = M(1,3) + A1(1,3)*RESID1(-1)*RESID3(-1) + B1(1,3)*COV1_3(-1) (14)

COV1_4 = M(1,4) + A1(1,4)*RESID1(-1)*RESID4(-1) + B1(1,4)*COV1_4(-1) (15)

COV2_3 = M(2,3) + A1(2,3)*RESID2(-1)*RESID3(-1) + B1(2,3)*COV2_3(-1) (16)

COV2_4 = M(2,4) + A1(2,4)*RESID2(-1)*RESID4(-1) + B1(2,4)*COV2_4(-1) (17)

156

Page 157: Introduction to Econometrics 1

COV3_4 = M(3,4) + A1(3,4)*RESID3(-1)*RESID4(-1) + B1(3,4)*COV3_4(-1) (18)

Where: COV 1_2 is the conditional covariance between the CAD and the CHF. COV 1_3 is the conditional covariance between the CAD and the DKK. COV 1_4 is the conditional covariance between the CAD and the JPY. COV 2_3 is the conditional covariance between the CHF and the DKK. COV 2_4 is the conditional covariance between the CHF and the JPY. COV 3_4 is the conditional covariance between the DKK and the JPY.

Substituted Coefficients:=====================GARCH1 = 1.53541081268e-05 + 0.195240705537*RESID1(-1)^2 + 0.76258984777*GARCH1(-1)

(19)

GARCH2 = 0.000186085932276 + 0.151025328609*RESID2(-1)^2 + 0.631849700201*GARCH2(-1)

(20)

GARCH3 = 0.00015731714965 + 0.201548270511*RESID3(-1)^2 + 0.600747200726*GARCH3(-1)

(21)

GARCH4 = 0.00030495614079 + 0.213685108468*RESID4(-1)^2 + 0.358389066875*GARCH4(-1)

(22)

COV1_2 = 2.8356351288e-05 + 0.0245260769584*RESID1(-1)*RESID2(-1) + 0.59715460081*COV1_2(-1)

(23)

COV1_3 = 3.2232504728e-05 + 0.0655861369301*RESID1(-1)*RESID3(-1) + 0.625219635364*COV1_3(-1)

(24)

COV1_4 = 3.48443071278e-05 + 0.14237717967*RESID1(-1)*RESID4(-1) -0.329718314306*COV1_4(-1)

(25)

COV2_3 = 0.000152706362993 + 0.161043984121*RESID2(-1)*RESID3(-1) + 0.638818634615*COV2_3(-1)(26)

COV2_4 = 9.40602772686e-05 + 0.0519183722581*RESID2(-1)*RESID4(-1) + 0.6856730049*COV2_4(-1)(27)

COV3_4 = 4.81238848873e-05 + 0.0523633483445*RESID3(-1)*RESID4(-1) + 0.781179497991*COV3_4(-1)(28)

The system equations 2 represent a multivariate ARCH model. The model is a constant conditional correlation of a covariance specification. The ARCH Maximum Likelihood is calculated through the Marquardt algorithm. The estimation command, the estimated equations, the mean equations, the variance and covariance equations of the ARCH (1) and GARCH(1) multivariate system are as follows:

Estimation Command:=====================ARCH(DERIV=AA) @CCC C ARCH(1) GARCH(1)

Estimated Equations:=====================LNRETCAD = C(1) (29)

Where: LNRETCAD is the natural logarithmic monthly returns of the spot exchange rate CAD/USD. C(1) is the constant of the equation.

157

Page 158: Introduction to Econometrics 1

LNRETCHF = C(2) (30)

Where: LNRETCHF is the natural logarithmic monthly returns of the spot exchange rate CHF/USD. C(2) is the constant of the equation.

LNRETDKK = C(3) (31)

Where: LNRETDKK is the natural logarithmic monthly returns of the spot exchange rate DKK/USD. C(3) is the constant of the equation.

LNRETJPY = C(4) (32) Where: LNRETJPY is the natural logarithmic monthly returns of the spot exchange rate JPY/USD. C(4) is the constant of the equation.

Substituted Coefficients: (mean equations):=================================LNRETCAD = -7.39889179944e-05 (33)

LNRETCHF = -0.00224734526905 (34)

LNRETDKK = -0.00142461374593 (35)

LNRETJPY = -0.00151368914384 (36)

Variance and Covariance Representations:=====================GARCH(i) = M(i) + A1(i)*RESID(i)(-1)^2 + B1(i)*GARCH(i)(-1)

COV(i,j) = R(i,j)*@SQRT(GARCH(i)*GARCH(j))

Variance and Covariance Equations:=====================GARCH1 = C(5) + C(6)*RESID1(-1)^2 + C(7)*GARCH1(-1) (37)

GARCH2 = C(8) + C(9)*RESID2(-1)^2 + C(10)*GARCH2(-1) (38)

GARCH3 = C(11) + C(12)*RESID3(-1)^2 + C(13)*GARCH3(-1) (39)

GARCH4 = C(14) + C(15)*RESID4(-1)^2 + C(16)*GARCH4(-1) (40)

Where: GARCH1 is the conditional variance of the CAD/USD spot exchange rate. GARCH 2 is the conditional variance of the CHF/USD spot exchange rate. GARCH 3 is the conditional variance of the DKK/USD spot exchange rate. GARCH 4 is the conditional variance of the JPY/USD spot exchange rate.

COV1_2 = C(17)*@SQRT(GARCH1*GARCH2) (41)

COV1_3 = C(18)*@SQRT(GARCH1*GARCH3) (42)

COV1_4 = C(19)*@SQRT(GARCH1*GARCH4) (43)

COV2_3 = C(20)*@SQRT(GARCH2*GARCH3) (44)

COV2_4 = C(21)*@SQRT(GARCH2*GARCH4) (45)

COV3_4 = C(22)*@SQRT(GARCH3*GARCH4) (46)

Where: COV 1_2 is the conditional covariance between the CAD and the CHF. COV 1_3 is the conditional covariance between the CAD and the DKK. COV 1_4 is the conditional covariance between the CAD and the JPY. COV 2_3 is the conditional covariance between the CHF and the DKK. COV 2_4 is the conditional covariance between the CHF and the JPY. COV 3_4 is the conditional covariance between the DKK and the JPY.

158

Page 159: Introduction to Econometrics 1

Substituted Coefficients:=====================GARCH1 = 1.32739414729e-05 + 0.202995283141*RESID1(-1)^2 + 0.766911037193*GARCH1(-1)

(47)

GARCH2 = 0.000277076071496 + 0.224475673303*RESID2(-1)^2 + 0.457997084165*GARCH2(-1)

(48)

GARCH3 = 0.000128874245053 + 0.216475150011*RESID3(-1)^2 + 0.610379514469*GARCH3(-1)

(49)

GARCH4 = 0.000527343855433 + 0.257042477932*RESID4(-1)^2 + 0.000274019412797*GARCH4(-1)

(50)

COV1_2 = 0.241402906757*@SQRT(GARCH1*GARCH2)

(51)

COV1_3 = 0.34927281624*@SQRT(GARCH1*GARCH3)

(52)

COV1_4 = 0.0360814058414*@SQRT(GARCH1*GARCH4)

(53)

COV2_3 = 0.908952800855*@SQRT(GARCH2*GARCH3)

(54)

COV2_4 = 0.474855564027*@SQRT(GARCH2*GARCH4)

(55)

COV3_4 = 0.397917855463*@SQRT(GARCH3*GARCH4)

(56)

The system equations 3 represent a multivariate ARCH model. The model is a diagonal BEKK of a covariance specification. The ARCH Maximum Likelihood is calculated through the Marquardt algorithm. The estimation command, the estimated equations, the mean equations, the variance and covariance equations of the ARCH (1) and GARCH(1) multivariate system are as follows:

Estimation Command:=====================ARCH(DERIV=AA) @DIAGBEKK C(DIAG) ARCH(1,DIAG) GARCH(1,DIAG)

Estimated Equations:=====================LNRETCAD = C(1) (57)

LNRETCHF = C(2) (58)

LNRETDKK = C(3) (59)

LNRETJPY = C(4) (60)

159

Page 160: Introduction to Econometrics 1

Substituted Coefficients:=====================LNRETCAD = 0.000598156398879 (61)

LNRETCHF = -0.00109184700772 (62)

LNRETDKK = -0.000926360627345 (63)

LNRETJPY = -0.00182102510883 (64)

Variance-Covariance Representation:=====================GARCH = M + A1*RESID(-1)*RESID(-1)'*A1 + B1*GARCH(-1)*B1

Variance and Covariance Equations:=====================GARCH1 = M(1,1) + A1(1,1)^2*RESID1(-1)^2 + B1(1,1)^2*GARCH1(-1) (65)

GARCH2 = M(2,2) + A1(2,2)^2*RESID2(-1)^2 + B1(2,2)^2*GARCH2(-1) (66)

GARCH3 = M(3,3) + A1(3,3)^2*RESID3(-1)^2 + B1(3,3)^2*GARCH3(-1) (67)

GARCH4 = M(4,4) + A1(4,4)^2*RESID4(-1)^2 + B1(4,4)^2*GARCH4(-1) (68)

Where: GARCH1 is the conditional variance of the CAD/USD spot exchange rate. GARCH 2 is the conditional variance of the CHF/USD spot exchange rate. GARCH 3 is the conditional variance of the DKK/USD spot exchange rate. GARCH 4 is the conditional variance of the JPY/USD spot exchange rate.

COV1_2 = A1(1,1)*A1(2,2)*RESID1(-1)*RESID2(-1) + B1(1,1)*B1(2,2)*COV1_2(-1) (69)

COV1_3 = A1(1,1)*A1(3,3)*RESID1(-1)*RESID3(-1) + B1(1,1)*B1(3,3)*COV1_3(-1) (70)

COV1_4 = A1(1,1)*A1(4,4)*RESID1(-1)*RESID4(-1) + B1(1,1)*B1(4,4)*COV1_4(-1) (71)

COV2_3 = A1(2,2)*A1(3,3)*RESID2(-1)*RESID3(-1) + B1(2,2)*B1(3,3)*COV2_3(-1) (72) COV2_4 = A1(2,2)*A1(4,4)*RESID2(-1)*RESID4(-1) + B1(2,2)*B1(4,4)*COV2_4(-1) (73)

COV3_4 = A1(3,3)*A1(4,4)*RESID3(-1)*RESID4(-1) + B1(3,3)*B1(4,4)*COV3_4(-1) (74)

Where: COV 1_2 is the conditional covariance between the CAD and the CHF. COV 1_3 is the conditional covariance between the CAD and the DKK. COV 1_4 is the conditional covariance between the CAD and the JPY. COV 2_3 is the conditional covariance between the CHF and the DKK. COV 2_4 is the conditional covariance between the CHF and the JPY. COV 3_4 is the conditional covariance between the DKK and the JPY.

Substituted Coefficients:=====================GARCH1 = 7.48545113012e-06+0.0887305025778*RESID1(-1)^2+0.892558809051*GARCH1(-1)

(75)

GARCH2 = -3.04030767555e-06+0.0830928567535*RESID2(-1)^2+0.927099590292*GARCH2(-1)

(76)

GARCH3 = 3.92100107163e-06+0.101678776401*RESID3(-1)^2+0.901287145764*GARCH3(-1)

(77)

GARCH4 = 1.74128289923e-05+0.0501735508656*RESID4(-1)^2+0.926063342157*GARCH4(-1)

(78)

160

Page 161: Introduction to Econometrics 1

COV1_2 = 0.0858654234274*RESID1(-1)*RESID2(-1) + 0.909665271505*COV1_2(-1)

(79)

COV1_3 = 0.0949842562297*RESID1(-1)*RESID3(-1) + 0.896912359953*COV1_3(-1)

(80)

COV1_4 = 0.0667227426326*RESID1(-1)*RESID4(-1) + 0.909156748741*COV1_4(-1)

(81)

COV2_3 = 0.0919172454022*RESID2(-1)*RESID3(-1) + 0.914102261005*COV2_3(-1)

(82)

COV2_4 = 0.0645682869131*RESID2(-1)*RESID4(-1) + 0.926581321363*COV2_4(-1)

(83)

COV3_4 = 0.0714253824609*RESID3(-1)*RESID4(-1) + 0.913591257866*COV3_4(-1)

(84)

The data that we have used are logarithmic monthly returns starting from 01/02/1990 to 01/01/2013 of the CAD/USD, DKK/USD, CHF/USD and JPY/USD spot rate, which total to 276 observations. The data was obtained from the Federal Reserve Statistical Release Department and it is denoted by the symbol, H.10.

The logarithmic formula that we have used is:

(85)

Where: Rt is the monthly return for month t, Pt is the closing price for month t, and Pt-1 is the previous closing price for month t-1.

2. Statistical and econometrical tests

Table 1 shows the system equations 1, which represent a multivariate ARCH model. The model is a diagonal VECH of a covariance specification. The ARCH Maximum Likelihood is calculated through the Marquardt algorithm. The data that we have used are monthly returns starting from 01/01/1990 to 01/01/2013, which total to 277 observations. The natural logarithmic monthly returns total to 276 observations. The data was obtained from the Federal Reserve Statistical Release Department and the symbol of the series is H.10.

System: SYS01Estimation Method: ARCH Maximum Likelihood (Marquardt)Covariance specification: Diagonal VECHDate: 11/26/13 Time: 13:43Sample: 1990M02 2013M01Included observations: 276Total system (balanced) observations 1104Presample covariance: backcast (parameter =0.7)Convergence achieved after 56 iterations

161

Page 162: Introduction to Econometrics 1

Coefficient Std. Error z-Statistic Prob.  

C(1) -0.000199 0.000932 -0.213189 0.8312C(2) -0.004038 0.001682 -2.400459 0.0164C(3) -0.003675 0.001527 -2.406897 0.0161C(4) -0.001903 0.001797 -1.059111 0.2895

Variance Equation Coefficients

C(5) 1.54E-05 9.36E-06 1.641131 0.1008C(6) 2.84E-05 4.65E-05 0.610380 0.5416C(7) 3.22E-05 2.91E-05 1.107402 0.2681C(8) 3.48E-05 3.33E-05 1.045578 0.2958C(9) 0.000186 5.61E-05 3.316210 0.0009

C(10) 0.000153 3.91E-05 3.905382 0.0001C(11) 9.41E-05 7.99E-05 1.176990 0.2392C(12) 0.000157 3.87E-05 4.066967 0.0000C(13) 4.81E-05 3.83E-05 1.254928 0.2095C(14) 0.000305 0.000123 2.472924 0.0134C(15) 0.195241 0.063509 3.074200 0.0021C(16) 0.024526 0.051215 0.478884 0.6320C(17) 0.065586 0.051115 1.283121 0.1994C(18) 0.142377 0.088549 1.607897 0.1079C(19) 0.151025 0.038722 3.900238 0.0001C(20) 0.161044 0.035964 4.477900 0.0000C(21) 0.051918 0.044951 1.154997 0.2481C(22) 0.201548 0.038235 5.271357 0.0000C(23) 0.052363 0.039564 1.323505 0.1857C(24) 0.213685 0.073793 2.895740 0.0038C(25) 0.762590 0.076663 9.947267 0.0000C(26) 0.597155 0.645110 0.925663 0.3546C(27) 0.625220 0.298011 2.097972 0.0359C(28) -0.329718 0.366833 -0.898824 0.3687C(29) 0.631850 0.079720 7.925883 0.0000C(30) 0.638819 0.061119 10.45212 0.0000C(31) 0.685673 0.242232 2.830641 0.0046C(32) 0.600747 0.058177 10.32617 0.0000C(33) 0.781179 0.152202 5.132520 0.0000C(34) 0.358389 0.195785 1.830520 0.0672

Log likelihood 2935.896Schwarz criterion -20.58224Avg. log likelihood 2.659327Hannan-Quinn criter. -20.84927Akaike info criterion -21.02824

Covariance specification: Diagonal VECHGARCH = M + A1.*RESID(-1)*RESID(-1)' + B1.*GARCH(-1)M is an indefinite matrixA1 is an indefinite matrixB1 is an indefinite matrix

Transformed Variance Coefficients

Coefficient Std. Error z-Statistic Prob.  

M(1,1) 1.54E-05 9.36E-06 1.641131 0.1008M(1,2) 2.84E-05 4.65E-05 0.610380 0.5416M(1,3) 3.22E-05 2.91E-05 1.107402 0.2681M(1,4) 3.48E-05 3.33E-05 1.045578 0.2958M(2,2) 0.000186 5.61E-05 3.316210 0.0009M(2,3) 0.000153 3.91E-05 3.905382 0.0001M(2,4) 9.41E-05 7.99E-05 1.176990 0.2392

162

Page 163: Introduction to Econometrics 1

M(3,3) 0.000157 3.87E-05 4.066967 0.0000M(3,4) 4.81E-05 3.83E-05 1.254928 0.2095M(4,4) 0.000305 0.000123 2.472924 0.0134A1(1,1) 0.195241 0.063509 3.074200 0.0021A1(1,2) 0.024526 0.051215 0.478884 0.6320A1(1,3) 0.065586 0.051115 1.283121 0.1994A1(1,4) 0.142377 0.088549 1.607897 0.1079A1(2,2) 0.151025 0.038722 3.900238 0.0001A1(2,3) 0.161044 0.035964 4.477900 0.0000A1(2,4) 0.051918 0.044951 1.154997 0.2481A1(3,3) 0.201548 0.038235 5.271357 0.0000A1(3,4) 0.052363 0.039564 1.323505 0.1857A1(4,4) 0.213685 0.073793 2.895740 0.0038B1(1,1) 0.762590 0.076663 9.947267 0.0000B1(1,2) 0.597155 0.645110 0.925663 0.3546B1(1,3) 0.625220 0.298011 2.097972 0.0359B1(1,4) -0.329718 0.366833 -0.898824 0.3687B1(2,2) 0.631850 0.079720 7.925883 0.0000B1(2,3) 0.638819 0.061119 10.45212 0.0000B1(2,4) 0.685673 0.242232 2.830641 0.0046B1(3,3) 0.600747 0.058177 10.32617 0.0000B1(3,4) 0.781179 0.152202 5.132520 0.0000B1(4,4) 0.358389 0.195785 1.830520 0.0672

Source: Author’s calculation based on EViews 6 software.Significant at the 5% significance level.

According to Table 1, the first four coefficients listed in the upper section, namely, C(1), C(2), C(3) and C(4) represent the mean equation. Specifically, the coefficient C(1) represents the CAD/USD spot exchange rate. It has a coefficient of -0.000199 and a p-value of 0.83. It is not significant as it is above the 5% significance level. The coefficient C(2) and C(3), which shows the CHF/USD and DKK/USD respectively are statistically significant at the 5% significance level. Finally, the C(4), which displays the JPY/USD is not statistically significant. Then, the table displays the variance coefficients. C(5) to C(14) are the coefficients for the constant matrix. The coefficients C(9), C(10), C(12) and C(14) show significant p-values. C(15) to C(24) are the coefficients for the ARCH term. C(15), C(19), C(20), C(22) and C(24) show a significant p-values of 0.0021, 0.0001, 0.0000, 0.0000, and 0.0038. C(25) to C(34) are the coefficients for the GARCH term. The coefficients C(25), C(27), C(29), C(30), C(31), C(32), C(33) display significant p-values at the 5% significance level. The log likelihood value of the system equations model is 2935.896. The average log likelihood value is 2.66. The Akaike information criterion is -21.03. The Schwarz criterion is -20.58 and the Hannan – Quinn criterion is -20.85.

The next part of the table 1 shows the diagonal VECH covariance model. GARCH represents the conditional variance matrix. M is the constant matrix coefficient. A1 is the coefficient matrix for the ARCH term and B1 is the coefficient matrix for the GARCH term. Equations (19) to (22) in the methodological section show the GARCH equations, which represent the conditional variance equations. For example, equation (19) is as follows:

GARCH1 = 1.53541081268e-05 + 0.195240705537*RESID1(-1)^2 + 0.76258984777*GARCH1(-1) (1.64) (3.07) (9.95) (0.10) (0.00) (0.00)Where: GARCH1 is the conditional variance equation of the CAD/USD spot exchange rate.

163

Page 164: Introduction to Econometrics 1

1.5354 represents M(1,1), which is the constant matrix coefficient. 0.1952 represents A1(1,1) which is the coefficient matrix for the ARCH term. 0.7626 represents B1(1,1) which is the coefficient matrix for the GARCH term.

The first parenthesis represents the z-statistic and the second parenthesis displays the p-value. The constant matrix coefficient is not significant. Both the coefficient matrix for the ARCH and GARCH term are significant as the p-values are below the 5% significance level. The conditional variance equations for GARCH 2, 3 and 4 have significant values for the coefficient matrices.

Equations (23) to (28) in the methodological section show the covariance equations, which represent the conditional covariance equations. The first parenthesis represents the z-statistic and the second parenthesis displays the p-value. The constant matrix coefficient, the coefficient matrix for the ARCH and GARCH term are not significant as the p-values are above the 5% significance level. For example, equation (23) is as follows:

COV1_2 = 2.8356351288e-05 + 0.0245260769584*RESID1(-1)*RESID2(-1) + 0.59715460081*COV1_2(-1) (0.61) (0.48) (0.93) (0.54) (0.63) (0.35)

Where: COV1_2 is the conditional covariance equation of the CAD/CHF spot exchange rate. 2.8356 represents M(1,2), which is the constant matrix coefficient. 0.0245 represents A1(1,2) which is the coefficient matrix for the ARCH term. 0.5972 represents B1(1,2) which is the coefficient matrix for the GARCH term.

The conditional covariance equation COV2_3, which represents the conditional covariance between the CHF and the DKK spot exchange rate displays significant values of the coefficient matrices. The COV1_4, which is the covariance of CAD/JPY is not significant. COV1_3, COV2_4, COV3_4, which represent CAD/DKK, CHF/JPY, and DKK/JPY display only a significant GARCH term of 0.04, 0.005 and 0.0000 respectively.

Table 2 shows the Wald test of the symmetry restrictions of the logarithmic monthly returns starting from 01/02/1990 to 01/01/2013. The data covers the spot exchange rate returns namely the CAD/USD, DKK/USD, CHF/USD and JPY/USD, which total to 276 observations. The data was obtained from the Federal Reserve Statistical Release Department and it is denoted by the symbol, H.10.

Wald Test:System: SYS01

Test Statistic Value   df     Probability

Chi-square 6.205460 4   0.1843

Null Hypothesis Summary:

Normalized Restriction (= 0) Value   Std. Err.

C(1) -0.000199 0.000932C(2) -0.004038 0.001682C(3) -0.003675 0.001527

164

Page 165: Introduction to Econometrics 1

C(4) -0.001903 0.001797

Restrictions are linear in coefficients.Source: Author’s calculation based on EViews 6 software.Significant at the 5% significance level.

The hypotheses that have been formulated and tested are as follows:

H0: The mean coefficients C(1)=C(2)=C(3)=C(4)= 0.

H1: The mean coefficients C(1)=C(2)=C(3)=C(4) ≠ 0.

According to Table 2, the probability of the Chi-square statistic is 6.21 and the p-value is 0.18. The sample evidence suggests to accept the null hypothesis. In other words, the test has rejected the symmetry restrictions.

Table 3 shows the system equations 2, which represent a multivariate ARCH model. The model is a constant conditional correlation of a covariance specification. The ARCH Maximum Likelihood is calculated through the Marquardt algorithm. The data that we have used are monthly returns starting from 01/01/1990 to 01/01/2013, which total to 277 observations. The natural logarithmic monthly returns total to 276 observations. The data was obtained from the Federal Reserve Statistical Release Department and the symbol of the series is H.10.

System: SYS02Estimation Method: ARCH Maximum Likelihood (Marquardt)Covariance specification: Constant Conditional CorrelationDate: 11/26/13 Time: 17:58Sample: 1990M02 2013M01Included observations: 276Total system (balanced) observations 1104Presample covariance: backcast (parameter =0.7)Convergence achieved after 62 iterations

Coefficient Std. Error z-Statistic Prob.  

C(1) -7.40E-05 0.000929 -0.079675 0.9365C(2) -0.002247 0.001559 -1.441825 0.1494C(3) -0.001425 0.001365 -1.043417 0.2968C(4) -0.001514 0.001687 -0.897015 0.3697

165

Page 166: Introduction to Econometrics 1

Variance Equation Coefficients

C(5) 1.33E-05 8.06E-06 1.647688 0.0994C(6) 0.202995 0.066791 3.039282 0.0024C(7) 0.766911 0.072270 10.61175 0.0000C(8) 0.000277 0.000105 2.631233 0.0085C(9) 0.224476 0.045657 4.916603 0.0000

C(10) 0.457997 0.144120 3.177881 0.0015C(11) 0.000129 4.49E-05 2.867505 0.0041C(12) 0.216475 0.050956 4.248249 0.0000C(13) 0.610380 0.080099 7.620348 0.0000C(14) 0.000527 0.000193 2.737024 0.0062C(15) 0.257042 0.088192 2.914578 0.0036C(16) 0.000274 0.261945 0.001046 0.9992C(17) 0.241403 0.062748 3.847151 0.0001C(18) 0.349273 0.058583 5.962018 0.0000C(19) 0.036081 0.059243 0.609045 0.5425C(20) 0.908953 0.009870 92.09174 0.0000C(21) 0.474856 0.043462 10.92576 0.0000C(22) 0.397918 0.049386 8.057360 0.0000

Log likelihood 2903.890Schwarz criterion -20.59468Avg. log likelihood 2.630335Hannan-Quinn criter. -20.76746Akaike info criterion -20.88326

Covariance specification: Constant Conditional CorrelationGARCH(i) = M(i) + A1(i)*RESID(i)(-1)^2 + B1(i)*GARCH(i)(-1)COV(i,j) = R(i,j)*@SQRT(GARCH(i)*GARCH(j))

Transformed Variance Coefficients

Coefficient Std. Error z-Statistic Prob.  

M(1) 1.33E-05 8.06E-06 1.647688 0.0994A1(1) 0.202995 0.066791 3.039282 0.0024B1(1) 0.766911 0.072270 10.61175 0.0000M(2) 0.000277 0.000105 2.631233 0.0085A1(2) 0.224476 0.045657 4.916603 0.0000B1(2) 0.457997 0.144120 3.177881 0.0015M(3) 0.000129 4.49E-05 2.867505 0.0041A1(3) 0.216475 0.050956 4.248249 0.0000B1(3) 0.610380 0.080099 7.620348 0.0000M(4) 0.000527 0.000193 2.737024 0.0062A1(4) 0.257042 0.088192 2.914578 0.0036B1(4) 0.000274 0.261945 0.001046 0.9992R(1,2) 0.241403 0.062748 3.847151 0.0001R(1,3) 0.349273 0.058583 5.962018 0.0000R(1,4) 0.036081 0.059243 0.609045 0.5425R(2,3) 0.908953 0.009870 92.09174 0.0000R(2,4) 0.474856 0.043462 10.92576 0.0000R(3,4) 0.397918 0.049386 8.057360 0.0000

Source: Author’s calculation based on EViews 6 software.

166

Page 167: Introduction to Econometrics 1

Significant at the 5% significance level.

According to Table 3, the first four coefficients listed in the upper section, namely, C(1), C(2), C(3) and C(4) represent the mean equation. Specifically, the coefficient C(1) represents the CAD/USD spot exchange rate. It has a coefficient of -7.40E-05 and a p-value of 0.94. It is not significant as it is above the 5% significance level. The coefficient C(2) and C(3), which shows the CHF/USD and DKK/USD respectively are not statistically significant at the 5% significance level as the p-value is 0.15 and 0.30 respectively. Finally the C(4), which displays the JPY/USD is not statistically significant. Then, the table displays the variance coefficients. C(5) to C(10) are the coefficients for the constant matrix. The coefficients C(6), C(7), C(8), C(9) and C(10) show significant p-values of 0.00, 0.00, 0.01, 0.00, 0.00 respectively. C(11) to C(16) are the coefficients for the ARCH term. C(11), C(12), C(13), C(14) and C(15) show a statistically significant p-values. C(17) to C(22) are the coefficients for the GARCH term. The coefficients C(17), C(18), C(20), C(21), and C(22), display significant p-values at the 5% significance level. The log likelihood value of the system equations model is 2903.890. The average log likelihood value is 2.63. The Akaike information criterion is -20.88. The Schwarz criterion is -20.59 and the Hannan – Quinn criterion is -20.77.

The next part of the table 1 shows the constant conditional correlation of the covariance model. GARCH represents the conditional variance matrix. M is the constant matrix coefficient. A1 is the coefficient matrix for the ARCH term and B1 is the coefficient matrix for the GARCH term. Equations (47) to (50) in the methodological section show the GARCH equations, which represent the conditional variance equations. For example, equation (47) is as follows:

GARCH1 = 1.32739414729e-05 + 0.202995283141*RESID1(-1)^2 + 0.766911037193*GARCH1(-1) (1.65) (3.04) (10.61) (0.099) (0.002) (0.000)Where: GARCH1 is the conditional variance equation of the CAD/USD spot exchange rate. 1.32739 represents M(1), which is the constant matrix coefficient. 0.20299 represents A1(1) which is the coefficient matrix for the ARCH term. 0.76691 represents B1(1) which is the coefficient matrix for the GARCH term.

The first parenthesis represents the z-statistic and the second parenthesis displays the p-value. The constant matrix coefficient and the coefficient matrix for the ARCH and GARCH term are significant as the p-values are below the 5% significance level. The conditional variance equations for GARCH 2, 3 and 4 have significant values for the coefficient matrices except the coefficient matrix of the GARCH term B1(4).

Equations (51) to (56) in the methodological section show the covariance equations, which represent the conditional covariance equations. The first parenthesis represents the z-statistic and the second parenthesis displays the p-value. The output displays that the correlation R(1,2) is 0.24. It is significant as the z-statistic is 3.85 and the p-value is 0.00 and below the 5% significance level. For example, equation (51) is as follows:

COV1_2 = 0.241402906757*@SQRT(GARCH1*GARCH2)

(3.85) (0.00)

Where: COV1_2 is the conditional covariance equation of the CAD/CHF spot exchange rate.

167

Page 168: Introduction to Econometrics 1

R(1,2) is 0.24 and shows the output of the constant conditional correlation between the two currencies.

The conditional covariance equation COV1_3, which represents the conditional covariance between the CAD and the DKK spot exchange rate, displays significant values of the correlation coefficient. The correlation R(1,3) is 0.35. It has a z-statistic of 5.96 and a p-value of 0.00. The COV1_4, which is the covariance of CAD/JPY is not significant. The correlation R(1,4) is 0.04. The z-statistic is 0.61 and the p-value is 0.54. The COV2_3, COV2_4, and COV3_4, which represent CHF/DKK, CHF/JPY, and DKK/JPY all statistically significant. The correlation R(2,3) is 0.91. It has a z-statistic of 92.09 and a p-value of 0.00. The correlation R(2,4) is 0.47. It has a z-statistic of 10.93 and a p-value of 0.00. Finally, the correlation R(3,4) is 0.40. It has a z-statistic of 8.06 and a p-value of 0.00.

Table 4 shows the Wald test of the symmetry restrictions of the logarithmic monthly returns starting from 01/02/1990 to 01/01/2013. The data covers the spot exchange rate returns namely the CAD/USD, DKK/USD, CHF/USD and JPY/USD, which total to 276 observations. The data was obtained from the Federal Reserve Statistical Release Department and it is denoted by the symbol, H.10.

Wald Test:System: SYS02

Test Statistic Value   df     Probability

Chi-square 2.293821 4   0.6819

Null Hypothesis Summary:

Normalized Restriction (= 0) Value   Std. Err.

C(1) -7.40E-05 0.000929C(2) -0.002247 0.001559C(3) -0.001425 0.001365C(4) -0.001514 0.001687

168

Page 169: Introduction to Econometrics 1

Restrictions are linear in coefficients.Source: Author’s calculation based on EViews 6 software.Significant at the 5% significance level.

The hypotheses that have been formulated and tested are as follows:

H0: The mean coefficients C(1)= C(2) =C(3)=C(4)= 0.

H1: The mean coefficients C(1)= C(2)=C(3)=C(4) ≠ 0.

According to Table 4, the probability of the Chi-square statistic is 2.29 and the p-value is 0.68. The sample evidence suggests to accept the null hypothesis. In other words, the test failed to reject the symmetry restrictions.

Table 5 shows the system equations 3, which represent a multivariate ARCH model. The model is a diagonal BEKK of a covariance specification. The ARCH Maximum Likelihood is calculated through the Marquardt algorithm. The data that we have used are monthly returns starting from 01/01/1990 to 01/01/2013, which total to 277 observations. The natural logarithmic monthly returns total to 276 observations. The data was obtained from the Federal Reserve Statistical Release Department and the symbol of the series is H.10.

System: SYS03Estimation Method: ARCH Maximum Likelihood (Marquardt)Covariance specification: Diagonal BEKKDate: 11/26/13 Time: 18:02Sample: 1990M02 2013M01Included observations: 276Total system (balanced) observations 1104Presample covariance: backcast (parameter =0.7)Convergence achieved after 93 iterations

Coefficient Std. Error z-Statistic Prob.  

C(1) 0.000598 0.000958 0.624331 0.5324C(2) -0.001092 0.001120 -0.974649 0.3297C(3) -0.000926 0.000957 -0.967527 0.3333C(4) -0.001821 0.001452 -1.254016 0.2098

Variance Equation Coefficients

C(5) 7.49E-06 4.36E-06 1.717914 0.0858C(6) -3.04E-06 2.23E-06 -1.365042 0.1722C(7) 3.92E-06 2.23E-06 1.755561 0.0792C(8) 1.74E-05 7.70E-06 2.262118 0.0237

169

Page 170: Introduction to Econometrics 1

C(9) 0.297877 0.045174 6.593932 0.0000C(10) 0.288258 0.026328 10.94880 0.0000C(11) 0.318871 0.026856 11.87347 0.0000C(12) 0.223995 0.038227 5.859642 0.0000C(13) 0.944753 0.018936 49.89229 0.0000C(14) 0.962860 0.006985 137.8454 0.0000C(15) 0.949361 0.007883 120.4268 0.0000C(16) 0.962322 0.011963 80.43952 0.0000

Log likelihood 2891.647Schwarz criterion -20.62814Avg. log likelihood 2.619245Hannan-Quinn criter. -20.75380Akaike info criterion -20.83802

Source: Author’s calculation based on EViews 6 software.Significant at the 5% significance level.

According to Table 5, the first four coefficients listed in the upper section, namely, C(1), C(2), C(3) and C(4) represent the mean equation. Specifically, the coefficient C(1) represents the CAD/USD spot exchange rate. It has a coefficient of 0.001 and a p-value of 0.53. It is not significant as the p-value is above the 5% significance level. The coefficient C(2), C(3), and C(4) which shows the CHF/USD, DKK/USD, and JPY/USD respectively are not statistically significant at the 5% significance level. Then, the table displays the variance coefficients. C(5) to C(8) are the coefficients for the constant matrix. The coefficients C(5), C(6), and C(7) are not statistically significant. C(9) to C(12) are the coefficients for the ARCH term. They are all significant. C(13) to C(16) are the coefficients for the GARCH term. The coefficients C(13), C(14), C(15), and C(16) are statistically significant at the 5% significance level. The log likelihood value of the system equations model is 2891.647. The average log likelihood value is 2.62. The Akaike information criterion is -20.84. The Schwarz criterion is -20.63 and the Hannan – Quinn criterion is -20.75.

GARCH represents the conditional variance matrix of the diagonal BEKK covariance model. M is the constant matrix coefficient. A1 is the coefficient matrix for the ARCH term and B1 is the coefficient matrix for the GARCH term. Equations (75) to (78) in the methodological section show the GARCH equations, which represent the conditional variance equations. For example, equation (75) is as follows:

GARCH1 = 7.48545113012e-06+0.0887305025778*RESID1(-1)^2+0.892558809051*GARCH1(-1) (1.72) (6.59) (49.89) (0.09) (0.00) (0.00)Where: GARCH1 is the conditional variance equation of the CAD/USD spot exchange rate. 7.4855 represents M(1,1), which is the constant matrix coefficient. 0.0887 represents A1(1,1) which is the coefficient matrix for the ARCH term. 0.8926 represents B1(1,1) which is the coefficient matrix for the GARCH term.

The first parenthesis represents the z-statistic and the second parenthesis displays the p-value. The constant matrix coefficient is not significant. Both the coefficient matrix for the ARCH and GARCH term are significant as the p-values are below the 5% significance level. Similar results show the conditional variance equations for GARCH 2, 3 and 4.

Equations (79) to (84) in the methodological section show the covariance equations, which represent the conditional covariance equations. For example, equation (79 -84) are as follows:

170

Page 171: Introduction to Econometrics 1

COV1_2 = 0.0858654234274*RESID1(-1)*RESID2(-1) + 0.909665271505*COV1_2(-1)

COV1_3 = 0.0949842562297*RESID1(-1)*RESID3(-1) + 0.896912359953*COV1_3(-1)

COV1_4 = 0.0667227426326*RESID1(-1)*RESID4(-1) + 0.909156748741*COV1_4(-1)

COV2_3 = 0.0919172454022*RESID2(-1)*RESID3(-1) + 0.914102261005*COV2_3(-1)

COV2_4 = 0.0645682869131*RESID2(-1)*RESID4(-1) + 0.926581321363*COV2_4(-1)

COV3_4 = 0.0714253824609*RESID3(-1)*RESID4(-1) + 0.913591257866*COV3_4(-1)

The conditional covariance equation COV2_3, which represents the conditional covariance between the CHF and the DKK spot exchange rate displays significant values of coefficient matrices. The COV1_4, which is the covariance of CAD/JPY is not significant. COV1_3, COV2_4, COV3_4, which represent CAD/DKK, CHF/JPY, and DKK/JPY display significant ARCH and GARCH terms.

Table 6 shows the Wald test of the symmetry restrictions of the logarithmic monthly returns starting from 01/02/1990 to 01/01/2013. The data covers the spot exchange rate returns namely the CAD/USD, DKK/USD, CHF/USD and JPY/USD, which total to 276 observations. The data was obtained from the Federal Reserve Statistical Release Department and it is denoted by the symbol, H.10.

Wald Test:System: SYS03

Test Statistic Value   df     Probability

Chi-square 2.559000 4   0.6341

Null Hypothesis Summary:

Normalized Restriction (= 0) Value   Std. Err.

C(1) 0.000598 0.000958C(2) -0.001092 0.001120C(3) -0.000926 0.000957C(4) -0.001821 0.001452

Restrictions are linear in coefficients.Source: Author’s calculation based on EViews 6 software.Significant at the 5% significance level.

The hypotheses that have been formulated and tested are as follows:

H0: The mean coefficients C(1) = C(2) =C(3)=C(4)= 0

H1: The mean coefficients C(1) = C(2)=C(3)=C(4) ≠ 0

171

Page 172: Introduction to Econometrics 1

According to Table 6, the probability of the Chi-square statistic is 2.56 and the p-value is 0.63. The sample evidence suggests accepting the null hypothesis. In other words, the test has rejected the symmetry restrictions.

3. Summarizes and Concludes

In this article, we have tested the volatility of the natural logarithmic monthly returns of the CAD/USD, DKK/USD, CHF/USD and JPY/USD spot exchange rates. We have applied an ARCH – conditional heteroskedasticity method to model the variance, the covariance of the error terms and the correlation of the spot exchange rates. We have used three multivariate ARCH specifications in terms of the conditional constant correlation, the diagonal VECH and the diagonal BEKK. The data that we have used started from 01/01/1990 to 01/01/2013 and represent the CAD/USD, DKK/USD, CHF/USD and JPY/USD spot exchange rates, which total to 277 observations. The natural logarithmic monthly returns total to 276 observations. The data was obtained from the Federal Reserve Statistical Release Department and it is denoted by the symbol, H.10.

Fist of all, we have applied the system equations 1, which represent a diagonal VECH of a covariance specification multivariate ARCH model. We have found that the coefficient C(1), which represents the mean equations of the CAD/USD spot exchange rate has a coefficient of -0.000199 and a p-value of 0.83. It is not significant as it is above the 5% significance level. The mean coefficients C(2) and C(3), which shows the CHF/USD and DKK/USD respectively are statistically significant at the 5% significance level. The mean coefficient C(4), which displays the JPY/USD is not statistically significant. The variance coefficients C(5) to C(14) are the coefficients for the constant matrix. The coefficients C(9), C(10), C(12) and C(14) show significant p-values. C(15) to C(24) are the coefficients for the ARCH term. C(15), C(19), C(20), C(22) and C(24) show a significant p-values of 0.0021, 0.0001, 0.0000, 0.0000, and 0.0038. C(25) to C(34) are the coefficients for the GARCH term. The coefficients C(25), C(27), C(29), C(30), C(31), C(32), C(33) display significant p-values at the 5% significance level. The log likelihood value of the system equations model is 2935.896. The average log likelihood value is 2.66. The Akaike information criterion is -21.03. The Schwarz criterion is -20.58 and the Hannan – Quinn criterion is -20.85.

In addition, we have found that GARCH represents the conditional variance matrix. The conditional variance equations for GARCH 2, 3 and 4 have significant values for the coefficient matrices. The conditional covariance equation COV2_3, which represents the conditional covariance between the CHF and the DKK spot exchange rate displays significant values of the coefficient matrices. The COV1_4, which is the covariance of CAD/JPY is not significant. COV1_3, COV2_4, COV3_4, which

172

Page 173: Introduction to Econometrics 1

represent CAD/DKK, CHF/JPY, and DKK/JPY display only a significant GARCH term of 0.04, 0.005 and 0.0000 respectively.

By applying the system equations 2, which represent a constant conditional correlation of a covariance specification multivariate ARCH model, we have found different results. The coefficient C(1) represents the mean equation of the CAD/USD spot exchange rate. It has a coefficient of -7.40E-05 and a p-value of 0.94. It is not significant as it is above the 5% significance level. The mean coefficients C(2) and C(3), which shows the CHF/USD and DKK/USD respectively are not statistically significant at the 5% significance level as the p-value is 0.15 and 0.30 respectively. Finally the mean coefficient C(4), which displays the JPY/USD is not statistically significant. The variance coefficients for the constant matrix C(6), C(7), C(8), C(9) and C(10) show significant p-values of 0.00, 0.00, 0.01, 0.00, 0.00 respectively. C(11) to C(16) are the coefficients for the ARCH term. C(11), C(12), C(13), C(14) and C(15) show a statistically significant p-values. C(17) to C(22) are the coefficients for the GARCH term. The coefficients C(17), C(18), C(20), C(21), and C(22), display significant p-values at the 5% significance level. The log likelihood value of the system equations model is 2903.890. The average log likelihood value is 2.63. The Akaike information criterion is -20.88. The Schwarz criterion is -20.59 and the Hannan – Quinn criterion is -20.77.

Furthermore, we have found that GARCH represents the conditional variance matrix. The conditional variance equations for GARCH 2, 3 and 4 have significant values for the coefficient matrices except the coefficient matrix of the GARCH term B1(4). The conditional covariance equation COV1_3, which represents the conditional covariance between the CAD and the DKK spot exchange rate, displays significant values of the correlation coefficient. The correlation R(1,3) is 0.35. It has a z-statistic of 5.96 and a p-value of 0.00. The COV1_4, which is the covariance of CAD/JPY is not significant. The correlation R(1,4) is 0.04. The z-statistic is 0.61 and the p-value is 0.54. The COV2_3, COV2_4, and COV3_4, which represent CHF/DKK, CHF/JPY, and DKK/JPY all statistically significant. The correlation R(2,3) is 0.91. It has a z-statistic of 92.09 and a p-value of 0.00. The correlation R(2,4) is 0.47. It has a z-statistic of 10.93 and a p-value of 0.00. Finally, the correlation R(3,4) is 0.40. It has a z-statistic of 8.06 and a p-value of 0.00.

By applying the system equations 3, which represent a diagonal BEKK of a covariance specification multivariate ARCH model, we have found different results for the mean, variance and covariance coefficients. Specifically, the mean coefficient C(1) represents the CAD/USD spot exchange rate. It has a coefficient of 0.001 and a p-value of 0.53. It is not significant as the p-value is above the 5% significance level. The mean coefficients C(2), C(3), and C(4) which shows the CHF/USD, DKK/USD, and JPY/USD respectively are not statistically significant at the 5% significance level. C(5) to C(8) are the variance coefficients for the constant matrix. The variance coefficients C(5), C(6), and C(7) are not statistically significant. C(9) to C(12) are the coefficients for the ARCH term. They are all significant. C(13) to C(16) are the coefficients for the GARCH term. The coefficients C(13), C(14), C(15), and C(16) are statistically significant at the 5% significance level. The log likelihood value of the system equations model is 2891.647. The average log likelihood value is 2.62. The Akaike information criterion is -20.84. The Schwarz criterion is -20.63 and the Hannan – Quinn criterion is -20.75.

173

Page 174: Introduction to Econometrics 1

GARCH represents the conditional variance matrix of the diagonal BEKK covariance model. Both the coefficient matrix for the ARCH and GARCH term are significant as the p-values are below the 5% significance level. Similar results show the conditional variance equations for GARCH 2, 3 and 4. The conditional covariance equation COV2_3, which represents the conditional covariance between the CHF and the DKK spot exchange rate displays significant values of coefficient matrices. The COV1_4, which is the covariance of CAD/JPY is not significant. COV1_3, COV2_4, COV3_4, which represent CAD/DKK, CHF/JPY, and DKK/JPY display significant ARCH and GARCH terms.

By applying the Wald test of the symmetry restrictions of the natural logarithmic monthly returns of the spot exchange rate returns namely the CAD/USD, DKK/USD, CHF/USD and JPY/USD, we have found that the sample evidence suggests to accept the null hypothesis. In other words, the test has rejected the symmetry restrictions of the mean coefficients.

The best fit model to help the arbitrageurs to craft their investment strategy in terms of buying, selling or holding large portfolios of spot exchange rates is the system ARCH equations 1. It has the largest value of the log likelihood and average log likelihood value and the lowest error term value of the Akaike and Hannan - Quinn information criterion. The Schwarz criterion has the highest value.

References

Alexander, C., (2003), “Market Models. A Guide to Financial Data Analysis”. John Wiley and Sons Ltd. ISMA Centre. The Business School For Financial Markets.

Brooks,C..(2002), Introductory Econometrics for Finance. Cambridge University Press. pp.456, 470.

Bollerslev, T.,(1990), “ Modelling the Coherence in Short-run Nominal Exchange rates: A Multivariate Generalized ARCH Model”. The Review of Economics and Statistics, 72, pp.498-505.

Bollerslev, T.,(1986), “Generalized Autoregressive Conditional Heteroskedasticity”. Journal of Econometrics, 31, pp.307 – 327.

174

Page 175: Introduction to Econometrics 1

Bollerslev, T., Chou, R,Y., and Kroner, K.F., (1992), “ARCH Modelling in Finance: A Review of the Theory and Empirical Evidence”. Journal of Econometrics, 52, pp.5 -59.

Bollerslev, T, Engle, R.F, and Nelson, D.B., (1994), “ARCH Models”. Chapter 49 in Robert F. Engle and Daniel L.McFadden (eds.). Handbook of Econometrics, Volume 4, Amsterdam: Elsvier Science B.V.

Bollerslev, T, and Wooldridge, J,M., (1992), “Quasi – Maximum Likelihood Estimation and Inference in Dynamic Models with Time Varying Covariances”. Econometric Reviews, 11, pp.143 – 172.

Donald, A,W,K.,(1991), “Heteroskedasticity and Autocorrelation Consistent Covariance Matrix Estimation”. Econometrica, 59, pp.817 - 858.

Donald, A,W,K., and Monahan, J,C., (1992), “An Improved Heteroskedasticity and Autocorrelation Consistent Covariance Matrix Estimator”. Econometrica, 60, pp.953 -966.

Dickey, D.A and Fuller, W.A.,(1979), “Distribution of Estimators for Time Series Regressions with a Unit Root”. Journal of the American Statistical Association 74, pp.427-31.

Ding, Z, Granger, C.W.J and Engle, R.F., (1993), “A Long Memory Property of Stock Market Returns and a New Model”. Journal of Empirical Finance, 1, pp.83 -106.

Engle, R.F and Kroner, K.F, (1995), “Multivariate Simultaneous Generalized ARCH”. Econometric Theory, 11, pp.122 – 150.

Engle, R.F., (1982), “Autoregressive Conditional Heteroskedasticity with Estimates of the Variance of U.K. Inflation”. Econometrica, 50, pp.987 – 1008.

Engle, R.F, and Bollerslev, T., (1986), “Modelling the Persistence of Conditional Variances”. Econometric Reviews, 5, pp.1 -50.

Engle, R.F, Lilien, D.M and Robins,R.P., (1987), “Estimating Time Varying Risk Premia in the Term Structure: The ARCH-M Model”. Econometrica, 55, pp.391 - 407.

EViews 6, (2007), “ User’s Guide II ”. Quantitative micro software.

Glosten, L.R.R., Jaganathan and Runkle, D., (1993), “On the Relation between the Expected Value and the Volatility of the Normal Excess Return on Stocks”. Journal of Finance, 48, pp.1779 – 1801.

Loretan,M., Mazda,A. and Subramanian,S.(2005), Indexes of the Foreign Exchange value of the Dollar. It is published in the site of the Federal Reserve Bulletin, Winter 2005.

175

Page 176: Introduction to Econometrics 1

Nelson, D.B.,(1991), “Conditional Heteroskedasticity in Asset Returns: A New Approach”. Econometrica 59(2), pp.347-70.

Newey, W, and West, K.,(1994), “ Automatic Lag Selection in Covariance Matrix Estimation”. Review of Economic Studies, 61, pp.631 -653.

Schwert, W., (1989), “Stock Volatility and Crash of ‘87”. Review of Financial Studies, 3, pp.77 – 102.

Taylor, S., (1986), “Modelling Financial Time Series, New York: John Wiley and Sons.

Zakoian, J.M., (1994), “Threshold Heteroskedastic Models”. Journal of Economic Dynamics and Control, 18, pp.931 -944.

176