Common Correlated E ects Estimation of Dynamic Panels with ...geeverae/index_files/CCEP.pdfCCEP estimator and is not much worse than the infeasible WG estimator for moderate T. In

Common Correlated Effects Estimation of Dynamic Panels with

Cross-Sectional Dependence

Gerdie Everaert and Tom De Groote∗

SHERPPA, Ghent University

August 1, 2013

Abstract

We derive inconsistency expressions for dynamic panel data estimators under error cross-sectional

dependence generated by an unobserved common factor in both the fixed effect and the incidental trends

case. We show that for a temporally dependent factor, the standard within groups (WG) estimator is

inconsistent even as both N and T tend to infinity. Next we investigate the properties of the common

correlated effects pooled (CCEP) estimator of Pesaran [Econometrica, 2006] which eliminates the error

cross-sectional dependence using cross-sectional averages of the data. In contrast to the static case, the

CCEP estimator is only consistent when next to N also T tends to infinity. It is shown that for the most

relevant parameter settings, the inconsistency of the CCEP estimator is larger than that of the infeasible

WG estimator, which includes the common factors as regressors. Restricting the CCEP estimator results

in a somewhat smaller inconsistency. The small sample properties of the various estimators are analysed

using Monte Carlo experiments. The simulation results suggest that the CCEP estimator can be used to

estimate dynamic panel data models provided T is not too small. The size of N is of less importance.

JEL Classification: C13, C15, C23

Keywords: Cross-Sectional Dependence; Dynamic Panel; Common Correlated Effects

1 Introduction

Over the last decades, estimation of dynamic panel data models has received a lot of attention. Nickell (1981)

demonstrated that in dynamic panel data regressions the within groups (WG) estimator is inconsistent for

fixed T and N →∞. Given that the asymptotic bias may be quite sizable in many cases relevant to applied

research, various alternative estimators have been suggested including general method of moments (GMM)

estimators (Arellano and Bond, 1991; Blundell and Bond, 1998), bias-corrected WG estimators (Kiviet,

1995; Hahn and Kuersteiner, 2002; Bun and Carree, 2005; Everaert and Pozzi, 2007; Choi et al., 2010) and

likelihood-based estimators (Lancaster, 2002; Moreira, 2009). However, new challenges arise when it comes

∗We thank three anonymous referees and participants at various conferences and seminars for constructive comments. Cor-responding author: Gerdie Everaert, Tweekerkenstraat 2, B-9000 Gent, Belgium. Email: [email protected]. Website:http://www.sherppa.be.

1

to the estimation of dynamic panel data models. The recent panel data literature shifted its attention to the

estimation of models with error cross-sectional dependence. A particular form that has become popular is

a common factor error structure with a fixed number of unobserved common factors and individual-specific

factor loadings (see e.g. Coakley et al., 2002; Phillips and Sul, 2003; Bai and Ng, 2004; Pesaran, 2006).

The most obvious implication of error cross-sectional dependence is that standard panel data estimators

are inefficient and estimated standard errors are biased and inconsistent. Phillips and Sul (2003) for instance

show that if there is high cross-sectional correlation there may not be much to gain from pooling the data.

However, error cross-sectional dependence can also induce a bias and even result in inconsistent estimates.

In general, inconsistency arises as an omitted variables bias when the observed explanatory variables are

correlated with the unobserved common factors (see e.g. Pesaran, 2006). More specifically, Phillips and

Sul (2007) show that in dynamic panel data models with fixed T and N → ∞, the unobserved common

factors induce an additional (random) inconsistency term for the WG estimator even under the assumption

of temporally independent factors such that these are not correlated with the lagged dependent variable. This

bias disappears as T →∞. Sarafidis and Robertson (2009) show that also dynamic panel data IV and GMM

estimators (in levels or first-differences) are inconsistent for fixed T and N → ∞ as the moment conditions

used by these estimators are invalid under error cross-sectional dependence. Choi et al. (2010) suggest that

their common recursive mean adjustment to reduce the dynamic panel data bias of the WG estimator can

also be used under error cross-sectional dependence, but an explicit bias formula is not provided.

In this paper we further analyze the impact of error cross-sectional dependence in linear dynamic panels.

Explicit inconsistency formulas are derived for both the naive within groups (WGn) estimator, which ignores

error cross-sectional dependence, and the common correlated effects pooled (CCEP) estimator of Pesaran

(2006), which is explicitly designed to deal with unobserved common factors in the error term. We study

both the standard fixed effects model and the incidental trends model. We first extend the work of Phillips

and Sul (2007) by relaxing the assumption of a temporally independent common factor. In line with their

results we find that for fixed T and N →∞ the inconsistency of the WGn estimator is a combination of the

nonrandom dynamic panel data bias and a random component induced by the common factor in the error

term. Importantly, the latter component of the inconsistency becomes nonrandom but does not disappear

when next to N → ∞ we also let T → ∞ since the temporal dependence in the unobserved common factor

implies that the error term is correlated with the lagged dependent variable even for both N and T → ∞.

This finding should warn against the use of the WGn estimator in cross-sectionally dependent dynamic

panels even when T is large. Second, we extend the work of Pesaran (2006) by analyzing the asymptotic

behavior of the CCEP estimator in a dynamic panel data setting. The basic idea of CCEP estimation is to

deal with error cross-sectional dependence by filtering out the unobserved common factors using the cross-

section averages of both the dependent and the explanatory variables. We show that contrary to the static

model, the CCEP estimator is no longer consistent for N → ∞ and fixed T . Similar to the results for the

WG estimator, the inconsistency is a combination of the standard nonrandom dynamic panel data bias and

a random component which is now induced by orthogonalising on the cross-sectional averages. The main

difference with the WGn estimator is that both components of the inconsistency disappear as we also let

T →∞. As a benchmark, we also derive the inconsistency of the infeasible within groups (WGi) estimator,

2

which includes the unobserved factor as an explanatory variable. This WGi estimator also has a random

inconsistency for fixed T and N → ∞ which disappears as T → ∞. However, for the cases most relevant

to applied research, the inconsistency of the CCEP is bigger than that of the WGi estimator. One possible

reason for this is that the CCEP estimator as suggested by Pesaran (2006) ignores the restrictions on the

individual-specific factor loadings as implied by the derivation of the cross-sectional averages augmented

specification of the model. Imposing these restrictions, the inconsistency of the restricted CCEP estimator

is closer to that of the WGi estimator.

We next analyse the small sample properties of the WG and CCEP estimators using a small-scale Monte

Carlo experiment. First, the results illustrate that the standard WG estimator, ignoring error cross-sectional

dependence, has a persistent (as N and T growing larger) bias for a temporally dependent factor. Second,

the results show that both the unrestricted and the restricted CCEP estimators have a higher bias than the

infeasible WG estimator for small values of T but the restricted CCEP estimator outperforms the unrestricted

CCEP estimator and is not much worse than the infeasible WG estimator for moderate T . In line with the

results in Pesaran (2006), the small sample properties of the CCEP estimators are not very sensitive to the

size of N . Overall, the results suggest that the CCEP estimator is quite useful for estimating cross-sectional

dependent dynamic panel data models provided T is not too small.

The remainder of this paper is organized as follows. Section 2 sets out the basic model and its assumptions.

Section 3 derives explicit inconsistency expressions as N →∞ and large T expansions thereof for the WGn,

the WGi and the unrestricted and restricted CCEP estimators in a dynamic model with error cross-sectional

dependence. Section 4 adds exogenous explanatory variables. Section 5 reports the results of a small-scale

Monte Carlo experiment. Section 6 concludes.

2 Model and assumptions

Consider the following first-order autoregressive panel data model

(Fixed effects model) yit = αi + ρyi,t−1 + νit, (1a)

(Incidental trends model) yit = αi + bit+ ρyi,t−1 + νit. (1b)

where yit is the observation of the dependent variable for cross-sectional unit i (i = 1, . . . , N) at time t

(t = 1, . . . , T ) and the individual effects αi and bi are fixed parameters. We restrict |ρ| < 1 such that yit is

stationary. For notational convenience we assume yi0 is observed. We further assume:

Assumption A1. (Cross-section dependence) The error term νit has a single-factor structure

νit = γiFt + εit, (2)

Ft = θFt−1 + µt, |θ| < 1, (3)

where Ft is an individual-invariant time-specific unobserved effect with µt ∼ i.i.d.(0, σ2

µ

). The individual-

specific factor loadings γi satisfy plimN→∞

1

N

N∑i=1

γ2i = m2

γ being finite.

3

The restriction of a single-factor structure is for notational convenience only. The inconsistencies for

N → ∞ of the estimators presented below can straightforwardly be extended to the multi-factor case but

the large T approximations of the inconsistencies would be more complicated as these would depend on the

joint DGP of the factors.

Assumption A2. (Error condition) εit ∼ i.i.d.(0, σ2

εi

)across i and t and independent of αj, yj0 and Fs

∀i, j, t, s and plimN→∞

1

N

N∑i=1

σ2εi = σ2

ε .

This assumption allows for cross-sectional heteroscedasticity under a mild regularity condition as in

Phillips and Sul (2007).1

The model in equations (1a)-(1b) and (2)-(3) can be written in component form as

yit = y+it + γiF

+t , (4)

with

F+t = (1− ρL)

−1Ft = (ρ+ θ)F+

t−1 − ρθF+t−2 + µt. (5)

and

(Fixed effects model) y+it = αi + ρy+

i,t−1 + εit, (6a)

(Incidental trends model) y+it = αi + bit+ ρy+

i,t−1 + εit. (6b)

This component form is convenient for the proofs presented below as the components y+it and F+

t are inde-

pendent.

Assumption A3. (Initial conditions) Initial conditions are in the infinite past, such that the initial obser-

vations y+i0 and F0 satisfy

y+i0 =

αi1− α

+

∞∑j=0

ρjεi,−j , (7)

F0 =

∞∑j=0

θjµi,−j . (8)

The assumptions on the initial observations will be used in deriving large T approximations of the incon-

sistency expressions for N →∞ of the estimators presented below, but the (in)consistency of the estimators

does not rely on the specification of the initial conditions.

Moreover, for the presentation of the estimators below, stacking the model in (1a)-(1b) and (2) for each

1As in Phillips and Sul (2007) this cross-sectional heteroscedasticity does not affect the asymptotic results.

4

i yields

(Fixed effects model) yi = αiι+ ρyi,−1 + γiF + εi, (9a)

(Incidental trends model) yi = αiι+ biτ + ρyi,−1 + γiF + εi. (9b)

where yi = (yi1, . . . , yiT )′, yi,−1 = (yi0, . . . , yi,T−1)

′, F = (F1, . . . , FT )

′, εi = (εi1, . . . , εiT )

′, ι is a (Tx1) vector

of ones and τ = (1, . . . , T )′.

The affix notation on yi will be used to denote the individual mean of yit, yit will be used to denote that

yit has been demeaned while yit signifies that yit has been demeaned and detrended. Similar notation is used

for all other series.

3 Estimators

In this section we analyse the asymptotic properties of various estimators for ρ in equations (1a) and (1b)

under assumptions A1-A3. We are mainly interested in the properties of the CCEP estimator, which was

suggested by Pesaran (2006) for a static model with cross-sectional dependence. We consider both an unre-

stricted and a restricted version of the CCEP estimator. We also include two benchmark estimators. First,

the WGn estimator is included to illustrate the extent of the bias that can occur if error cross-section depen-

dence is ignored. Second, the WGi estimator is included to provide an upper bound to the performance of

the CCEP estimators. Each of these four estimators for ρ in the models in (1a) and (1b) is given by

ρ =

∑Ni=1 y

′i,−1Myi∑N

i=1 y′i,−1Myi,−1

= ρ+

∑Ni=1 y

′i,−1M (γiF + εi)∑N

i=1 y′i,−1Myi,−1

, (10)

bur differ in their choice of M .2

3.1 Naive within groups

Fixed effects model

The WGn estimator for ρ in (1a) is obtained by choosing M in (10) to be the projection matrix Mι =

IT−ιι′ /T that demeans the data. Nickell (1981) showed that for a model with no cross-sectional dependence,

the within groups estimator for ρ yields inconsistent results for N → ∞ and T fixed but this inconsistency

disappears as T → ∞. Phillips and Sul (2007) show that the consistency of the WGn estimator as N

and T → ∞ continues to hold in dynamic panel data models with cross-sectional dependence induced by

a temporarily independent factor. The next proposition looks into the asymptotic properties of the WGn

estimator in the case of a temporarily dependent factor. All proofs are in Appendix A.

2Note that identification of ρ requires∑Ni=1 y

′i,−1Myi,−1 to be nonsingular.

5

Proposition 1.a. In model (1a) under assumptions A1-A3, the WGn estimator is inconsistent as N →∞

plimN→∞

(ρWGn − ρ) =−A(ρ, T ) +

m2γ

σ2ε

∑Tt=1 F

+t−1Ft

B(ρ, T ) +m2γ

σ2ε

∑Tt=1

(F+t−1

)2 , (11)

where A(ρ, T ) = 11−ρ

(1− 1

T1−ρT1−ρ

)and B(ρ, T ) = T

1−ρ2

(1− 1

T1+ρ1−ρ + 2ρ

T 21−ρT

(1−ρ)2

).

For large T , the inconsistency in (11) has the following expansion

plimN→∞

(ρWGn − ρ) = −1 + ρ

T+ η

(E (gFT ) +

1 + ρ

T

)+Op

(1√T

), (12)

with gFT =∑Tt=1 F

+t−1Ft

/∑Tt=1

(F+t−1

)2

and η =m2γσ

2µ

σ2ε

/((1−θρ)(1−θ2)

1+θρ +m2γσ

2µ

σ2ε

).

For θ = 0, the large T expansion in (12) reduces to

plimN→∞

(ρWGn− ρ) = −1 + ρ

T− η 2ρ

T+Op

(1√T

), (13)

while for θ 6= 0 we have

plim(N,T )seq→∞

(ρWGn− ρ) =

(1− ρ2

)θ

(1 + θρ) + (1− θρ) (1− θ2)σ2ε

m2γσ

2µ

. (14)

Proposition 1.a is similar to the results in Phillips and Sul (2007), the only difference being that the

data generating process of Ft is different. Equation (11) shows that the inconsistency of ρWGn

for N → ∞and fixed T has two sources. The first is the standard Nickell dynamic panel data bias, which depends

on the persistence ρ in yit and on the time dimension T . This can be seen by setting the error cross-

sectional dependence to zero (m2γ = 0) such that equation (11) reduces to the standard Nickell bias formula

−A(ρ, T ) /B(ρ, T ) which has a large T expansion − (1 + ρ) /T . The second source stems from the error

cross-sectional dependence. It is apparent from equation (11) that this part of the inconsistency is random

for fixed T as 1 /T∑Tt=1 F

+t−1Ft and 1 /T

∑Tt=1

(F+t−1

)2

depend on the particular realisation for Ft. The

large T expansion in equation (12) shows that consistency of the WGn estimator for both N and T → ∞depends on the asymptotic behavior of E (gFT ), which in turn depends on the temporary dependence in Ft.

Equation (13) reproduces the approximate inconsistency expression of Phillips and Sul (2007) which shows

that for a temporally independent factor (θ = 0) the inconsistency of the WGn estimator disappears as

T → ∞. The most important novel conclusion from Proposition 1 is that inertia in Ft (θ 6= 0) implies that

the WGn estimator is inconsistent as both N,T → ∞ since E (gFT ) does not converge to zero. Essentially,

this is an omitted variable bias as θ 6= 0 implies E (yi,t−1Ft) 6= 0 such that omitting Ft from the regression

results in an inconsistent estimator for ρ even if both N and T become infinitely large. Note that given

the restrictions that |ρ| < 1 and |θ| < 1, the sign of the inconsistency depends on the sign of θ. These

results are visualized in panel (a) of Figure 1 which plots the inconsistency for θ = 0 and θ = 0.9 calculated

from equation (11) for N → ∞ and various values of T together with the large T expansion from equation

(13) for θ = 0 and the inconsistency for N,T → ∞ from (14) for θ = 0.9. The 5th and 95th percentiles

6

show the randomness in the inconsistency for N → ∞ and T fixed. The graph also shows that the large T

approximation for θ = 0 is fairly accurate.

Figure 1: Inconsistency of the WGn estimator for ρ = 0.5

(a) Fixed effects model (b) Incidental trends model

Note: σ2ε = 1− ρ2, σ2

µ = 1− θ2 and γi ∼ i.i.d.U [γL, γU ] such that m2γ = 1.09.

Incidental trends model

The WGn estimator for ρ in (1b) is obtained by choosing M in (10) to be the projection matrix Mτ =

IT − τι (τ ′ιτι)−1τ ′ι , with τι = (ι, τ), that demeans and detrends the data.

Proposition 1.b. In model (1b) under assumptions A1-A3, the WGn estimator is inconsistent as N →∞

plimN→∞

(ρWGn− ρ) =

−C(ρ, T ) +m2γ

σ2ε

∑Tt=1 F

+t−1Ft

D(ρ, T ) +m2γ

σ2ε

∑Tt=1

(F+t−1

)2 , (15)

where C(ρ, T ) = 1T−1

21−ρ

((T − 1)− 2

1−ρC1

)and D(ρ, T ) = T−2

1−ρ2

(1− 1

T−24ρ

1−ρD1

), with C1 and C2 being

defined in equations (A-12)-(A-13).


plimN→∞

(ρWGn− ρ) = −2

1 + ρ

T+ η

(E (hFT ) + 2

1 + ρ

T

)+Op

(1√T

), (16)

with hFT =∑Tt=1 F

+t−1Ft

/∑Tt=1

(F+t−1

)2

and η as defined in Proposition 1. A such, for θ = 0 the large T

expansion in (16) reduces to

plimN→∞

(ρWGn− ρ) = −2

1 + ρ

T− η 2ρ

T+Op

(1√T

), (17)

while for θ 6= 0 the inconsistency as N,T →∞ is given by (14).

The results in Proposition 1.b show that the inconsistency for N → ∞ and T fixed is larger than in the

7

fixed effects model. For θ = 0, the expansion in (17) reproduces the expression in Phillips and Sul (2007)

which shows that the first term in the inconsistency expression, which is due to the elimination of the fixed

effects and the incidental trends, is approximately twice as large as in the fixed effects model. For θ 6= 0, the

inconsistency for N,T → ∞ is the same as in the fixed effects model. These results can also be seen from

panel (b) in Figure 1.

3.2 Infeasible within groups

Fixed effects model

The WGi estimator, i.e. including Ft as an observed regressor, for ρ in (1a) is obtained by choosing M in

(10) to be the projection matrix MFι = IT − Fι (F ′ιFι)−1F ′ι , with Fι = (ι, F ).

Proposition 2.a. In model (1a) under assumptions A1-A3, the WGi estimator is inconsistent as N →∞

plimN→∞

(ρWGi− ρ) =

−A(ρ, T )−∑T−1t=1 ρt−1gF,t

B(ρ, T )− 11−ρ2

(1 + 2ρ

∑T−1t=1 ρt−1gF,t

)+ T

m2γ

σ2εkF

, (18)

where gF,t =∑Ts=t+1 κs,s−t and kF = 1

T

∑Tt=1

(F+t−1

)2(

1− ( 1T

∑Tt=1 FtF

+t−1)

2

1T

∑Tt=1 F

2t

1T

∑Tt=1 (F+

t−1)2

), with κs,s−t being

the (s, s− t) th element in F(F ′F

)−1

F ′.


plimN→∞

(ρWGi− ρ) = − 1

T

(1 + ρ+

θ(1− ρ2

)(1− θρ)

)(1 +

m2γ

(1− θρ)2

σ2µ

σ2ε

)−1

+ op

(1

T

). (19)

Proposition 2.a shows that the inconsistency of the WGi estimator for N →∞ also has two sources. The

first is again the standard Nickell bias. The second source now stems from orthogonalizing on the observed

factor Ft. For fixed T , this induces randomness in the inconsistency as the orthogonalisation depends on

the particular realization of the process Ft. Moreover, equation (19) shows that temporal dependence in the

common factor (θ 6= 0) also induces a nonrandom inconsistency for N → ∞, which disappears as T → ∞.

The intuition for this result is that for fixed T the transformed error term MFιεi is, next to being a function

of the average error term εi due to the within transformation, now also a function of the entire series F (as

represented by gF,t) due to the orthogonalisation on Ft which results in correlation with the explanatory

variable yi,t−1. The denominator of equation (19) further shows that the inconsistency is smaller when

the error cross-sectional dependence is stronger as this implies more variability in the explanatory variable

yi,t−1, which is induced by Ft−1 and is not completely filtered out by including Ft as a control variable in

the regression. This additional variability is captured by the term kF in the denominator of equation (18).

The inconsistency for N →∞ calculated from equation (18) and its large T approximation calculated from

equation (19) are plotted in panel (a) of Figure 2. The graph shows that the large T approximation is again

fairly accurate.

8

Figure 2: Inconsistency for N → ∞ of the WGi estimator for ρ = 0.5 and θ = 0.9


Note: See Figure 1.


The WGi estimator for ρ in (1b) is obtained by choosing M in (10) to be the projection matrix MFτ =

IT − Fτ (F ′τFτ )−1F ′τ , with Fτ = (ι, τ, F ).

Proposition 2.b. In model (1b) under assumptions A1-A3, the WGi estimator is inconsistent as N →∞

plimN→∞

(ρWGi − ρ) =−C(ρ, T )−


D(ρ, T )− 11−ρ2

(1 + 2ρ


)+ T

m2γ

σ2εkF

, (20)

where gF,t =∑Ts=t+1 κs,s−t and kF = 1

T

∑Tt=1

(F+t−1

)2(

1− ( 1T

∑Tt=1 FtF

+t−1)

2

1T

∑Tt=1 F

2t

1T

∑Tt=1 (F+

t−1)2

), with κs,s−t being

the (s, s− t) th element in F(F ′F

)−1

F ′.


plimN→∞

(ρWGi − ρ) = − 1

T

(2 (1 + ρ) +

θ(1− ρ2

)(1− θρ)

)(1 +

m2γ

(1− θρ)2

σ2µ

σ2ε

)−1

+ op

(1

T

). (21)

The results in Proposition 2.b show that the inconsistency for N → ∞ and T fixed is larger than in the

fixed effects model, i.e. the first term in the large T expansion of the inconsistency is again twice as large as

in the fixed effects model. These results can also be seen from panel (b) in Figure 2.

3.3 CCEP

The CCEP estimator suggested by Pesaran (2006) eliminates the unobserved common factors by including

cross-section averages of the dependent and the explanatory variables. Taking cross-sectional averages of the

9

fixed effects model in (1a) gives

yt = α+ ρyt−1 + γFt + εt, (22)

with yt denoting the cross-sectional mean of yit and similarly for the other variables. In line with Pesaran

(2007) and Phillips and Sul (2007) we make the full rank assumption γ 6= 0 (for fixed N and N →∞) such

that equation (22) can be solved for Ft as

Ft =1

γ(yt − α− ρyt−1 − εt). (23)

Note that

plimN→∞

εt = 0, (24)

plimN→∞

yt = plimN→∞

(y+t + γF+

t

)= (1− ρ)−1α+ γF+

t , (25)

with α = plimN→∞

α and γ = plimN→∞

γ 6= 0.

Inserting (23) in (1a) yields the following augmented form

yit = αi + ρyi,t−1 +γiγ

(yt − α− ρyt−1 − εt) + εit,

= α∗i + ρyi,t−1 + γ1iyt + γ2iyt,−1 + ε∗it, (26)

with γ1i = γi /γ , γ2i = ργ1i, α∗i = αi − γ1iα and ε∗it = εit − γ1iεt.

Similarly, for the incidentals trends model in (1b) we obtain

Ft =1

γ(yt − α− bt− ρyt−1 − εt), (27)

yit = α∗i + b∗i t+ ρyi,t−1 + γ1iyt + γ2iyt,−1 + ε∗it, (28)

with b∗i = bi − γ1ib.

The augmented forms in (26) and (28) are in terms of observable variables and can therefore be estimated

with least squares. This is what Pesaran (2006) refers to as CCEP estimators. We consider two versions.

The first, CCEPu ignores the restrictions on γ1i, γ2i, α∗i and b∗i as implied by (26) or (28), while the second

CCEPr takes these restrictions into account. Note that the augmented forms are easily generalized to a

multi-factor model (see e.g. equation (37) in Phillips and Sul, 2007). The advantage of the CCEPu over the

CCEPr approach is that it does not require a decision on the number of common factors. The number of

factors can however be estimated using the panel information criteria suggested by Bai and Ng (2002).

10

3.3.1 Unrestricted CCEP

Fixed effects model

The unrestricted CCEP estimator for ρ in (26) is obtained by choosing M in (10) to be the projection matrix

MGι = IT −Gι (G′ιGι)−1G′ι, with Gι = (ι, G) and G = (y, y−1), y = (y1, . . . , yT )

′and y−1 =

(y0, . . . , yT−1

)′.

Theorem 1.a. In model (1a) under assumptions A1-A3, the CCEPu estimator is inconsistent as N →∞

plimN→∞

(ρCCEPu

− ρ) =−A(ρ, T )−

∑T−1t=1 ρt−1g+

F,t

B(ρ, T )− 21−ρ2

(1 + ρ

∑T−1t=1 ρt−1g+

F,t

) , (29)

where g+F,t =

∑Ts=t+1 κ

+s,s−t with κ+

s,s−t being the (s, s− t) th element in H(H ′H

)−1

H ′, with H =(F+, F+

−1

).

The inconsistency in (29) has the following large T expansion

plimN→∞

(ρCCEPu

− ρ) = − 1

T

(1 + 2ρ+

θ(1− ρ2)

1− θρ

)+ op

(1

T

). (30)

The implication of Theorem 1.a is that the CCEPu estimator is consistent for both N and T →∞ but has

a different inconsistency expression compared to the WGi estimator for N →∞ and T fixed. The intuition

for this is that the error term εit is now orthogonalised on a constant, yt and yt−1 with, as can be seen

from (25), the latter two converging to F+t and F+

t−1 respectively as N → ∞. For fixed T , this implies two

differences compared to the WGi estimator. First, orthogonalising on F+t−1, next to on a constant and on F+

t ,

results in extra correlation between the orthogonalised error term and the explanatory variable yi,t−1 as the

latter is by construction a function of F+t−1. As such, comparing (19) and (30), the numerator of the latter

contains an extra term in ρ. Second, the extra variability in the explanatory variable yi,t−1 induced by Ft−1

is now completely filtered out by orthogonalising on F+t−1. As such, stronger error cross-sectional dependence

raises the denominator in (19) but doesn’t affect (30). Further comparing (19) and (30), it is clear that

both inconsistencies need not have the same direction and that the absolute value of the inconsistency of the

CCEPu estimator is not necessarily bigger than that of the WGi estimator. However, for the majority of

values for ρ and θ, the absolute value of the inconsistency is larger for the CCEPu estimator. For the most

relevant case of both ρ > 0 and θ > 0, the inconsistency of both the WGi and CCEPu estimator is negative.3

However, the inconsistency of the latter is bigger compared to the former. Thus, in these cases, approximating

the unobserved Ft using cross-sectional averages of the observed data results in a larger inconsistency for

N → ∞ and T fixed compared to the benchmark infeasible estimator with Ft observed. These results are

shown in panel (a) of Figure 3, which plots the inconsistency calculated from equation (29) for N →∞ and

various values of T together with the large T expansion from equation (30).


The unrestricted CCEP estimator for ρ in (28) is obtained by choosing M in (10) to be the projection matrix

MGτ = IT −Gτ (G′τGτ )−1G′τ , with Gτ = (ι, τ,G) and G as defined above.

3One example where the approximate inconsistency of the CCEPu estimator is smaller than that of the WGi estimator isthe parameter setting used in Figures 1-4 but with ρ = 0.5 being replaced by e.g. ρ = −0.5.

11

Figure 3: Inconsistency for N → ∞ of the CCEPu estimator for ρ = 0.5 and θ = 0.9


Note: See Figure 1.

Theorem 1.b. In model (1b) under assumptions A1-A3, the CCEPu estimator is inconsistent as N →∞

plimN→∞

(ρCCEPu

− ρ) =−C(ρ, T )−

∑T−1t=1 ρt−1g+

F,t

D(ρ, T )− 21−ρ2

(1 + ρ

∑T−1t=1 ρt−1g+

F,t

) , (31)

where g+F,t =

∑Ts=t+1 κ

+s,s−t with κ+

s,s−t being the (s, s− t) th element in H(H ′H

)−1

H ′, with H =(F+, F+

−1

).


plimN→∞

(ρCCEPu

− ρ) = − 1

T

(2 + 3ρ+

θ(1− ρ2)

1− θρ

)+ op

(1

T

). (32)

The results in Theorem 1.b again show that the inconsistency for N → ∞ and T fixed is larger than

in the fixed effects model, i.e. a factor − 1+ρT is added to the approximate inconsistency expression. These

results are visualized in panel (b) of Figure 3.

3.3.2 Restricted CCEP

Fixed effects model

The restricted CCEP estimator for ρ in (26) can be obtained by minimizing the objective function

SNT (ρ, F ) =1

NT

N∑i=1

(yi − ρyi,−1)′MFι (yi − ρyi,−1) . (33)

Although F is not observed when estimating ρ and similarly, ρ is not observed when estimating F , we

can replace the unobserved quantities by initial estimates and iterate until convergence. The continuously-

12

updated estimator for (ρ, F ) is defined as

(ρ

CCEPr, F)

= argminρ,F

SNT (ρ, F ) . (34)

More specifically, ρCCEPr is the solution to equation (10) with the projection matrix M chosen to be MFι=

IT − Fι(F ′ι Fι

)−1

F ′ι , where Fι =(ι, F)

, while F is the solution to the following equation

F =1

γ

(y − ρ

CCEPry−1

). (35)

Note that an estimate α for the constant α in (23) has been dropped from (35) since this would be projected

out anyway as ι is included in Fι. Note that the restricted CCEP estimator bears some similarities with the

continuously updated (Cup) estimator presented in Bai et al. (2009). The difference being that the CCEP

estimates the unobserved components via the cross-sectional averages of both dependent and explanatory

variables, whereas the Cup estimator uses a principal component approach.

Theorem 2.a. In model (1a) under assumptions A1-A3, the CCEPr estimator is inconsistent as N →∞

plimN→∞

(ρCCEPr

− ρ) =−A(ρ, T )−

∑T−1t=1 ρt−1gF ,t

B(ρ, T )− 21−ρ2

(1 + ρ


) , (36)

where gF ,t =∑Ts=t+1

˜τs,s−t with ˜τs,s−t being the (s, s− t) th element in˜F

(˜F′ ˜F

)−1 ˜F′.


plimN→∞

(ρCCEPr

− ρ) = − 1

T

(1 + ρ+

θ(1− ρ2)

1− θρ

)+ op

(1

T

). (37)

Comparing (37) and (19), the inconsistency of the CCEPr equals the inconsistency of the WGi multiplied

by a factor(

1 +m2γ

(1−θρ)2σ2µ

σ2ε

)> 1. This implies that the CCEPr inconsistency has the same direction and is

bigger than that of the WGi estimator. The intuition for this is that for fixed T , the deviation of Ft from Ft is

a function of F+t−1, as can be seen from (A-36), which induces extra correlation between the transformed error

term MFιεit and yi,t−1. Further comparing (37), (30) and (19), the inconsistency of the CCEPr estimator

is smaller than that of the CCEPu estimator and closer to that of the WGi estimator for the most relevant

case of ρ > 0 and θ > 0. The inconsistency for N →∞ and its large T expansion are shown in panel (a) of

Figure 4.


The restricted CCEP estimator for ρ in (28) is obtained by replacingMFι in (33) byMFτ such that(ρ

CCEPr, F)

is again the solution to equations (10)-(35) but with MFιreplaced by MFτ

= IT − Fτ(F ′τ Fτ

)−1

F ′τ with

Fτ =(ι, τ, F

).

13

Figure 4: Inconsistency for N → ∞ of the CCEPr estimator for ρ = 0.5 and θ = 0.9


Note: See Figure 1.

Theorem 2.b. In model (1b) under assumptions A1-A3, the CCEPr estimator is inconsistent as N →∞

plimN→∞

(ρCCEPr

− ρ) =−C(ρ, T )−


D(ρ, T )− 21−ρ2

(1 + ρ


) , (38)


˘τs,s−t with ˘τs,s−t being the (s, s− t) th element in˘F

(˘F′ ˘F

)−1˘F′.


plimN→∞

(ρCCEPr − ρ) = − 1

T

(2 (1 + ρ) +

θ(1− ρ2)

1− θρ

)+ op

(1

T

). (39)

The results in Theorem 2.b show that, in line with the results for the CCEPu estimator, a factor − 1+ρT is

added to the approximate inconsistency expression compared to the fixed effects model. The inconsistency

for N →∞ and its large T expansion are shown in panel (b) of Figure 4.

4 Including exogenous variables

This section extends the model in (1a)-(1b) by including a vector of exogenous variables, xit. Consider the

following autoregressive model

(Fixed effects model) yit = αi + ρyi,t−1 + x′itβ + νit, (40a)

(Incidental trends model) yit = αi + bit+ ρyi,t−1 + x′itβ + νit. (40b)

with xit =(x1it, . . . , x

Kit

)′a (K× 1) vector of explanatory variables which is assumed to be strictly exogenous

with respect to the errors, i.e. E(xitεis) = 0 ∀i, t, s, but allowed to be correlated with the individual effects

14

and the common factor4.

The estimator for ρ is now given by

ρ = ρ+

∑Ni=1 y

′i,−1MxiM (γiF + εi)∑N

i=1 y′i,−1MxiMyi,−1

, (41)

with the projection matrix M for each of the four considered estimators as defined in Section 3 and Mxi =

IT −Mxi (x′iMxi)−1x′iM with xi = (xi1, . . . , xiT )

′. Equation (41) shows that adding an additional vector of

exogenous explanatory variables has a double impact on ρ. First, inconsistencies tend to be smaller in absolute

value as the denominator of (41) is bigger than that of (10). This is due to the fact that adding explanatory

variables increases the variance in yi,t−1 and this extra variability is not fully projected out by orthogonalising

on xit. Second, the numerator of (41) differs from the one in (10) due to the extra orthogonalisation matrix

Mxi . However, for the WGi, CCEPu and CCEPr estimators,∑Ni=1 y

′i,−1MxiM (γiF + εi)→

∑Ni=1 y

′i,−1Mεi

as N → ∞ since the choice of M for these estimators implies that Ft is projected out and using that

E(xitεis) = 0. As such, for N →∞ the numerator is the same as in the case of no exogenous variables. For

the WGn estimator, Ft is not projected out by M such that here will be an extra term in the numerator of

(41) which will depend on the correlation between xit and Ft.

Turning to the asymptotic behavior of β, it is straightforward to show that letting N →∞

plimN→∞

(β − β) =

plimN→∞

N∑i=1

x′iMγiF − plimN→∞

N∑i=1

x′iMyi,−1 plimN→∞

(ρ− ρ)

plimN→∞

N∑i=1

x′iMxi

, (42)

where use is made of E(xitεis) = 0. Equation (42) shows that the inconsistency of β is a fraction of the

inconsistency of ρ, with this fraction depending on the relationship between the exogenous variables xit and

yi,t−1. For the WGi, CCEPu and CCEPr estimators, the first term in the numerator is zero since the choice

of M for these estimators implies that Ft is projected out. For the WGn estimator this term is not zero such

that this results in an additional inconsistency term when xit is correlated with Ft. This inconsistency does

not disappear as T →∞.

5 Monte Carlo simulation

In this section we conduct a small-scale Monte Carlo experiment to investigate the small sample properties

of the CCEPu and the CCEPr estimators under error cross-sectional dependence in both the fixed effects

and the incidental trends case in comparison to the benchmark WGn and WGi estimators.

4When the explanatory variables are correlated with the common factor Ft, the condition γ 6= 0 can be generalized for theunrestricted CCEP estimator to a rank condition on the full matrix of all factor loadings as in Pesaran et al. (2013). For therestricted model γ 6= 0 is still required, though.

15

5.1 Experimental design

The data generating process (DGP) we consider is given by

yit = ρyi,t−1 + βxit + γiFt + εit, εit ∼ i.i.d.N(0,(1− ρ2

)σ2ε

), γi ∼ i.i.d.U [γL, γU ] ,

xit = φiFt + ωit, ωit ∼ i.i.d.N(0, σ2

ω

), φi ∼ i.i.d.U [φL, φU ] ,

Ft = θFt−1 + µt, µt ∼ i.i.d.N(0,(1− θ2

)σ2µ

).

We initialise yi,−49, xi,−49 and F−49 at zero and discard the first 50 observations.

We compare the 4 alternative estimators over different values of the dynamic parameters, ρ ∈ 0.5; 0.9 and

θ ∈ 0.5; 0.9, for different samples sizes, T ∈ 5; 10; 25; 50; 100 and N ∈ 20; 100, with σ2ε = σ2

ω = σ2µ = 1,

β = 1 − ρ, (γL, γU ) = (0.5, 1.5) and (φL, φU ) = (0.5, 1.5). Results for different parameter settings are

available on request. The estimators include either fixed effects or both fixed effects and incidental trends.5

All experiments are based on 5000 iterations. The estimators are compared in terms of (i) mean bias (bias),

(ii) root mean squared error (rmse) and (iii) real size (size). The size is calculated for a two-sided hypothesis

test at the 5% nominal level of significance using standard errors taken from the covariance matrix Ω of the

coefficients estimates(ρ, β′

)which, given that εit is a homoscedastic and serially uncorrelated error term,

can be estimated in the standard way as

Ω = σ2ε

(N∑i=1

X0′

i X0i

)−1

,

where X0i =

∂M(ρyi,−1+xiβ)

∂(ρ,β′)with the four alternative estimators differing in their choice of M . Note

that for the linear estimators WGn, WGi and CCEPu, X0i = MXi, with Xi = (yi,−1, xi), such that

Ω = σ2ε

(∑Ni=1X

′

iMXi

)−1

. For the non-linear CCEPr estimator, the first derivatives in X0i are evaluated

numerically.

5.2 Simulation Results

The simulation results for the fixed effects and the incidental trends case can be found in Table 1 and 2

respectively. Before looking into the results in detail, two overall conclusions can be drawn. First, in line

with the theoretical results the biases for small T are much bigger in the incidental trends case than in the

fixed effects case. Second, comparing the results for N = 20 with those for N = 100 shows that the size of

the cross-sectional dimension N does not have a considerable effect on the size of the biases for any of the

considered estimators. This is in line with the results in Pesaran (2006).

With respect to estimating ρ, the following conclusions stand out. First, the WGn estimator is biased

for all combinations of N and T , with the bias being negative for small T and positive for larger values of

T . This is in line with the theoretical results in Section 3.1 which show that the switch in sign is due to

the fact that as T increases the Nickell part of the bias, which is negative, diminishes whereas the positive

bias originating from the unobserved common component does not as θ 6= 0 (see also Figure 1). In line with

5Note that for simplicity the DGP for yit does not include fixed effects or incidental trends. This is without loss of generalityas they would be projected out anyway by fitting them in the regression model.

16

equation (14), the bias of the WGn estimator for larger values of T is bigger for larger values of θ and smaller

for larger values of ρ. Second, for small values of T the WGi estimator is biased but as T increases, this

bias shrinks to zero. The absolute value of the bias of the WGi estimator is increasing in θ but, in contrast

to the WG estimator in a model with no cross-sectional dependence, not increasing in ρ. This can also be

seen from equation (19). Third, the CCEP estimators both have a considerably larger bias compared to the

benchmark WGi estimator for small values of T with CCEPr clearly outperforming CCEPu. The absolute

value of the bias of the CCEP estimators is increasing in both ρ and θ. The bias diminishes as T increases

and is more or less gone when T = 50 in the fixed effects case and when T = 100 in the incidental trends

case. Note that for small values of T , both the WGn and WGi estimator have a considerably smaller rmse

compared to the CCEP estimators but for larger values of T the rmse of the CCEP estimators is more in

line with that of the WGi estimator and is clearly smaller than that of the WGn estimator. Turning to the

estimates for β, a relatively (to ρ) smaller bias is found for the WGi, CCEPu and CCEPr estimators with

the bias disappearing as T grows larger. For the WGn estimator, there is a considerable bias which does not

decrease as T grows larger. This is due to the fact that in the DGP xit is correlated with the common factor

Ft in yit but this common factor is omitted from the regression.

Looking at the size results, all estimators for ρ are subject to large size distortions. This is not surprising,

given the substantial downward biases for small T . For the WGi, CCEPu and CCEPr estimators the size

improves along with the decrease in their bias as T grows larger, though. As these estimators are also much

less biased for β, size distortions for inference on β are much more moderate and size is more or less correct

from T = 25 onwards. The WGn estimator suffers from substantial size distortions for both ρ and β for all

values of T .

In sum, the results show that ignoring cross-sectional dependence in a dynamic panel data model implies

the standard WG estimator to break down. Although the WGn estimator tends to have a relatively small

bias for ρ when T is small, in general it yields biased estimates for ρ and β and considerable size distortions

with these biases and size distortions not disappearing as T grows large. The CCEP estimators have a larger

bias than the infeasible WG estimator for small T , with the CCEPr estimator being relatively less biased

than the CCEPu estimator, but the biases disappear when T grows larger. For moderate values of T , the

CCEP estimators have more or less correct size for β but remain oversized for ρ.

17

Table

1:

Monte

Carl

ore

sult

s,fixed

effec

tsca

se

ρ=

0.5

ρ=

0.9

θ=

0.5

θ=

0.9

θ=

0.5

θ=

0.9

Res

ult

sfo

rρ

Res

ult

sfo

rβ

Res

ult

sfo

rρ

Res

ult

sfo

rβ

Res

ult

sfo

rρ

Res

ult

sfo

rβ

Res

ult

sfo

rρ

Res

ult

sfo

rβ

TN

bia

srm

sesi

zebia

srm

sesi

zebia

srm

sesi

zebia

srm

sesi

zebia

srm

sesi

zebia

srm

sesi

zebia

srm

sesi

zebia

srm

sesi

ze

520

WG

n−

0.11

0.2

045.0

0.30

0.3

670.4−

0.1

20.2

043.2

0.11

0.1

826.5−

0.24

0.3

773.4

0.27

0.3

179.9−

0.1

60.3

169.9

0.09

0.1

334.9

WG

i−

0.15

0.2

230.3−

0.0

30.1

56.6−

0.2

50.3

247.3−

0.05

0.1

58.7−

0.15

0.2

334.1−

0.01

0.0

75.7−

0.2

20.3

247.1−

0.01

0.0

75.6

CC

EP

u−

0.56

0.6

860.8−

0.1

50.2

619.5−

0.5

90.7

064.3−

0.15

0.2

620.7−

0.93

1.0

283.8−

0.05

0.0

912.9−

1.0

11.0

989.1−

0.05

0.1

015.1

CC

EP

r−

0.36

0.4

184.2−

0.0

70.1

416.1−

0.4

00.4

487.3−

0.08

0.1

518.4−

0.60

0.6

597.6−

0.03

0.0

613.7−

0.6

50.7

796.9−

0.03

0.0

614.7

1020

WG

n−

0.01

0.0

933.0

0.38

0.4

197.1

0.03

0.1

140.4

0.17

0.2

065.2−

0.10

0.1

768.

70.

370.3

999.2−

0.0

30.1

574.6

0.16

0.1

979.8

WG

i−

0.05

0.0

723.0−

0.0

10.0

75.6−

0.1

00.1

246.7−

0.01

0.0

75.7−

0.04

0.0

631.3

0.00

0.0

45.4−

0.0

60.0

941.3

0.00

0.0

45.1

CC

EP

u−

0.24

0.2

775.7−

0.0

40.0

99.5−

0.2

70.3

084.3−

0.04

0.0

910.7−

0.45

0.4

898.7−

0.02

0.0

410.0−

0.5

50.5

799.8−

0.03

0.0

513.4

CC

EP

r−

0.17

0.1

969.8−

0.0

20.0

88.7−

0.1

90.2

179.3−

0.03

0.0

89.3−

0.28

0.3

096.3−

0.01

0.0

411.3−

0.3

40.3

796.7−

0.02

0.0

411.3

2520

WG

n0.

060.0

762.2

0.42

0.4

310

0.0

0.13

0.1

487.9

0.22

0.2

397.8−

0.01

0.0

550.6

0.44

0.4

510

0.0

0.04

0.0

774.2

0.26

0.2

899.8

WG

i−

0.02

0.0

312.3

0.00

0.0

45.6−

0.0

40.0

528.1

0.00

0.0

45.8−

0.01

0.0

217.5

0.00

0.0

25.6−

0.0

10.0

221.2

0.00

0.0

25.8

CC

EP

u−

0.08

0.0

956.1−

0.0

10.0

56.0−

0.1

00.1

171.5−

0.01

0.0

56.2−

0.16

0.1

798.7−

0.01

0.0

26.9−

0.2

20.2

399.7−

0.01

0.0

28.0

CC

EP

r−

0.05

0.0

732.6

0.00

0.0

46.3−

0.0

70.0

850.0−

0.01

0.0

46.5−

0.08

0.0

975.2

0.00

0.0

27.1−

0.0

80.1

178.4

0.00

0.0

210.3

5020

WG

n0.

070.0

890.2

0.43

0.4

310

0.0

0.17

0.1

799.6

0.22

0.2

310

0.0

0.01

0.0

357.4

0.46

0.4

610

0.0

0.05

0.0

694.4

0.31

0.3

2100.0

WG

i−

0.01

0.0

210.4

0.00

0.0

34.6−

0.0

20.0

218.5

0.00

0.0

35.1

0.00

0.0

111.7

0.00

0.0

14.7

0.00

0.0

111.7

0.00

0.0

15.0

CC

EP

u−

0.04

0.0

534.2

0.00

0.0

35.4−

0.0

50.0

650.0

0.00

0.0

35.7−

0.07

0.0

894.8

0.00

0.0

25.7−

0.1

00.1

098.8

0.00

0.0

25.9

CC

EP

r−

0.02

0.0

315.

00.

00

0.0

35.6−

0.0

30.0

426.7

0.00

0.0

35.8−

0.02

0.0

329.2

0.00

0.0

15.9

0.00

0.0

443.1

0.00

0.0

26.9

100

20W

Gn

0.08

0.0

999.6

0.43

0.4

3100.0

0.19

0.1

910

0.0

0.23

0.2

310

0.0

0.02

0.0

382.1

0.46

0.4

6100.0

0.05

0.0

610

0.0

0.33

0.3

3100.0

WG

i0.

000.0

18.

00.

000.0

25.0−

0.0

10.0

111.9

0.00

0.0

24.8

0.00

0.0

08.9

0.00

0.0

15.

10.

000.0

07.8

0.00

0.0

14.8

CC

EP

u−

0.02

0.0

321.5

0.00

0.0

25.8−

0.0

30.0

330.4

0.00

0.0

25.8−

0.03

0.0

481.5

0.00

0.0

15.

9−

0.04

0.0

594.3

0.00

0.0

15.9

CC

EP

r0.

000.0

27.

70.

000.0

25.9−

0.0

10.0

212.3

0.00

0.0

25.8

0.01

0.0

225.8

0.00

0.0

16.

10.

030.0

563.9

0.00

0.0

16.9

510

0W

Gn

−0.

110.1

875.0

0.30

0.3

589.9−

0.1

10.1

776.2

0.11

0.1

555.5−

0.23

0.3

587.2

0.27

0.3

193.7−

0.15

0.2

986.6

0.09

0.1

366.3

WG

i−

0.14

0.2

066.7−

0.03

0.0

811.9−

0.2

50.3

085.1−

0.0

50.0

920.6−

0.15

0.2

271.6−

0.01

0.0

37.

7−

0.22

0.3

181.2−

0.0

10.0

39.4

CC

EP

u−

0.57

0.6

687.

9−

0.15

0.2

048.3−

0.6

10.6

991.8−

0.16

0.2

153.5−

0.94

1.0

199.1−

0.05

0.0

641.1−

1.0

21.0

899.7−

0.0

50.0

646.3

CC

EP

r−

0.38

0.4

099.

8−

0.08

0.1

045.3−

0.4

20.4

499.9−

0.09

0.1

153.0−

0.62

0.6

599.9−

0.03

0.0

442.9−

0.7

30.7

699.9−

0.0

40.0

551.7

1010

0W

Gn

−0.

010.0

865.8

0.38

0.4

099.9

0.03

0.1

071.3

0.17

0.1

992.5−

0.09

0.1

686.0

0.37

0.3

910

0.0−

0.02

0.1

489.9

0.16

0.1

996.7

WG

i−

0.05

0.0

669.

0−

0.01

0.0

35.7−

0.1

00.1

192.9−

0.01

0.0

47.3−

0.04

0.0

573.0

0.00

0.0

25.

5−

0.06

0.0

878.9

0.00

0.0

25.8

CC

EP

u−

0.24

0.2

599.

0−

0.04

0.0

521.6−

0.2

80.2

999.7−

0.05

0.0

631.4−

0.45

0.4

710

0.0−

0.02

0.0

327.7−

0.5

50.5

610

0.0−

0.0

30.0

340.7

CC

EP

r−

0.17

0.1

899.

8−

0.02

0.0

414.4−

0.2

00.2

199.9−

0.03

0.0

418.5−

0.29

0.3

099.8−

0.02

0.0

224.6−

0.3

70.3

899.8−

0.0

20.0

232.0

2510

0W

Gn

0.06

0.0

783.4

0.42

0.4

310

0.0

0.13

0.1

495.4

0.21

0.2

299.9−

0.01

0.0

575.5

0.44

0.4

410

0.0

0.04

0.0

787.3

0.26

0.2

8100.0

WG

i−

0.02

0.0

246.

10.

00

0.0

24.9−

0.0

40.0

483.4

0.00

0.0

24.9−

0.01

0.0

158.5

0.00

0.0

15.

0−

0.01

0.0

158.8

0.00

0.0

14.8

CC

EP

u−

0.08

0.0

998.9−

0.01

0.0

26.2−

0.1

10.1

199.9−

0.01

0.0

27.5−

0.16

0.1

610

0.0−

0.01

0.0

19.6−

0.2

20.2

210

0.0−

0.0

10.0

115.2

CC

EP

r−

0.05

0.0

688.9

0.00

0.0

25.7−

0.0

80.0

898.8−

0.01

0.0

26.0−

0.08

0.0

898.3

0.00

0.0

17.5−

0.0

90.1

095.4

0.00

0.0

112.7

5010

0W

Gn

0.08

0.0

897.3

0.43

0.4

310

0.0

0.17

0.1

799.9

0.22

0.2

310

0.0

0.01

0.0

379.6

0.45

0.4

6100.0

0.05

0.0

697.8

0.31

0.3

2100.0

WG

i−

0.01

0.0

126.5

0.00

0.0

15.2−

0.0

20.0

262.7

0.00

0.0

15.2

0.00

0.0

037.8

0.00

0.0

15.

30.

00

0.0

035.6

0.00

0.0

15.2

CC

EP

u−

0.04

0.0

493.0

0.00

0.0

15.2−

0.0

50.0

598.9

0.00

0.0

15.9−

0.07

0.0

710

0.0

0.00

0.0

16.

2−

0.10

0.1

010

0.0

0.00

0.0

17.9

CC

EP

r−

0.02

0.0

247.3

0.00

0.0

15.0−

0.0

30.0

486.1

0.00

0.0

15.3−

0.02

0.0

260.1

0.00

0.0

15.

3−

0.01

0.0

475.1

0.00

0.0

18.4

100

100

WG

n0.

080.0

910

0.0

0.43

0.4

310

0.0

0.19

0.1

910

0.0

0.23

0.2

310

0.0

0.02

0.0

392.2

0.46

0.4

610

0.0

0.05

0.0

610

0.0

0.33

0.3

3100.0

WG

i0.

000.0

116.0

0.00

0.0

14.8−

0.0

10.0

139.4

0.00

0.0

14.7

0.00

0.0

021.0

0.00

0.0

04.

70.

000.0

020.5

0.00

0.0

04.7

CC

EP

u−

0.02

0.0

269.9

0.00

0.0

14.9−

0.0

30.0

388.8

0.00

0.0

14.9−

0.03

0.0

310

0.0

0.00

0.0

05.

1−

0.04

0.0

410

0.0

0.00

0.0

05.3

CC

EP

r0.

000.0

110.4

0.00

0.0

14.

8−

0.01

0.0

243.3

0.00

0.0

14.8

0.01

0.0

151.9

0.00

0.0

05.

20.

03

0.0

480.1

0.00

0.0

17.0

18

Table

2:

Monte

Carl

ore

sult

s,in

ciden

tal

tren

ds

case

ρ=

0.5

ρ=

0.9

θ=

0.5

θ=

0.9

θ=

0.5

θ=

0.9

Res

ult

sfo

rρ

Res

ult

sfo

rβ

Res

ult

sfo

rρ

Res

ult

sfo

rβ

Res

ult

sfo

rρ

Res

ult

sfo

rβ

Res

ult

sfo

rρ

Res

ult

sfo

rβ

TN

bia

srm

sesi

zebia

srm

sesi

zebia

srm

sesi

zebia

srm

sesi

zebia

srm

sesi

zebia

srm

sesi

zebia

srm

sesi

zebia

srm

sesi

ze

520

WG

n−

0.38

0.4

379.7

0.15

0.2

536.9−

0.4

70.4

992.0−

0.04

0.1

412.8−

0.69

0.7

988.

20.

150.2

147.6

0−

0.7

40.8

093.0

0.01

0.0

813.4

WG

i−

0.40

0.5

150.1−

0.1

10.2

414.4−

0.5

50.6

464.9−

0.15

0.2

620.5−

0.51

0.6

757.3−

0.03

0.1

09.1

0−

0.7

50.8

776.0−

0.04

0.1

012.0

CC

EP

u-

--

--

--

--

--

--

--

--

--

--

--

-

CC

EP

r−

0.68

0.7

198.1−

0.1

80.2

339.4−

0.6

90.7

297.9−

0.18

0.2

342.3−

1.09

1.1

299.9−

0.06

0.0

828.5

0−

1.1

31.1

610

0.0−

0.06

0.0

829.4

1020

WG

n−

0.11

0.1

652.5

0.34

0.3

790.9−

0.1

40.1

861.3

0.11

0.1

538.6−

0.27

0.3

483.

70.

300.3

396.3

0−

0.2

70.3

480.9

0.10

0.1

356.2

WG

i−

0.11

0.1

354.0−

0.0

20.0

85.6−

0.2

00.2

281.0−

0.04

0.0

98.0−

0.11

0.1

562.1−

0.01

0.0

45.5

0−

0.2

20.2

684.0−

0.01

0.0

46.2

CC

EP

u−

0.40

0.4

295.6−

0.0

80.1

218.8−

0.4

20.4

496.8−

0.09

0.1

322.0−

0.74

0.7

610

0.0−

0.04

0.0

518.9

0−

0.7

90.8

1100.0−

0.04

0.0

522.2

CC

EP

r−

0.32

0.3

497.9−

0.0

60.1

017.2−

0.3

40.3

598.3−

0.06

0.1

018.1−

0.59

0.6

110

0.0−

0.03

0.0

520.5

0−

0.6

40.6

5100.0−

0.03

0.0

520.3

2520

WG

n0.

030.0

642.3

0.42

0.4

310

0.0

0.07

0.1

059.3

0.20

0.2

194.8−

0.06

0.0

966.3

0.42

0.4

310

0.0

0−

0.0

10.0

867.6

0.21

0.2

399.3

WG

i−

0.03

0.0

429.5

0.00

0.0

45.5−

0.0

60.0

758.3−

0.01

0.0

46.0−

0.02

0.0

344.3

0.00

0.0

25.7

0−

0.0

30.0

454.2

0.00

0.0

25.8

CC

EP

u−

0.14

0.1

589.7−

0.0

10.0

56.9−

0.1

60.1

694.6−

0.02

0.0

57.9−

0.29

0.3

010

0.0−

0.01

0.0

39.9

0−

0.3

40.3

5100.0−

0.02

0.0

312.2

CC

EP

r−

0.11

0.1

277.4−

0.0

10.0

57.4−

0.1

20.1

386.5−

0.01

0.0

57.6−

0.20

0.2

199.3−

0.01

0.0

211.3

0−

0.2

30.2

499.2−

0.01

0.0

212.4

50

20W

Gn

0.06

0.0

780.4

0.43

0.4

310

0.0

0.14

0.1

597.4

0.22

0.2

310

0.0−

0.01

0.0

349.0

0.45

0.4

510

0.0

00.

040.0

582.5

0.29

0.3

0100.0

WG

i−

0.01

0.0

218.9

0.00

0.0

34.9−

0.0

30.0

337.7

0.00

0.0

35.0−

0.01

0.0

129.4

0.00

0.0

14.7

0−

0.0

10.0

128.8

0.00

0.0

15.0

CC

EP

u−

0.07

0.0

769.2

0.00

0.0

35.6−

0.0

80.0

880.3

0.00

0.0

36.0−

0.13

0.1

310

0.0

0.00

0.0

26.2

0−

0.1

60.1

610

0.0−

0.01

0.0

26.9

CC

EP

r−

0.04

0.0

541.7

0.00

0.0

35.7−

0.0

60.0

658.7

0.00

0.0

36.0−

0.07

0.0

888.9

0.00

0.0

26.4

0−

0.0

70.0

885.3

0.00

0.0

28.5

100

20

WG

n0.

080.0

898.7

0.43

0.4

410

0.0

0.18

0.1

8100.0

0.23

0.2

310

0.0

0.02

0.0

269.4

0.46

0.4

610

0.0

00.

050.0

599.5

0.32

0.3

3100.0

WG

i−

0.01

0.0

112.

80.

00

0.0

25.1−

0.0

10.0

222.6

0.00

0.0

25.0

0.00

0.0

117.8

0.00

0.0

15.2

00.

000.0

014.7

0.00

0.0

15.1

CC

EP

u−

0.03

0.0

443.3

0.00

0.0

25.9−

0.0

40.0

454.4

0.00

0.0

25.6−

0.06

0.0

699.

50.

000.0

16.1

0−

0.0

70.0

799.9

0.00

0.0

16.2

CC

EP

r−

0.02

0.0

217.1

0.00

0.0

25.9−

0.0

20.0

328.9

0.00

0.0

25.8−

0.01

0.0

226.

90.

000.0

16.0

00.

00

0.0

339.5

0.00

0.0

17.6

510

0W

Gn

−0.

370.4

292.0

0.15

0.2

363.0−

0.4

60.4

899.1−

0.04

0.1

041.2−

0.68

0.7

895.

00.

140.2

072.7−

0.7

30.7

897.6

0.01

0.0

635.2

WG

i−

0.39

0.4

992.1−

0.11

0.1

646.7−

0.5

50.6

297.6−

0.15

0.1

961.2−

0.52

0.6

694.0−

0.03

0.0

534.3−

0.7

50.8

599.0−

0.04

0.0

645.7

CC

EP

u-

--

--

--

--

--

--

--

--

--

--

--

-

CC

EP

r−

0.70

0.7

1100.0−

0.1

90.2

185.9−

0.7

00.7

210

0.0−

0.19

0.2

085.8−

1.12

1.1

310

0.0−

0.06

0.0

679.1−

1.1

51.1

6100.0−

0.06

0.0

681.4

1010

0W

Gn

−0.

110.1

578.3

0.34

0.3

698.7−

0.1

40.1

786.1

0.11

0.1

468.3−

0.28

0.3

493.1

0.30

0.3

399.5−

0.2

60.3

392.1

0.10

0.1

283.3

WG

i−

0.11

0.1

296.9−

0.0

20.0

411.6−

0.2

00.2

199.9−

0.04

0.0

523.4−

0.11

0.1

496.4−

0.01

0.0

29.6−

0.2

10.2

499.3−

0.01

0.0

213.7

CC

EP

u−

0.39

0.4

1100.0−

0.0

80.0

956.0−

0.4

20.4

410

0.0−

0.09

0.1

063.9−

0.74

0.7

510

0.0−

0.04

0.0

461.7−

0.7

90.8

1100.0−

0.04

0.0

469.5

CC

EP

r−

0.33

0.3

3100.0−

0.0

60.0

753.3−

0.3

40.3

510

0.0−

0.07

0.0

855.5−

0.60

0.6

010

0.0−

0.03

0.0

464.6−

0.6

50.6

5100.0−

0.03

0.0

466.5

25

100

WG

n0.

030.0

671.0

0.41

0.4

210

0.0

0.07

0.1

081.8

0.20

0.2

199.9−

0.06

0.0

984.5

0.42

0.4

210

0.0−

0.0

10.0

884.7

0.21

0.2

3100.0

WG

i−

0.03

0.0

387.9

0.00

0.0

26.0−

0.0

60.0

799.6−

0.01

0.0

26.3−

0.02

0.0

391.1

0.00

0.0

15.9−

0.0

30.0

493.2

0.00

0.0

15.6

CC

EP

u−

0.14

0.1

410

0.0−

0.0

10.0

211.8−

0.1

60.1

610

0.0−

0.02

0.0

315.5−

0.29

0.2

910

0.0−

0.01

0.0

225.2−

0.3

40.3

410

0.0−

0.01

0.0

237.3

CC

EP

r−

0.11

0.1

110

0.0−

0.0

10.0

29.7−

0.1

30.1

310

0.0−

0.01

0.0

211.1−

0.20

0.2

010

0.0−

0.01

0.0

124.9−

0.2

40.2

499.9−

0.01

0.0

230.6

5010

0W

Gn

0.06

0.0

793.1

0.43

0.4

310

0.0

0.14

0.1

599.6

0.22

0.2

210

0.0−

0.01

0.0

373.6

0.45

0.4

510

0.0

0.04

0.0

592.1

0.28

0.3

0100.0

WG

i−

0.01

0.0

266.2

0.00

0.0

15.4−

0.0

30.0

394.5

0.00

0.0

15.6−

0.01

0.0

181.4

0.00

0.0

15.6−

0.0

10.0

176.2

0.00

0.0

15.6

CC

EP

u−

0.07

0.0

799.9

0.00

0.0

16.3−

0.0

80.0

810

0.0−

0.01

0.0

16.9−

0.13

0.1

310

0.0

0.00

0.0

110.4−

0.1

60.1

610

0.0−

0.01

0.0

114.7

CC

EP

r−

0.05

0.0

597.4

0.00

0.0

15.5−

0.0

60.0

699.9

0.00

0.0

16.1−

0.07

0.0

799.6

0.00

0.0

18.9−

0.0

80.0

896.5

0.00

0.0

113.4

100

100

WG

n0.

080.0

899.9

0.43

0.4

310

0.0

0.18

0.1

8100.0

0.23

0.2

310

0.0

0.02

0.0

285.8

0.46

0.4

610

0.0

0.05

0.0

599.8

0.32

0.3

3100.0

WG

i−

0.01

0.0

139.6

0.00

0.0

14.7−

0.0

10.0

174.5

0.00

0.0

15.0

0.00

0.0

059.4

0.00

0.0

04.8

0.00

0.0

047.4

0.00

0.0

05.0

CC

EP

u−

0.03

0.0

397.5

0.00

0.0

15.1−

0.0

40.0

499.5

0.00

0.0

15.1−

0.06

0.0

610

0.0

0.00

0.0

15.8−

0.0

70.0

710

0.0

0.00

0.0

16.6

CC

EP

r−

0.02

0.0

257.7

0.00

0.0

14.8−

0.0

30.0

389.9

0.00

0.0

14.9−

0.01

0.0

265.

60.

000.0

05.2

0.00

0.0

368.3

0.00

0.0

17.0

Note

:N

ore

sult

sare

rep

ort

edfo

rth

eC

CE

Pu

esti

mato

rw

hen

T=

5as

inth

isca

seth

enu

mb

erof

un

kn

ow

np

ara

met

ers

exce

eds

the

num

ber

of

ob

serv

ati

on

s.

19

6 Concluding remarks

This papers examines the effects of error cross-sectional dependence, modelled as an unobserved common

factor, on WG and CCEP estimators in a linear dynamic panel data model. In general, the asymptotic

behavior as N → ∞ of each estimator stems from two sources: the well known Nickell dynamic panel data

bias and a random inconsistency which depends on the particular realisation of the unobserved common

factor. First, in line with Phillips and Sul (2007), we find that the naive WG estimator is inconsistent for

N → ∞ and T fixed. For a temporally dependent factor, we further show that the inconsistency remains

even for N,T → ∞. Second, contrary to the findings in Pesaran (2006) for a static model, the unrestricted

CCEP estimator is inconsistent for N → ∞ and fixed T . For a relevant range of parameter combinations,

the inconsistency is larger compared to the infeasible WG estimator. Restricting the CCEP estimator by

taking into account the restrictions on the individual-specific factor loadings as implied by the derivation

of the specification of the model augmented with cross-sectional averages results in a somewhat smaller

inconsistency. Letting N,T →∞, both the unrestricted and the restricted CCEP estimators are consistent.

The main practical conclusions of this paper are that (i) in a dynamic panel data model with error cross-

sectional dependence the naive WG estimator breaks down and (ii) while trying to account for this using

the CCEP estimator one should not dismiss the bias for small to moderate values of T . The latter implies

that bias corrections for the CCEP estimator would be very useful in practice. Along the lines suggested by

e.g. Hahn and Kuersteiner (2002), Bun and Carree (2005) and Phillips and Sul (2007), the asymptotic bias

expressions presented in this paper can in principle be used for bias correction. For a temporally independent

factor (θ = 0), this is easily implemented as the inconsistency of the CCEP estimators for fixed T is a

function of ρ only such that it is possible to invert the bias formulas and express ρ as a function of ρCCEP

to construct bias-corrected CCEP estimators. For a temporally dependent factor (θ 6= 0), however, this is

less straightforward as the bias expressions also depend on the persistence in the unobserved common factor.

A feasible bias correction procedure requires estimating this persistence. As this is beyond the scope of the

present paper, we leave this for future research.

20

References

Arellano, M., Bond, S. 1991. Some Tests of Specification for Panel Data: Monte Carlo Evidence and an

Application to Employment Equations. Review of Economic Studies 58: 277–297.

Bai, J. S., Ng, S. 2004. A PANIC Attack on Unit Roots and Cointegration. Econometrica 72(4): 1127–1177.

Bai, J.S., Kao, C., Ng, S. 2009. Panel Cointegration with Global Stochastic Trends. Journal of Econometrics

149(1): 82–99.

Bai, J.S., Ng, S. 2002. Determining the Number of Factors in Approximate Factor Models. Econometrica

70(1): 191–221.

Blundell, R., Bond, S. 1998. Initial Conditions and Moment Restrictions in Dynamic Panel Data Models.

Journal of Econometrics 87: 115–143.

Bun, M.J.G., Carree, M.A. 2005. Bias-Corrected Estimation in Dynamic Panel Data Models. Journal of

Business and Economic Statistics 23(2): 200–210.

Choi, C.Y., Mark, N.C., Sul, D. 2010. Bias Reduction in Dynamic Panel Data Models by Common Recursive

Mean Adjustment. Oxford Bulletin of Economics and Statistics 72(5): 567–599.

Coakley, J., Fuertes, A.M., Smith, R. 2002. A Principal Components Approach to Cross-Section Dependence

in Panels. In 10th International Conference on Panel Data, Berlin, July 5-6, number B5-3.

Everaert, G., Pozzi, L. 2007. Bootstrap-Based Bias Correction for Dynamic Panels. Journal of Economic

Dynamics and Control 31(4): 1160–1184.

Hahn, J.Y., Kuersteiner, G. 2002. Asymptotically Unbiased Inference for a Dynamic Panel Model with Fixed

Effects when both N and T are Large. Econometrica 70(4): 1639–1657.

Kiviet, J.F. 1995. On Bias, Inconsistency, and Efficiency of Various Estimators in Dynamic Panel Data

Models. Journal of Econometrics 68: 53–78.

Lancaster, T. 2002. Orthogonal Parameters and Panel Data. Reviewof Economic Studies 69(3): 647–666.

Moreira, M. J. 2009. A Maximum Likelihood Method for the Incidental Parameter Problem. Annals of

Statistics 37(6A): 3660–3696.

Nickell, S. 1981. Biases in Dynamic Models with Fixed Effects. Econometrica 49(6): 1417–1426.

Pesaran, M.H. 2006. Estimation and Inference in Large Heterogeneous Panels with a Multifactor Error

Structure. Econometrica 74(4): 967–1012.

2007. A Simple Panel Unit Root Test in the Presence of Cross-Section Dependence. Journal of Applied

Econometrics 22(2): 265–312.

Pesaran, M.H., Smith, L.V., Yamagata, T. 2013. Panel Unit Root Tests in the Presence of a Multifactor

Error Structure. Journal of Econometrics 175: 94–115.

Phillips, P.C.B., Sul, D. 2003. Dynamic Panel Estimation and Homogeneity Testing under Cross Section

Dependence. Econometrics Journal 6(1): 217–259.

2007. Bias in Dynamic Panel Estimation with Fixed Effects, Incidental Trends and Cross Section Depen-

dence. Journal of Econometrics 137(1): 162–188.

Sarafidis, V., Robertson, D. 2009. On the Impact of Error Cross-Sectional Dependence in Short Dynamic

Panel Estimation. Econometrics Journal 12(1): 62–81.

Tanaka, K. 1983. Asymptotic Expansions Associated with the AR(1) Model with Unknown Mean. Econo-

metrica 51: 1221–1231.

21

Appendices

Appendix A Proofs

Lemma A-1. Under Assumption A1 and A3 we have from (5)

λ0 = E(F+t

)2= (ρ+ θ)λ1 − ρθλ2 + σ2

µ,

λ1 = E(F+t F

+t−1

)= (ρ+ θ)λ0 − ρθλ1,

λs = E(F+t F

+t−s)

= (ρ+ θ)λs−1 − ρθλs−2, ∀s ≥ 2 (A-1)

which can be solved to obtain

λ0 =1 + θρ

(1− θρ) (1− θ − ρ+ θρ) (1 + θ + ρ+ θρ)σ2µ =

1 + θρ

(1− θρ) (1− θ2) (1− ρ2)σ2µ, (A-2)

λ1 =θ + ρ

(1− θρ) (1− θ2) (1− ρ2)σ2µ. (A-3)

Next, using that Ft = F+t − ρF+

t−1 we have

E(FtF

+t−1

)= E

(F+t F

+t−1

)− ρE

(F+t−1

)2= λ1 − ρλ0 =

θ

(1− θρ) (1− θ2)σ2µ. (A-4)

Proof of Proposition 1.a. Suppose Assumptions A1-A3 hold, then:

plimN→∞

(ρWGn− ρ) = plim

N→∞

(1 /NT )∑Ni=1 y

′i,−1Mι (γiF + εi)

(1 /NT )∑Ni=1 y

′i,−1Mιyi,−1

,

=

plimN→∞

(1 /NT )

N∑i=1

T∑t=1

(y+i,t−1 + γiF

+t−1

) (γiFt + εit

)plimN→∞

(1 /NT )

N∑i=1

T∑t=1

(y+i,t−1 + γiF

+t−1

)2=−A(ρ, T ) +

m2γ

σ2ε

∑Tt=1 F

+t−1Ft

B(ρ, T ) +m2γ

σ2ε

∑Tt=1

(F+t−1

)2 , (A-5)

where from Nickell (1981) A(ρ, T ) = 11−ρ

(1− 1

T1−ρT1−ρ

)and B(ρ, T ) = T

1−ρ2

(1− 1

T1+ρ1−ρ + 2ρ

T 21−ρT

(1−ρ)2

). As

argued by Phillips and Sul (2007), using straightforward calculations these expressions continue to hold for

the case of cross-sectional heteroscedasticity of the type given in A2.

Next, letting T →∞ and using Lemma A-1, we have

1

T

T∑t=1

(F+t−1

)2

= E(F+t−1

)2+Op

(1√T

)=

1 + θρ

(1− θρ) (1− θ2) (1− ρ2)σ2µ +Op

(1√T

), (A-6)

1

T

T∑t=1

(F+t−1Ft

)= E

(F+t−1Ft

)+Op

(1√T

)=

θ

(1− θρ) (1− θ2)σ2µ +Op

(1√T

). (A-7)

22

Thus, the inconsistency in (11) has the following large T expansion

plimN→∞

(ρWGn− ρ) =

−A(ρ,T )T

(1T

∑Tt=1

(F+t−1

)2)−1

+m2γ

σ2ε

∑Tt=1 F

+t−1Ft∑T

t=1(F+t−1)

2

B(ρ,T )T

(1T

∑Tt=1

(F+t−1

)2)−1

+m2γ

σ2ε

,

=−A(ρ,T )

T

(1+θρ

(1−θρ)(1−θ2)(1−ρ2)σ2µ + op (1)

)−1

+m2γ

σ2εgFT

B(ρ,T )T

(1+θρ

(1−θρ)(1−θ2)(1−ρ2)σ2µ + op (1)

)−1

+m2γ

σ2ε

=− 1+ρ

T

(1−θρ)(1−θ2)1+θρ +

m2γσ

2µ

σ2εgFT

(1−θρ)(1−θ2)1+θρ +

m2γσ

2µ

σ2ε

+ op

(1

T

),

= −1 + ρ

T+ η

(gFT +

1 + ρ

T

)+ op

(1

T

)= −1 + ρ

T+ η

(E (gFT ) +

1 + ρ

T

)+Op

(1√T

), (A-8)

where gFT =∑Tt=1 F

+t−1Ft

/∑Tt=1

(F+t−1

)2

and η =m2γσ

2µ

σ2ε

/((1−θρ)(1−θ2)

1+θρ +m2γσ

2µ

σ2ε

).

When θ = 0, Tanaka (1983) shows that the unconditional mean of gFT has a large T expansion given by

E (gFT ) = − 1+3ρT + o

(1T

)such that (A-8) reduces to

plimN→∞

(ρWGn− ρ) = −1 + ρ

T− η 2ρ

T+Op

(1√T

). (A-9)

When θ 6= 0, using (A-2) and (A-3) and letting T → ∞ we have from using (A-6)-(A-7) that E (gFT ) =(1−ρ2)θ

1+θρ such that for (N,T )seq →∞ we have from (A-8)

plim(N,T )seq→∞

(ρWGn − ρ) =

(1− ρ2

)θ

(1 + θρ) + (1− θρ) (1− θ2)σ2ε

m2γσ

2µ

. (A-10)

Proof of Proposition 1.b. Suppose Assumptions A1-A3 hold, then using the same derivations as in the

proof of Proposition 1.a and using the results from the proof of Proposition 4 in Phillips and Sul (2007), it

is straightforward to show that

plimN→∞

(ρWGn− ρ) = plim

N→∞

(1 /NT )∑Ni=1 y

′i,−1Mτ (γiF + εi)

(1 /NT )∑Ni=1 y

′i,−1Mτyi,−1

=−C(ρ, T ) +

m2γ

σ2ε

∑Tt=1 F

+t−1Ft

D(ρ, T ) +m2γ

σ2ε

∑Tt=1

(F+t−1

)2 , (A-11)

where from Phillips and Sul (2007) we have that C(ρ, T ) = 1T−1

21−ρ

((T − 1)− 2

1−ρC1

)and D(ρ, T ) =

T−21−ρ2

(1− 1

T−24ρ

1−ρD1

), with

C1 = 1− 1

T + 1

(1 +

1− ρ3

T (1− ρ)3

)+

(1

2+

1

T + 1

(1 + 2ρ

1− ρ+

1− ρ3

T (1− ρ)3

))ρT , (A-12)

D1 = 1− 1

T + 1

2

1− ρ

(1 +

1

T − 1

(1− 1− ρ3

T (1− ρ)3

(1− ρT

)+

(3ρ

1− ρ+T + 3

2

)ρT

)). (A-13)

Defining hFT =∑Tt=1 F

+t−1Ft

/∑Tt=1

(F+t−1

)2

, the results in equations (16) and (17) and the fact that for

θ 6= 0 the results in equation (14) for the fixed effects model continue to hold in the incidental trends case

following directly from using that the results in (A-6)-(A-7) for F+t−1 and Ft continue to hold for F+

t−1 and

23

Ft and noting that for θ = 0 we have from Phillips and Sul (2007) that E (hFT ) = − 2+4ρT + o

(1T

).

Proof of Proposition 2.a. Choosing M in (10) to be the projection matrix MFι = IT −Fι (F ′ιFι)−1F ′ι , we

have

ρWGi =(1 /NT )

∑Ni=1 y

′i,−1MFιyi

(1 /NT )∑Ni=1 y

′i,−1MFιyi,−1

= ρ+(1 /NT )

∑Ni=1 y

′i,−1MFιεi

(1 /NT )∑Ni=1 y

′i,−1MFιyi,−1

. (A-14)

First, under Assumptions A1-A3 the probability limit as N → ∞ of the numerator of equation (A-14) is

given by

plimN→∞

1

NT

N∑i=1

y′i,−1MFιεi = plimN→∞

1

NT

N∑i=1

T∑t=1

yi,t−1

[(εit − εi)−

T∑s=1

κst (εis − εi)

],

=1

T

T∑t=1

plimN→∞

1

N

N∑i=1

yi,t−1εit −1

T

T∑t=1

(1−

T∑s=1

κst

)plimN→∞

1

N

N∑i=1

yi,t−1εi

− 1

T

T∑t=1

T∑s=1

κst plimN→∞

1

N

N∑i=1

yi,t−1εis, (A-15)

where κst = FsFt

/∑Tt=1 F

2t . Using

plimN→∞

1

N

N∑i=1

yi,t−1εi,t−s = plimN→∞

1

N

N∑i=1

E [yi,t−1εi,t−s] = plimN→∞

1

N

N∑i=1

σ2εiρ

s−1 = σ2ερs−1, ∀s ≥ 1, (A-16)

= 0, ∀s < 1, (A-17)

the first term of (A-15) drops and the last term rewrites to

1

T

T∑t=1

T∑s=1

κst plimN→∞

1

N

N∑i=1

yi,t−1εis = −σ2ε

1

T

T∑t=2

t−1∑s=1

κstρs+t−1 = −σ2

ε

1

T

T∑t=2

t−1∑s=1

κt,t−sρs−1,

= −σ2ε

1

T

(T∑t=2

κt,t−1 + ρ

T∑t=3

κt,t−2 + ρ2T∑t=4

κt,t−3 + . . .+ ρT−2κT,1

),

= −σ2ε

1

T

T−1∑t=1

ρt−1T∑

s=t+1

κs,s−t = σ2ε

1

T

T−1∑t=1

ρt−1gF,t, (A-18)

where gF,t =∑Ts=t+1 κs,s−t =

∑Ts=t+1 FsFs−t

/∑Tt=1 F

2t .

The second term is given by

1

T

T∑t=1

(1−

T∑s=1

κst

)plimN→∞

1

N

N∑i=1

yi,t−1εi =1

T 2

σ2ε

1− ρ

T∑t=1

(1− ρt−1)

(1−

Ft∑Ts=1 Fs∑T

t=1 F2t

)

=1

T 2

σ2ε

1− ρ

T∑t=1

(1− ρt−1) =1

T

[σ2ε

T

1

1− ρ

[T − 1− ρT

1− ρ

]],

=1

Tσ2εA(ρ, T ). (A-19)

24

Using (A-16) - (A-19), equation (A-15) can be written as

plimN→∞

1

NT

N∑i=1

y′i,−1MFιεi = −σ2ε

TA(ρ, T )− σ2

ε

1

T

T−1∑t=1

ρt−1gF,t. (A-20)

Second, the probability limit as N →∞ of the denominator of equation (A-14) is given by

plimN→∞

1

NT

N∑i=1

y′i,−1MFιyi,−1 = plimN→∞

1

NT

N∑i=1

T∑t=1

yi,t−1

[(yi,t−1 − yi,−1

)−

T∑s=1

κst(yi,s−1 − yi,−1

)],

= plimN→∞

1

NT

N∑i=1

T∑t=1

[y2i,t−1 − yi,t−1yi,−1

]− plimN→∞

1

NT

N∑i=1

T∑t=1

T∑s=1

κstyi,t−1yi,s−1

+ plimN→∞

1

NT

N∑i=1

T∑t=1

T∑s=1

κstyi,t−1yi,−1,

=1

T

T∑t=1

plimN→∞

1

N

N∑i=1

[y2i,t−1 − yi,t−1yi,−1

]− 1

T

T∑t=1

T∑s=1

κst plimN→∞

1

N

N∑i=1

yi,t−1yi,s−1

+1

T

T∑t=1

T∑s=1

κst plimN→∞

1

N

N∑i=1

yi,t−1yi,−1. (A-21)

The first and second term can be written as

1

T

T∑t=1

plimN→∞

1

N

N∑i=1

[y2i,t−1 − yi,t−1yi,−1

]=

1

T

T∑t=1

plimN→∞

1

N

N∑i=1

[y+i,t−1 − y

+i,−1

]2+ γ2

i

[F+t−1 − F

+

−1

]2,

=σ2ε

T

T − 1

1− ρ2

[1− 1

T − 1

2ρ

1− ρ

[1− 1

T

1− ρT

1− ρ

]]+m2

γ

1

T

T∑t=1

[F+t−1 − F

+

−1

]2,

=σ2ε

TB(ρ, T ) +m2

γ

1

T

T∑t=1

(F+t−1

)2

,

plimN→∞

1

N

N∑i=1

yi,t−1yi,s−1 = plimN→∞

1

N

N∑i=1

(y+i,t−1 + γiF

+t−1

) (y+i,s−1 + γiF

+s−1

),

= plimN→∞

1

N

N∑i=1

y+i,t−1y

+i,s−1 + plim

N→∞

1

N

N∑i=1

γ2i F

+t−1F

+s−1 =

ρ|t−s|

1− ρ2σ2ε +m2

γF+t−1F

+s−1,

while the last term in (A-21) can be dropped since

1

T

T∑t=1

T∑s=1

κst plimN→∞

1

N

N∑i=1

yi,t−1yi,−1 =1

T

T∑t=1

(Ft∑Ts=1 Fs∑T

t=1 F2t

)plimN→∞

1

N

N∑i=1

yi,t−1yi,−1 = 0.

Thus, equation (A-21) can be written as

plimN→∞

1

NT

N∑i=1

y′i,−1MFιyi,−1

=σ2ε

TB(ρ, T ) +m2

γ

1

T

T∑t=1

(F+t−1

)2

− 1

T

T∑t=1

T∑s=1

κts

(ρ|t−s|

1− ρ2σ2ε +m2

γF+t−1F

+s−1

),

25

=σ2ε

1− ρ2

(1− ρ2

TB(ρ, T )− 1

T

T∑t=1

T∑s=1

κtsρ|t−s|

)+m2

γ

(1

T

T∑t=1

(F+t−1

)2

− 1

T

T∑t=1

T∑s=1

κstF+t−1F

+s−1

),

=σ2ε

1− ρ2

(1− ρ2

TB(ρ, T )−

(1

T

T∑t=1

κtt + 2ρ1

T

T∑t=2

t−1∑s=1

κt,t−sρs−1

))+

m2γ

1

T

T∑t=1

(F+t−1

)2

1−∑Tt=1 FtF

+t−1

∑Ts=1 Fs

(F+s−1

)∑Tt=1 F

2t

∑Tt=1

(F+t−1

)2

,

=σ2ε

1− ρ2

(1− ρ2

TB(ρ, T )− 1

T

(1 + 2ρ

T−1∑t=1

ρt−1gF,t

))+m2

γ kF , (A-22)

where

kF =1

T

T∑t=1

(F+t−1

)2

1−

(∑Tt=1 FtF

+t−1

)2

∑Tt=1 F

2t

∑Tt=1

(F+t−1

)2

.

Dividing (A-20) by (A-22) yields the result in equation (18).

Next, letting T →∞

gF,t =

T−tT

(E (FsFs−t) +Op

(1√T

))E (F 2

t ) +Op

(1√T

) =T − tT

θt +Op

(1√T

), (A-23)

kF = E((F+t−1

)2)1−E(FtF

+t−1

)2E (F 2

t )E((F+t−1

)2)+Op

(1√T

)=

1

(1− θρ)2

(1− ρ2)σ2µ +Op

(1√T

),

(A-24)

where use is made of lemma A-1. Hence,

T−1∑t=1

ρt−1gF,t =

T−1∑t=1

ρt−1

(T − tT

θt +Op

(1√T

))= θ

T−1∑t=1

(ρθ)t−1 − 1

ρ

1

T

T−1∑t=1

t (ρθ)t

+Op

(1√T

),

= θ1− (θρ)

T−1

1− θρ− 1

ρ

1

T

(θρ

1− (θρ)T−1

(1− θρ)2 − (T − 1)

(θρ)T

1− θρ

)+Op

(1√T

),

=θ

1− θρ

[1− 1

T

1− θT ρT

1− θρ

]+Op

(1√T

)=

θ

1− θρ+Op

(1√T

). (A-25)


plimN→∞

(ρWGi− ρ) =

1

T

−A(ρ, T )− θ1−θρ + op (1)

B(ρ, T ) + 1(1−θρ)2(1−ρ2)

m2γσ

2µ

σ2ε

+ op (1),

= − 1

T

(1 + ρ+

θ(1− ρ2

)1− θρ

)(1 +

m2γ

(1− θρ)2

σ2µ

σ2ε

)−1

+ op

(1

T

). (A-26)

Proof of Proposition 2.b. Under Assumptions A1-A3, it is straightforward to obtain the result in (20) using

26

the same derivations as in the proof of Proposition 2.a and using the results from the proof of Proposition 4 in

Phillips and Sul (2007). The large T expansion in (21) then follows directly from the definitions of C (ρ, T )

and D (ρ, T ) and using that the results in (A-25) and (A-24) for gF,t and kF continue to hold for gF,t and

kF .

Proof of Theorem 1.a. Choosing M in (10) to be the projection matrix MGι = IT − Gι (G′ιGι)−1G′ι, we

have

ρCCEPu =(1 /NT )

∑Ni=1 y

′i,−1MGιyi

(1 /NT )∑Ni=1 y

′i,−1MGιyi,−1

= ρ+(1 /NT )

∑Ni=1 y

′i,−1MGιε

∗i

(1 /NT )∑Ni=1 y

′i,−1MGιyi,−1

. (A-27)

Using (24) and (25), the probability limit as N →∞ of the numerator of equation (A-27) equals

plimN→∞

1

NT

N∑i=1

y′i,−1MGιε∗i = plim

N→∞

1

NT

N∑i=1

y′i,−1MHιεi, (A-28)

with MHι = IT −Hι (H ′ιHι)−1H ′ι, Hι =

(ι, F+, F+

−1

). Similar to the derivation of (A-20), (A-28) rewrites

to

plimN→∞

1

NT

N∑i=1

y′i,−1MHιεi = plimN→∞

1

NT

N∑i=1

T∑t=1

yi,t−1

[(εit − εi)−

T∑s=1

κ+st (εis − εi)

],

= − 1

T

T∑t=1

plimN→∞

1

N

N∑i=1

yi,t−1εi −1

T

T∑t=1

T∑s=1

κ+st plimN→∞

1

N

N∑i=1

yi,t−1εis,

= −σ2ε

TA(ρ, T )− σ2

ε

1

T

T−1∑t=1

ρt−1g+F,t, (A-29)

where g+F,t =

∑Ts=t+1 κ

+s,s−t and

κ+st =

1

α0

(α1F

+t F

+s − α2F

+t F

+s−1 + α3F

+t−1F

+s−1 − α2F

+t−1F

+s

), (A-30)

with α0 = α1α3 − α22, α1 =

∑Tt=1

(F+t−1

)2

, α2 =∑Tt=1 F

+t F

+t−1 and α3 =

∑Tt=1

(F+t

)2

.

Second, as (25) implies that for N → ∞ MGιF+ = 0 such that MGιyi,−1 = MGιy

+i,−1, the probability

limit as N →∞ of the denominator of equation (A-27) is given by

plimN→∞

1

NT

N∑i=1

y′i,−1MGιyi,−1 = plimN→∞

1

NT

N∑i=1

y+′

i,−1MGιy+i,−1,

= plimN→∞

1

NT

N∑i=1

T∑t=1

y+i,t−1

((y+i,t−1 − y

+i,−1)−

T∑s=1

κ+st(y

+i,s−1 − y

+i,−1)

),

= plimN→∞

1

NT

N∑i=1

T∑t=1

(y+i,t−1 − y

+i,−1

)2 − plimN→∞

1

NT

N∑i=1

T∑t=1

T∑s=1

κ+sty

+i,t−1y

+i,s−1

+ plimN→∞

1

NT

N∑i=1

T∑t=1

T∑s=1

κ+sty

+i,t−1y

+i,−1,

27

=1

T

T∑t=1

plimN→∞

1

N

N∑i=1

(y+i,t−1 − y

+i,−1

)2 − 1

T

T∑t=1

T∑s=1

κ+st plimN→∞

1

N

N∑i=1

y+i,t−1y

+i,s−1,

=σ2ε

T

T − 1

1− ρ2

[1− 1

T − 1

2ρ

1− ρ

[1− 1

T

1− ρT

1− ρ

]]− 1

T

T∑t=1

T∑s=1

κ+ts

(ρ|t−s|

1− ρ2σ2ε

),

=σ2ε

1− ρ2

(1− ρ2

TB(ρ, T )− 1

T

T∑t=1

T∑s=1

κ+tsρ|t−s|

),

=σ2ε

1− ρ2

(1− ρ2

TB(ρ, T )−

(1

T

T∑t=1

κ+tt + 2ρ

1

T

T∑t=2

t−1∑s=1

κ+t,t−sρ

s−1

)),

=σ2ε

1− ρ2

(1− ρ2

TB(ρ, T )− 2

T

(1 + ρ

T−1∑t=1

ρt−1g+F,t

)). (A-31)


Turning to the large T expansion, first note that using Lemma A-1

1

T

T∑s=t+1

F+s F

+s−t =

T − tT

E(F+s F

+s−t) +Op

(1√T

)=T − tT

λt +Op

(1√T

),

such that

g+F,t =

1

α0

(α1

T∑s=t+1

F+s F

+s−t − α2

T∑s=t+1

F+s F

+s−t−1 + α3

T∑s=t+1

F+s−1F

+s−t−1 − α2

T∑s=t+1

F+s−1F

+s−t

),

=T − tT

2ωt − ω1ωt+1 − ω1ωt−1

1− ω21

+Op

(1√T

),

with ωt = λt/λ0. Next

− 1

T

T−1∑t=1

ρt−1g+F,t = − 1

T

T−1∑t=1

ρt−1

(T − tT

2ωt − ω1ωt+1 − ω1ωt−1

1− ω21

+Op

(1√T

)),

= − (1 + θρ)2

(1− θ2) (1− ρ2)

1

T

T−1∑t=1

ρt−1

(1− t

T

)((2− (θ + ρ)ω1)ωt − (1− θρ)ω1ωt−1) + op

(1

T

),

= − (1 + θρ)1

T

T−1∑t=1

ρt−1

(1− t

T

)(ωt − ρωt−1

1− ρ2+ωt − θωt−1

1− θ2

)+ op

(1

T

),

= − 1

T

(θ

1− θρ+

ρ

1− ρ2

)+ op

(1

T

), (A-32)

where use is made of ωt = (θ + ρ)ωt−1 − θρωt−2 ∀t ≥ 2 and

T−1∑t=2

(1− t

T

)ρt−1ωt = (θ + ρ)

T−1∑t=2

(1− t

T

)ρt−1ωt−1 − θρ

T−1∑t=2

(1− t

T

)ρt−1ωt−2,

= (θ + ρ)

((1− 2

T

)ρω1 + ρ

(T−1∑t=2

(1− t

T

)ρt−1ωt −

1

T

T−1∑t=2

ρt−1ωt

))

28

− θρ

((1− 2

T

)ρ+

(1− 3

T

)ρ2ω1 + ρ2

(T−1∑t=2

(1− t

T

)ρt−1ωt −

2

T

T−1∑t=2

ρt−1ωt

)),

= (θ + ρ)

(ρω1 + ρ

T−1∑t=2

(1− t

T

)ρt−1ωt

)− θρ

(ρ+ ρ2ω1 + ρ2

T−1∑t=2

(1− t

T

)ρt−1ωt

)+O

(1

T

),

=(θ + ρ) ρω1 − θρ

(ρ+ ρ2ω1

)1− (θ + ρ) ρ+ θρ3

+O

(1

T

)= ρ

θ(1− ρ2

)(θ + ρ) + ρ2

(1− θ2

)(1− ρ2) (1− θ2ρ2)

+O

(1

T

),

such that

T−1∑t=1

(1− t

T

)ρt−1ωt = ω1 +

T−1∑t=2

(1− t

T

)ρt−1ωt =

θ(1− ρ2

)+ ρ

(1− θ2ρ2

)(1− ρ2) (1− θ2ρ2)

+O

(1

T

),

and

T−1∑t=1

(1− t

T

)ρt−1ωt−1 = 1 + ρ

T−1∑t=1

(1− t

T

)ρt−1ωt −

1

T

T∑t=1

ρt−1ωt−1,

= 1 + ρω1 − θρ2

(1− ρ2) (1− θρ)+O

(1

T

)=

(1− θ2ρ2

)+ θρ

(1− ρ2

)(1− ρ2) (1− θ2ρ2)

+O

(1

T

).

Using (A-32), as T →∞ (A-29) and (A-31) are given by

−σ2ε

TA(ρ, T )− σ2

ε

1

T

T−1∑t=1

ρt−1g+F,t = −σ

2ε

T

(A(ρ, T ) +

(θ

1− θρ+

ρ

1− ρ2

))+ op

(1

T

), (A-33)

σ2ε

1− ρ2

(1− ρ2

TB(ρ, T )− 2

T

(1 + ρ

T−1∑t=1

ρt−1g+F,t

))=σ2ε

TB(ρ, T ) +O

(1

T

). (A-34)


plimN→∞

(ρCCEPu − ρ) = − 1

T

(1

1− ρ+

(θ

1− θρ+

ρ

(1− ρ2)

))(1− ρ2

)+ op

(1

T

),

= − 1

T

(1 + 2ρ+

θ(1− ρ2)

1− θρ

)+ op

(1

T

). (A-35)

Proof of Theorem 1.b. Under Assumptions A1-A3, it is straightforward to obtain the result in (31) using

the same derivations as in the proof of Theorem 1.a and using the results from the proof of Proposition 4 in


and D (ρ, T ) and using that the result in (A-32) for g+F,t continues to hold for g+

F,t.

Proof of Theorem 2.a. First note that using (22) and (35)

Ft =1

γ

(yt − α− ρCCEPr

yt−1

)=

1

γ

((ρ− ρ

CCEPr)yt−1 + (α− α) + γFt + εt

),

Ft = Ft +1

γ

((ρ

CCEPr− ρ)yt−1 + (α− α)− εt

),

29

which for N →∞ and using (24) and (25) reduces to

Ft = Ft +1

γ

((ρ

CCEPr− ρ)

((1− ρ)

−1α+ γF+

t−1

)+ (α− α)

),

= Ft + (ρCCEPr

− ρ)F+t−1 + a∗, (A-36)

where a∗ = 1γ

((ρCCEPr − ρ) (1− ρ)

−1α+ (α− α)

). Using (A-36) in (10) with M chosen to be MFι

plimN→∞

(ρCCEPr − ρ) =

plimN→∞

(1 /NT )

N∑i=1

y′i,−1MFι

(γi plimN→∞

(ρCCEPr

− ρ)F+−1 + εi

)

plimN→∞

(1 /NT )

N∑i=1

y′i,−1MFιyi,−1

,

=

plimN→∞

(1 /NT )

N∑i=1

y′i,−1MFιεi

plimN→∞

(1 /NT )

N∑i=1

y′i,−1MFιy+i,−1

. (A-37)

Using (A-16)-(A-19), the numerator of (A-37) is given by

plimN→∞

1

NT

N∑i=1

y′i,−1MFιεi = plim

N→∞

1

NT

N∑i=1

T∑t=1

yi,t−1

[(εit − εi)−

T∑s=1

˜τst (εis − εi)

],

=1

T

T∑t=1

plimN→∞

1

N

N∑i=1

yi,t−1εit −1

T

T∑t=1

(1−

T∑s=1

˜τst) plimN→∞

1

N

N∑i=1

yi,t−1εi

− 1

T

T∑t=1

T∑s=1

˜τst plimN→∞

1

N

N∑i=1

yi,t−1εis,

= − 1

T

T∑t=1

plimN→∞

1

N

N∑i=1

yi,t−1εi −1

T

T∑t=1

T∑s=1

˜τst plimN→∞

1

N

N∑i=1

yi,t−1εis,

= −σ2ε

TA(ρ, T )− σ2

ε

1

T

T−1∑t=1

ρt−1gF ,t, (A-38)


˜τs,s−t with ˜τs,s−t being the (s, s− t) th element in˜F

(˜F′ ˜F

)−1 ˜F′.

Similarly to the derivation of (A-22), the denominator of equation (A-37) is given by

plimN→∞

1

NT

N∑i=1

y′i,−1MFιy+i,−1 = plim

N→∞

1

NT

N∑i=1

T∑t=1

[y+i,t−1 + γF+

t−1

] [(y+i,t−1 − y

+i,−1

)−

T∑s=1

˜τst (y+i,s−1 − y

+i

)],

= plimN→∞

1

NT

N∑i=1

T∑t=1

(y+i,t−1 − y

+i,−1

)2 − plimN→∞

1

NT

N∑i=1

T∑t=1

T∑s=1

˜τsty+i,t−1y

+i,s−1,

=σ2ε

1− ρ2

(1− ρ2

TB(ρ, T )− 2

T

(1 + ρ

T−1∑t=1

ρt−1gF ,t

)). (A-39)

30


Next, letting(ρ0, F 0

)denote the true parameter ρ and the true factor F respectively such that, after

centering, the objective function in (33) is given by

SNT (ρ, F ) =1

NT

N∑i=1

(yi − ρyi,−1)′MF (yi − ρyi,−1)− 1

NT

N∑i=1

ε′iMF 0 εi,

=1

NT

N∑i=1

((ρ0 − ρ

)yi,−1 + γiF

0 + εi

)′MF

((ρ0 − ρ

)yi,−1 + γiF

0 + εi

)− 1

NT

N∑i=1

ε′iMF 0 εi,

= sNT (ρ, F ) + 2

(ρ0 − ρ

)NT

N∑i=1

y′i,−1MF εi + 21

NT

N∑i=1

γiF0′MF εi +

1

NT

N∑i=1

ε′i(MF −MF 0

)εi,

where

sNT (ρ, F ) =

(ρ0 − ρ

)2NT

N∑i=1

y′i,−1MF yi,−1 +1

NT

N∑i=1

γ2i F

0′MF F0 + 2

(ρ0 − ρ

)NT

N∑i=1

y′i,−1MF F0γi.

Using that for N →∞, followed by an expansion as T →∞

1

NT

N∑i=1

y′i,−1MF εi =1

NT

N∑i=1

y′i,−1εi −1

N

N∑i=1

y′i,−1F

T

(F ′F

T

)−1F ′εiT

= op (1) ,

1

NT

N∑i=1

γiF0′MF εi =

1

NT

N∑i=1

γiF0′ εi −

1

N

N∑i=1

γiF0′ F

T

(F ′F

T

)−1F ′εiT

= op (1) ,

1

NT

N∑i=1

ε′i(MF −MF 0

)εi =

1

N

N∑i=1

ε′iF0

T

(F 0′ F 0

T

)−1F 0′ εiT− 1

N

N∑i=1

ε′iF

T

(F ′F

T

)−1F ′εiT

= op (1) .

we have

SNT (ρ, F ) = sNT (ρ, F ) + op (1) , (A-40)

uniformly over ρ and F . First note that as MF 0 F0 = 0, sNT

(ρ0, F 0

)= 0. Second, we show that for any

(ρ, F ) 6=(ρ0, F 0

), sNT (ρ, F ) > 0; thus sNT

(ρ0, F 0

)attains its unique minimum value at (ρ, F ) =

(ρ0, F 0

).

Define

A =1

NT

N∑i=1

y′i,−1MF yi,−1; B =1

NT

N∑i=1

γ2i ; C =

1

NT

N∑i=1

γiMF yi,−1.

Then

sNT (ρ, F ) =(ρ0 − ρ

)2A+ F 0′MFBMF F

0 + 2(ρ0 − ρ

)C ′MF F

0,

=(ρ0 − ρ

)2 (A− C ′B−1C

)+(F 0′MF +

(ρ0 − ρ

)C ′B−1

)B(MF F

0 +B−1C(ρ0 − ρ

)),

=(ρ0 − ρ

)2D(F)

+ θ′Bθ,

≥ 0,

31

since D (F ) = A − C ′B−1C and B are both positive definite, where θ = MF F0 + B−1C

(ρ0 − ρ

). Note that

sNT (ρ, F ) > 0 if either ρ 6= ρ0 or F 6= F 0. This implies that ρCCEPr is consistent for ρ for N,T →∞.

From (A-36) and the consistency of ρCCEPr we have that for N →∞ followed by an expansion as T →∞

Ft = Ft + op (1) , (A-41)

such that

gF ,t = gF,t + op (1) . (A-42)

The large T expansion of the CCEPr estimator in (37) then follows from substituting (A-42) in (36) and

using similar derivations as to obtain (19).

Proof of Theorem 2.b. Under Assumptions A1-A3, it is straightforward to obtain the result in (38) using

the same derivations as in the proof of Theorem 2.a and using the results from the proof of Proposition 4 in


and D (ρ, T ) and using that the result in (A-42) for gF ,t continues to hold for gF ,t.

32

Documents

Common Correlated E ects Estimation of Dynamic Panels with ...geeverae/index_files/CCEP.pdfCCEP estimator and is not much worse than the infeasible WG estimator for moderate T. In