Upload
others
View
2
Download
0
Embed Size (px)
Citation preview
Consistent selection rules for
the number of dynamic factors
in approximate factor models
Jorg Breitung∗
University of BonnUta Pigorsch†
University of Mannheim
This version: January 2009
Abstract
Determining the number of dynamic factors in large panel data sets is animportant issue for many economic applications. In this paper we developtwo selection procedures that allow to consistently estimate the numberof dynamic factors in a dynamic factor model. The procedures are basedon a canonical correlation analysis of the static factors obtained from aprincipal component analysis. Compared to other selection procedures,our approach has the advantage that it does not assume a finite order au-toregressive representation of the common factors, and that it is invariantto a rotation of the factor space. Monte Carlo simulations show that theproposed selection rules outperform the existing ones based on principalcomponents. The new selection procedures are also applied to the U.S.macroeconomic data panel used in Stock and Watson (2005).
∗Address: University of Bonn, Institute of Econometrics, Adenauerallee 24-42, 53113 Bonn,Germany, phone: +49 (0)228 73 9201, fax: +49 (0)228 73 9189, email: [email protected].†Address: University of Mannheim, Department of Economics, L7, 3-5, 68131 Mannheim,
Germany, phone: +49 (0)621 181 1945, fax: +49 (0)621 181 1931, email: [email protected].
1 Introduction
In many economic applications it is very appealing to represent a large number
of series by a small number of latent factors. In macroeconomics, for example,
factor models have been used in the business cycle analysis (see e.g. Forni and
Reichlin, 1998 and Giannone, Reichlin, and Sala, 2006) and in the identification of
common macroeconomic or policy shocks (see e.g. Favero, Marcellino, and Neglia,
2005, Forni, Lippi, and Reichlin, 2008 and Stock and Watson, 2005). Recently, it
has also been shown that forecasts based on a few number of so–called diffusion
indices, that summarize a huge number of candidate predictor variables, obtain
smaller forecast errors relative to alternative techniques based on ARIMA models
or small structural economic models (see e.g. Angelini, Henry, and Mestre, 2001,
Brisson, Campbell, and Galbraith, 2003, Eickmeier and Ziegler, 2008, Marcellino,
Stock, and Watson, 2003, and Stock and Watson, 1999, 2002a,b). Knowing the
correct number of common factors is crucial for all of these applications. Bai and
Ng (2002) propose information criteria that can be used to consistently select the
number of common factors in a static factor model with both N and T converging
to infinity. The shortcoming of such a factor model, however, is the assumption
of static factors. In view of the common fluctuations of many economic series, it
seems to be more realistic to specify dynamic, e.g. autoregressive, factors that
explain the comovement of the observed series. A leading example of such a
dynamic factor model is considered in Stock and Watson (2002a). There, a large
number of series is represented by a few number of dynamic factors whose lags
may also enter the factor representation. Applying the information criteria of
Bai and Ng (2002) yields a consistent estimate of the number of common factors
in this so–called static representation of the factors.1
An important drawback of this approach, however, is that it cannot disen-
tangle the dynamic factors from their lags and, thus, the estimated number of
factors is equal to the sum of the number of dynamic factors and their lags in
the factor representation. Forni, Hallin, Lippi, and Reichlin (2000), therefore,
suggested informal criteria for determining the number of dynamic factors. Their
approach is based on the computation of the variance explained by the common
components estimated from the Stock-Watson procedure. The procedure is also
applied in Favero et al. (2005) and Giannone, Reichlin, and Sala (2002).
1Pesaran (2006) suggested a methodology that only assumes an upper bound for the numberof common factors. Thus, this approach sidesteps the problem of determining the number ofcommon factors.
1
A weakly consistent selection rule for the number of dynamic (or “primitive”)
factors (k) is suggested by Bai and Ng (2007). Their empirical procedure is based
on the fact that fitting a vector autoregressive (VAR) model to the vector of r > k
principal components yields an asymptotically singular covariance matrix of the
residuals. Therefore, the r − k smallest eigenvalues of the residual covariance
matrix converge to zero as N and T tend to infinity. Based on the eigenvalues of
the residual covariance matrix, Bai and Ng (2007) derive two statistics and corre-
sponding asymptotic bounds that allow for a consistent selection of the number of
dynamic factors. A related test procedure is suggested by Amengual and Watson
(2007) and Stock and Watson (2005). Their empirical procedure is based on the
innovations of the vector of time series. Since the factor representation of the
innovations involve k common components (instead of r factors of the original
time series), they suggest to apply the information criteria suggested by Bai and
Ng (2002) for selecting the number of static factors of these innovations, which
corresponds to the number of dynamic factors of the observed data.
In this paper two alternative specification criteria for the number of dynamic
factors are proposed. Our approach is based on a canonical correlation analysis
(CCA) between the current and past values of the r “static” (or reduced form)
factors. The first selection procedure is based on the fact that lagged values of
common factors are perfectly predictable conditional on the past of the factors
and, therefore, the associated eigenvalues of the CCA tend to unity as the sample
size tends to infinity, whereas the remaining eigenvalues converge to values smaller
than one. The second selection procedure exploits the fact that if a sufficient
number of lags of the dynamic factors enter the static representation of the factors,
then there exist k linear combinations that cannot be predicted based on the past
of the factors and, therefore, the respective eigenvalues converge to zero. To test
the hypothesis that k eigenvalues are equal to zero, we propose a likelihood ratio
statistic.
The CCA procedure has several advantages. First, it can also be applied to
the approximate (instead of a strict) factor model. Second, our first selection
procedure is not based on a finite order VAR representation of the dynamic
factors. Therefore, in contrast to the approach of Bai and Ng (2007) our selection
procedure can also be applied if the common factors are generated by moving
average processes. Third, the selection procedures based on the CCA are invariant
to any rotation of the factor space. Finally, our Monte Carlo simulations suggest
that the CCA criteria are less sensitive to nuisance parameters and generally
2
outperform alternative selection procedures.
The rest of the paper is organized as follows. In Section 2 we introduce the
factor model and review the existing selection procedures for the number of dy-
namic factors. Section 3 proposes two selection procedures based on canonical
correlations. In Section 4 we compare the finite sample properties of the alter-
native selection methods and apply the new procedures to the dataset of Stock
and Watson (2005). Section 5 concludes.
2 Determining the number of dynamic factors
Following Stock and Watson (1999, 2002a,b) we consider the (restricted) dynamic
factor model of the form2
xt = A0ft + · · ·+ Amft−m + ut (1)
≡ AFt + ut , (2)
where xt and ut are N -dimensional column vectors, ft is k×1 vector with k � N ,
A = [A0, . . . , Am] and Ft = [f ′t , . . . , f′t−m]′. For some part of our analysis we
assume that the vector of common dynamic factors possesses a VAR(p) represen-
tation
ft = Γ1ft−1 + · · ·+ Γpft−p + εt , (3)
where Γ1, . . . ,Γp and Σ = E(εtε′t) are k × k matrices. However, for our first
selection criterion we do not assume a finite order VAR representation and just
assume that T−1∑T
t=1 ftf′t
p→ Σf is a positive definite matrix.
It is important to note that A need not have full column rank. For example,
a subset of the factors may not enter with all lags. In this case the respective
columns of A are zero. Let r ≤ (m + 1)k be the rank of the matrix A. Then
there exists a N × r matrix A such that
AFt = AFt
where Ft = RFt and R is a nonsingular r × (m + 1)k matrix. Ft is called the
vector of static factors.
2Note that the model is a restricted version of the general dynamic factor model developedin Forni et al. (2000) and Forni and Lippi (2001) since we assume that the factors enter withfinite order lag polynomials. An identification procedure for the number of dynamic factors ina more general dynamic framework is proposed in Hallin and Liska (2007).
3
We assume that the idiosyncratic error ut is weakly correlated across series
and time, whereas the common factors ft, . . . , ft−p give rise to a strong correla-
tion among the series.3 To identify the number of dynamic factors (or primitive
factors) ft from the static factors Ft, Favero et al. (2005) and Forni et al. (2000)
inspect the fraction of variance explained by the common components. An im-
portant problem of this criterion is that the choice of the cut-off point is rather
arbitrary. Therefore, Amengual and Watson (2007), Bai and Ng (2007) and Stock
and Watson (2005) suggest consistent selection rules for the number of dynamic
factors.
In the following we briefly review these approaches. For the ease of exposition
we assume for the moment that A has full column rank such that A = A and Ft =
Ft. Furthermore, assume that m ≥ p so that the vector Ft can be represented by
a VAR(1) system given by
Ft = CFt−1 + ηt , (4)
where
C =
Γ1 Γ2 · · · Γm−1 ΓmIq 0 · · · 0 00 Iq 0 0...
. . ....
0 0 Iq 0
,ηt = [ε′t, 0]′ and Γj = 0 for j > p. As shown by Bai and Ng (2002) and Stock and
Watson (2002a), the Principal Components (PC) estimator Ft is a consistent es-
timator for HFt, where H is an appropriately chosen rotation matrix. Therefore,
if Ft is replaced by Ft the VAR becomes
Ft = C∗Ft−1 + et . (5)
In this representation C∗ = HCH−1 and, for N → ∞ and T → ∞, the asymp-
totic covariance matrix of et is Σe = HΣηH′, where Ση = E(ηtη
′t). Obviously, Σe
is of rank k since rk(Ση) = k.
Thus, the number of dynamic factors can be obtained by determining the rank
of Ση (or Σe). Bai and Ng (2007), for example, propose a selection rule based
on the ordered eigenvalues c1 ≥ · · · ≥ cN of the estimated residual covariance
matrix
Σe = T−1
T∑t=2
FtF′t − T−1
T∑t=2
FtF′t−1
(T∑t=2
Ft−1F′t−1
)−1 T∑t=2
Ft−1F′t .
3These properties are formalized in the assumption given in e.g. Bai and Ng (2002) andStock and Watson (2002a).
4
They suggest two test statistics based on the eigenvalues of Σe:
D1,k∗ =ck∗+1√∑r
j=1 c2j
(6)
D2,k∗ =
(∑rj=k∗+1 c
2j∑r
j=1 c2j
)1/2
. (7)
Since Σep→ Σe as N and T tend to infinity, it follows that Dj,k∗
p→ 0 for k∗ ≥ k
and j = 1, 2. Specifically, Bai and Ng (2007) show that the estimates of the
number of dynamic factors
kj = min{k : Dj,` < m∗/min(N2/5, T 2/5)} (8)
are (weakly) consistent in the sense that kjp→ k as N, T →∞. Based on Monte
Carlo simulations, Bai and Ng (2007) recommend the value m∗ = 1.
An alternative selection procedure is suggested by Amengual and Watson
(2007) and Stock and Watson (2005). They assume a factor model, where the
idiosyncratic components possess the autoregressive representation
αi(L)uit = νit i = 1, . . . , N, (9)
with αi(L) = 1−α1L−· · ·−αqiLqi and qi is a variable specific lag order. Consider
the vector of innovations:
ξt = xt − E(xt|xt−1, xt−2, . . .)
= A0εt + νt (10)
where νt = [ν1t, . . . , νNt]′. It follows that the number of common factors εt in (10)
is identical to the number of dynamic factors k. Therefore, the selection criteria
suggested by Bai and Ng (2002) can be used to determine k from the factor model
(10). To obtain an estimator of ξt, Amengual and Watson (2007) and Stock and
Watson (2005) suggest a two-step procedure. First, the r static factors in Ft are
estimated from xt by using the usual PC estimator. In a second step, an estimate
of the element ξit is obtained as the residual of a regression of xit on lags of Ft and
xit.4 The number of dynamic factors are estimated by applying the information
criteria suggested by Bai and Ng (2002) to the estimated innovation vector ξt.
4Alternatively, a regression of αi(L)xit on lags of Ft = αi(L)Ft can be performed, where αiis obtained from an autoregression of the estimated idiosyncratic components.
5
Jacobs and Otter (2008) suggest a test procedure for determining the number
of factors in a strict factor model (with small N) under the assumption that the
dynamic factors ft have a finite order moving average representation of order q
and the idiosyncratic components are white noise. In the first step, the lag order
q is selected by testing the hypothesis that all eigenvalues from a CCA of xt on
xt−q−1 are equal to zero. In a second step, a LR test is applied to find out the
number of nonzero eigenvalues from a CCA of xt on xt−q. If all factors have a
MA(q) representation, then the second CCA tends to indicate k nonzero eigenval-
ues, which corresponds to the estimated number of dynamic factors. An obvious
disadvantage of this test procedure is that it is based on a number of restrictive
assumptions (e.g. that the MA orders of all common factors are identical and
that the idiosyncratic components are white noise) that are difficult to verify in
empirical applications. In the next section alternative selection procedures are
suggested that are applicable to the dynamic factor model considered above.
3 Selection rules based on canonical correlations
An important drawback of the selection procedure suggested by Bai and Ng
(2007) is that the statistics depend on the scaling of the factor estimates Ft in
the sense that a transformation of the factors such as Ft = QFt affects the value
of the test statistics D1,k∗ and D2,k∗ . The test statistic of Bai and Ng (2007) is
obtained from the eigenvalue problem |c∗Ir−QΣeQ′| = 0. Obviously, the resulting
eigenvalues c∗j , j = 1, . . . , r depend on the rotation matrix Q. Dividing the
eigenvalues by the sum of eigenvalues does not solve the problem in general. This
is an undesirable property as the normalization of the factor space is arbitrary.5
In what follows we suggest a test statistic that is invariant to a transformation
of the form Ft = QFt. Consider first the case m = 1. In this case linear
combinations of ft and ft−1 may enter Ft. Our selection procedure is based on
the generalized eigenvalues from
|λ∗j S00 − Σe| = 0,
or, equivalently,
|λjS00 − S01S−111 S
′01| = 0 (11)
5For example, the test statistic is different if the estimated factors are computed as ft = V ′rxt(as in Bai and Ng, 2007), where Vr is the matrix of r eigenvectors associated with the r largesteigenvalues of Σ, or whether it is normalized as T−1
∑Nt=1 ftf
′t = Ir (as in Bai and Ng, 2002).
6
where λj = 1− λ∗j and
Sij = T−1
T∑t=2
Ft−iF′t−j.
An important advantage of using the eigenvalue problem (11) is that the eigenval-
ues are invariant to a rotation of the system. Therefore, selection criteria based
on the eigenvalue λκ or the sum of eigenvalues λκ + · · ·+ λr are scale invariant.
For the more general case with some lag order m ≥ 1 the following theo-
rem considers the asymptotic properties of the eigenvalues λ1, . . . , λr from the
generalized eigenvalue problem
|λjS00 − S01S−111 S
′01| = 0, (12)
where
S00 =T∑
t=m+1
FtF′t , S01 =
T∑t=m+1
FtG′t−1 , S11 =
T∑t=m+1
Gt−1G′t−1
and Gt−1 = [F ′t−1, . . . , F′t−m]′ is the vector of m lags.
Theorem: Let Ft denote the PC estimator of the vector of static components Ft
in (1) and assume that the assumptions of Bai and Ng (2002) are fulfilled such
that T−1∑T
t=1 ||Ft −HFt||2 = Op(C−2NT ), where CNT = min(
√N,√T ).
(i) If ft is a k-dimensional vector of dynamic factors, then
(1− λj) = Op(C−2NT ) for j = 1, . . . , r − k
(ii) There exists a constant M > 0 such that as N →∞ and T →∞
P (1− λj > M)→ 1 for j = r − k + 1, . . . , r.
(iii) If ft has a VAR(p) representation as in (3) with p ≤ m and rk(A) = (m+1)k
then, as T →∞ and N →∞,
T
r∑j=r−k+1
λjd→ χ2(k2)
Proof: See appendix.
Remark A: The results (i) and (ii) are derived under the fairly weak conditions
of an approximate factor model (e.g. Bai, 2003, Bai and Ng, 2002 and Stock and
7
Watson, 2002a). They do not require restrictive assumptions on the dynamic
process generating ft with the exception that T−1∑T
t=1 ftf′t
p→ Σf , where Σf is
a positive definite matrix. This assumption rules out that (some of) the factors
are I(1) but allows for a wide range of stationary processes. For practical appli-
cations, the choice of the lag order m is an important issue, as it ensures that
all lags in the factor model are contained in the vector Gt−1. So far, there does
not exist a statistical criterion to choose the lag length m. In practice it seems
reasonable to try out various values of m to find out the minimum lag length
necessary to obtain stable results.
Remark B: The additional assumptions of a finite order VAR representation
of the factors with p ≤ m and rk(A) = (m + 1)k are required for part (iii)
of the theorem. These assumptions ensure that p lags of the dynamic factors
are contained in Ft−1. If these additional assumptions are satisfied, a powerful
selection procedure can be constructed (see below), since the contrast between
the eigenvalues are maximal, i.e. r − k eigenvalues converge to unity, whereas
the remaining k eigenvalues converge to zero. Unfortunately, in practice it is not
known whether these conditions are fulfilled. Therefore, a reasonable selection
strategy is to apply selection procedures based on (ii) and (iii). If both procedures
find the same number of dynamic components one can be confident that this
number is a good choice.
Remark C: The term (1− λj) for j = 1, . . . , r − k is stochastically bounded by
a complicated function of N and T . Following Bai (2003) and Bai and Ng (2002)
the limiting properties are expressed by using the minimum function CNT . For
practical applications the use of this function suffers from an important drawback.
The term CNT is identical for fairly small sample sizes (e.g. N = 50, T = 50) and
for very large samples (such as N = 1000 and T = 50), although the estimation
error is considerably smaller in the latter case. In order to take into account the
relative sizes of the two sampling dimensions we employ an alternative rate CNT
with the property
CNTCNT
→ κ with 0 < κ <∞.
Specifically, our preferred selection criterion employs
C−2NT =
a
N+a
T.
This function has the property that a ≤ C2NT/C
2NT ≤ 2a and, therefore, the lim-
iting result presented in part (i) of the theorem can be alternatively represented
8
as
(1− λj) = Op(C−2NT ) for j = 1, . . . , r − k. (13)
Based on the theorem we can construct two selection rules for the number of
dynamic factors. Using the modified rate CNT of Remark C the first selection
criterion is constructed as
ξ(k∗) = C2−δNT
r−k∗∑j=1
(1− λj) , (14)
where 0 < δ < 2. The theorem implies that under the hypothesis that k∗ is
equal to the correct number of factors k, then the largest r − k∗ eigenvalues
λ1, · · · , λr−k∗ tend to unity and, thus, ξ(k∗) tends to zero. Under the assumption
that k∗ < k, the eigenvalue λr−k∗ has a probability limit smaller than one and,
therefore, the statistic ξ(k∗) tends to infinity. Define ξ(r) = ∞. The number of
dynamic factors can be estimated consistently by the largest number k∗ in the
sequence k∗ = r − 1, r − 2, . . . , 0, where the statistic ξ(k∗) is larger than some
fixed threshold level τ . Thus,
k = max{k∗ : ξ(k∗) > τ}. (15)
In our Monte Carlo simulations we found that a = 20, δ = 0.5 and τ = 1 generally
performs well.
If the conditions of part (iii) of the Theorem are satisfied, the test statistic
LR(k∗) = Tr∑
j=r−k∗+1
λj (16)
can be used to test the null hypothesis H0 : k∗ = k. This test statistic is approx-
imatively equivalent to the likelihood ratio statistic for the hypothesis that the
k∗ smallest eigenvalues of the canonical correlation analysis between Ft and Gt−1
are zero (cf. Tiao and Tsay, 1989). Accordingly, the number of dynamic factors
can be selected by testing the sequence of hypotheses k∗ = r − 1, r − 2, . . . , 1.
For example, if the test rejects for k∗ ≥ 3 but cannot reject for k∗ < 3, then the
maintained number of dynamic factors is k = 2. In other words, the estimated
number of dynamic factors is the largest value of k∗, which does not lead to a re-
jection of the hypothesis. Since it is well known that a selection procedure based
9
on tests with a fixed significance level is not consistent as the error probability
for selecting a smaller number of dynamic factors does not vanish, we construct
an information criterion as
IC(k∗) =r∑
j=r−k∗+1
λj + k∗c(T )
T(17)
where c(T ) is a penalty function. Using part (ii) of the Theorem it is not difficult
to show that the IC(k) is a weakly consistent selection criterion whenever c(T )→∞ and c(T )/T → 0 as T →∞. Consider the difference
IC(k∗)− IC(k∗ + 1) = λr−k∗ − c(T )/T.
It follows that k∗+1 is selected if T λr−k∗ > c(T ). If k ≥ k∗ then T λr−k∗ is Op(1),
whereas T λr−k∗ is Op(T ) if k < k∗. Therefore, a consistent selection rule requires
c(T )→∞ and c(T )/T → 0 as T →∞.
4 Finite sample properties
In the following we analyze the finite sample properties of the alternative selec-
tion procedures and present an empirical application of the new methods to the
dataset used in Stock and Watson (2005).
4.1 Simulation study
To investigate the performance of the proposed procedures to select the number
of dynamic factors, some Monte Carlo experiments were conducted. The data
are generated based on the model
xt = A0ft + A1ft−1 + ut , (18)
where the components of the k-dimensional vector ft are independent Gaussian
AR(1) processes, i.e. fit = γifit−1 + εit for i = 1, . . . , k with εiti.i.d.∼ N(0, 1− γ2
i ).
The elements of the N × k matrices A0 and A1 are i.i.d. standard normal.
The idiosyncratic errors are generated as uti.i.d.∼ N(0, 4ψIN) with ψ controlling
the signal–to–noise ratio. We also tried out alternative models but the general
conclusions remain the same. All results are based on 10,000 replications of the
model.
To investigate the ability of the selection procedures to identify the correct
number of dynamic factors k, we simulate data according to the above model with
10
k = 2 and the following two sets of autocorrelation coefficients: γ1 = γ2 = 0.5
and γ1 = 0.2, γ2 = 0.8. We further set ψ = 2.5, 1, and 0.5, corresponding to low,
medium and high signal–to–noise–ratios.
The performance of the newly proposed procedures is compared to the se-
lection criteria of Amengual and Watson (2007), Stock and Watson (2005) and
Bai and Ng (2007). Note that for the ease of exposition we only report results
for the criteria that have been shown to perform best in previous Monte Carlo
simulations. We therefore consider the information criterion of Amengual and
Watson (2007) and Stock and Watson (2005)
SWP2 = lnV (k∗) + k∗N + T
NTln min{N, T} (19)
with V (kk) denoting the sum of squared residuals (divided by NT ) of the factor
model for the innovation vector ξt as given in (10), if k∗ factors are estimated.
This resembles the ICP2 criterion suggested by Bai and Ng (2002) for the selection
of the number of static factors from the dataset. We further report results for
the selection rule of Bai and Ng (2007) based on the D1 statistics, see equations
(8) and (6). Following Bai and Ng (2007) we set m∗ = 1.
To concentrate on the properties of the various selection procedures, we as-
sume that the number of reduced form factors r is known. In practice, the
information criteria suggested by Bai and Ng (2002) can be used to obtain a
consistent estimate of r. We further base the construction of the D1 statistic on
a VAR(1) model for the factors and treat the number of lags of Ft and xit in the
regressions of the selection procedure of Amengual and Watson (2007) and Stock
and Watson (2005) as given. The computation of the new selection rules (14)
and (16) is based on one lag in the generalized eigenvalue problem, i.e. m = 1.
For the selection criterion ξ(k∗) we further set a = 20, δ = 0.5 and τ = 1, as
these values have been found to perform well in a wide range of Monte Carlo
simulations.
Tables 1 and 2 present the performance of different selection criteria for the
two different sets of the autoregressive parameter values. Reported are the fre-
quencies (in percent) of choosing the correct number of dynamic factors. The
results show that our selection procedures based on canonical correlations out-
perform the existing ones especially in small sample sizes. Moreover, these new
procedures find out the correct number of dynamic factors with a probability
converging to one and to 95% for the ξ(k∗) criterion and the LR(k∗) test, respec-
tively. In the tables we further present the success rates of the various criteria
11
for different values of ψ. Interestingly, the new selection criteria are quite robust,
while the selection procedures based on principal components seem to be very
sensitive to the signal–to–noise ratio. In particular, for low signal–to–noise ratios
these procedures, especially the SWP2 criterion, have difficulties in determining
the correct number of factors. This becomes even more pronounced in the pres-
ence of more persistent factors (see Table 2, which presents the success rates for
the parameter set γ1 = 0.2 and γ2 = 0.8). The performance of the new selection
procedures instead seem to be also robust to changes in the factor persistence.
As mentioned previously, the LR test is only a powerful selection criteria
if rk(A) = (m + 1)k. As this assumption is difficult to verify in practice, we
also investigate the finite sample performance of the procedure in the case of the
violation of this assumption. In particular, we consider the above factor model
with k = 2 and A1 having zero second column, which implies that r = 3. Table 3
presents the corresponding success rates of the different selection procedures for
γ1 = γ2 = 0.5. As expected the LR(k∗) test is not able to identify the correct
number of dynamic factors, while the previous conclusions for the alternative
criteria also hold for this data generation process. As the selection procedures
based on canonical correlations are both very powerful if rk(A) = (m + 1)k,
we suggest for empirical applications to compute both criteria. If both come to
similar conclusions, then this indicates that the assumption is not violated.
4.2 Empirical application
In the following we apply our selection procedures to a large U.S. macroeco-
nomic dataset constructed by Stock and Watson (2005)6. The data comprises
132 monthly macroeconomic time series ranging from January 1960 to December
2003, i.e. T = 528. For our empirical application we perform the same data
transformations as in Stock and Watson (2005). The dataset has been widely
used to test for the number of static and dynamic factors driving the U.S. econ-
omy. Bai and Ng (2007) and Stock and Watson (2005), for example, select seven
static factors based on the information criteria of Bai and Ng (2002). Given these
seven factors, their selection procedures indicate four and seven dynamic factors,
respectively.
Using the ICP2 information criteria of Bai and Ng (2002) we also find seven
6The dataset is kindly provided at Mark Watson’s homepage: http://www.princeton.edu/~mwatson/.
12
static factors to which we apply our selection procedures. Table 4 shows the
statistic ξ(k∗) and the LR(k∗) test statistics using m = 1 in the generalized
eigenvalue problem (12). The first selection criterion ξ(k∗) selects k = 4 dynamic
factors, while the LR(k∗) test indicates one dynamic factor. The observation
that both criteria choose a different number of dynamic factors suggests that the
rank conditions of part (iii) of the theorem are not fulfilled and we thus rely in
the following on the ξ(k∗) criterion.
For the accuracy of the ξ(k∗) criteria, we need to ensure that all lags in the
factor model are also contained in the vector Gt−1. We therefore also consider
larger values of m. The results for m ranging from 2 to 4 are presented in Table
5. Interestingly, the selected number of dynamic factor remains at k = 4 for all
lags, which indicates that a lag order of one is sufficient. We therefore conclude
that there are four dynamic factors that drive the U.S. economy - a result that
is consistent with the findings of Bai and Ng (2007).
5 Conclusion
In this paper two identification procedures are suggested that allow to estimate
the number of dynamic factors (or “structural factors”) from a set of static factors
(or “reduced form factors”) that are obtained from a principal component analysis
of a large number of observed series, see Stock and Watson (2002a,b). While the
procedures of Bai and Ng (2007), Amengual and Watson (2007) and Stock and
Watson (2005) use the method of principal components to determine the number
of dynamic factors, our approach is based on a canonical correlation analysis
(CCA) of the static factors. An important advantage of our approach is that
it is invariant to the normalization of the factor space and does not assume a
finite order VAR model for the vector of common factors. Furthermore, our
Monte Carlo simulations indicate that the new criteria generally outperform the
alternative selection criteria and are much more reliable especially in the presence
of low signal–to–noise ratios and/or persistent factors. Applying the new criteria
to the dataset used in Stock and Watson (2005) yields that seven static factors
can be represented by four dynamic factors and three lags which corroborates the
earlier findings of Bai and Ng (2007).
13
Appendix A: Proof of the Theorem
We first state the following lemma:
Lemma A.1: Under Assumptions A–F of Bai (2003) it holds for any fixed j
that
(a) T−1
T∑t=j+1
(Ft − Ft)F ′t−j = Op(C−2NT ), T−1
T∑t=j+1
(Ft − Ft)F ′t−j = Op(C−2NT )
(b) T−1
T∑t=j+1
FtF′t−j = T−1
T∑t=j+1
FtF′t−j +Op(C
−2NT )
Proof: The proof follows closely the proof of Lemma B.2 and Lemma B.3 of
Bai (2003) and can be found in Breitung and Tenhofen (2008, Lemma A.1). �
To proof part (i) of the theorem, let F = [Fm+1, . . . , FT ]′, F = [Fm+1, . . . , FT ]′,
G = [Gm, . . . , GT−1]′, G = [Gm, . . . , GT−1]
′ and Gt−1 = [Ft−1, . . . , Ft−m]′. From
Lemma A.1 it follows that
T−1F ′F = T−1F ′F +Op(C−2NT ), T−1F ′G = T−1F ′G+Op(C
−2NT ),
T−1G′G = T−1G′G+Op(C−2NT )
and, therefore, λj = λ0j + Op(C
−2NT ), where λ0
j denotes the eigenvalue from the
eigenvalue problem
|λ0jF′F − F ′G(G′G)−1G′F | = 0. (20)
The eigenvalues of can be written as
λ0j =
v′jF′G(G′G)−1G′Fvj
v′jF′Fvj
.
If Ft contains r − k lagged values up to the maximal lag m that is used to
construct Gt−1 then there exist r − k linear independent combinations of the
form v′jFt = w′jGt−1 where wj is some mr× 1 vector. Therefore, the r− k largest
eigenvalues λ01, . . . , λ
0r−k are equal to one. This implies that for j = 1, . . . , r − k
we have (1− λj) = Op(C−2NT ).
To proof part (ii) we first note that if there exist r dynamic factors, then
the covariance matrix of the projection residual ηt = Ft − E(Ft|Gt−1) is positive
definite. Hence,
1− λ0j =
v′jF′Fvj − v′jF ′G(G′G)−1G′Fvj
v′jF′Fvj
14
and by the weak law of large numbers there exists a constant M such that
limT→∞ P (1 − λ0j > M) = 1 for j = r − k + 1, . . . , r. Finally, if N → ∞
and T → ∞, then P (1 − λj > M − Op(C−2NT )) will converge to unity as well for
j = r − k + 1, . . . , r.
Under the conditions stated in part (iii) of the theorem, all lags of the VAR
representation of Ft enter Gt−1 and, thus, there exist r − k vectors vj, j =
r − k + 1, . . . , k such that v′jFt = w′jut is white noise. The following lemma
presents the limiting distribution of the sum of eigenvalues in this situation.
Lemma A.2. Let yt = [z′t, y′2t]′ and xt = [z′t, x
′2t]′, where x2t and y2t are m × 1
and zt is a n × 1 vector. Furthermore E(xt) = E(yt) = 0 and E(xtx′t) = Σx,
E(yty′t) = Σy. Assume that there exist m linear combinations [w′1yt, . . . , w
′myt]
′ ≡W ′yt with E(W ′yt|xt) = 0 and V ar(W ′yt|xt) = Σm, where Σm is positive definite.
Then, as T →∞
Tn+m∑i=n+1
µi + op(1)d→ χ2(m2),
where µi denotes the eigenvalues (in descending order) of
|µSyy − SyxS−1xx S
′xy| = 0 , (21)
and
Sab = T−1
T∑t=1
atb′t, a, b ∈ {x, y}.
Proof: Let x2t (y2t) denote the projection residuals of x2t (y2t) on zt, and
xt =
[ztx2t
], yt =
[zty2t
].
Accordingly we define
Sxx =1
T
T∑t=1
xtx′t =
[Szz 00 Sx2x2
]
Syy =1
T
T∑t=1
yty′t =
[Szz 00 Sy2y2
]
Sxy =1
T
T∑t=1
xty′t =
[Szz 00 Sx2y2
].
15
Since zt enters both xt and yt, the first n eigenvalues are unity and the corre-
sponding eigenvectors are [v1, . . . , vn] = V = [In, 0]′. The remaining matrix of m
eigenvectors can be normalized such that WT = [wT1, . . . , wTm]′ = [0, Im]′. The
eigenvalues µn+1, . . . , µn+m can be written as
µn+j = w′TjS−1/2yy S ′xyS
−1xx SxyS
−1/2yy wTj
= e′jS−1/2y2y2
S ′x2y2S−1x2x2
Sx2y2S−1/2y2y2
ej,
where ej is the j’th column of Im and
n+m∑i=n+1
µi = tr{S−1/2y2y2
S ′x2y2S−1x2x2
Sx2y2S−1/2y2y2
}.
Using Sy2y2p→ Σy2y2 , Sx2x2
p→ Σx2x2 and√Tvec(Σ
−1/2y2y2
S ′x2y2Σ−1/2x2x2
)d→ N(0, Im2)
it follows that T∑n+m
j=n+1 µj has an asymptotic χ2 limiting distribution with m2
degrees of freedom. �
Under the conditions of part (iii) of the theorem we can find rotations Q1Ft =
[z′t, x′t]′ and Q2Gt−1 = [z′t, y
′t]′ that fulfill the conditions of Lemma 2. Therefore,
the sum of the k smallest eigenvalues have a χ2 limiting distribution with k2
degrees of freedom. Let[z∗tx∗t
]= QFt and
[z∗ty∗t
]= QGt−1
and denote by xt and yt the respective residuals from a projection on z∗t . Then,
by using the results in part (i) of the theorem
T−1/2
T∑t=m+1
x∗t y′∗t = T−1/2
T∑t=m+1
xty′t +Op(
√T/C2
NT )
T−1
T∑t=m+1
x∗t x′∗t = T−1
T∑t=m+1
xtx′t +Op(1/C
2NT )
T−1
T∑t=m+1
y∗t y′∗t = T−1
T∑t=m+1
yty′t +Op(1/C
2NT )
and, therefore,
Tr∑
j=r−k+1
λj = T
r∑r−k+1
µj +Op(√T/C2
NT )
where the eigenvalues µj are defined as in Lemma A.2 with m = k. It follows
that T∑r
j=r−k+1 λj has a χ2 distribution with k2 degrees of freedom.
16
References
Amengual, D. and Watson, M. (2007), “Consistent Estimation of the Number of
Dynamic Factors in a Large N and T Panel,” Journal of Business & Economic
Statistics, 25, 91–96.
Angelini, E., Henry, J., and Mestre, R. (2001), “Diffusion Index-based Inflation
Forecasts for the Euro Area,” ECB Working Paper 61.
Bai, J. (2003), “Inferential Theory for Factor Models of Large Dimensions,”
Econometrica, 71, 135–172.
Bai, J. and Ng, S. (2002), “Determining the Number of Factors in Approximate
Factor Models,” Econometrica, 70, 191–221.
— (2007), “Determining the Number of Primitive Shocks in Factor Models,”
Journal of Business & Economic Statistics, 25, 52–60.
Breitung, J. and Tenhofen, J. (2008), “GLS Estimation of Dynamic Factor Mod-
els,” Working paper, http://www.ect.uni-bonn.de/mitarbeiter/breitung/pc-
gls.pdf.
Brisson, M., Campbell, B., and Galbraith, J. (2003), “Forecasting Some Low-
predictability Time Series Using Diffusion Indices,” Journal of Forecasting, 22,
515–531.
Eickmeier, S. and Ziegler, C. (2008), “How Successful Are Dynamic Factor Models
at Forecasting Output and Inflation? A Meta-Analytic Approach,” Journal of
Forecasting, 27, 237–265.
Favero, C., Marcellino, M., and Neglia, F. (2005), “Principal Components at
Work: The Empirical Analysis of Monetary Policy with Large Data Sets,”
Journal of Applied Econometrics, 20, 603–620.
Forni, M., Hallin, M., Lippi, M., and Reichlin, L. (2000), “The Generalized Dy-
namic Factor Model: Identification and Estimation,” The Review of Economics
and Statistics, 82, 540–554.
Forni, M. and Lippi, M. (2001), “The Generalized Factor Model: Representation
Theory,” Econometric Theory, 17, 1113–1141.
17
Forni, M., Lippi, M., and Reichlin, L. (2008), “Opening the Black Box: Structural
Factor Models versus Structural VARs,” Econometric Theory, forthcoming.
Forni, M. and Reichlin, L. (1998), “Lets Get Real: A Factor-Analytical Approach
to Disaggregated Business Cycle Dynamics,” Review of Economic Studies, 65,
453–473.
Giannone, D., Reichlin, L., and Sala, L. (2002), “Tracking Greenspan: Systematic
and Unsystematic Monetary Policy Revisited,” CEPR Discussion Paper 3550.
— (2006), “VARs, Factor Models and the Empirical Validation of Equilibrium
Business Cycle Models,” Journal of Econometrics, 132, 257–279.
Hallin, M. and Liska, R. (2007), “Determining the Number of Dynamic Factors
in the General Dynamic Factor Model,” Journal of the American Statistical
Association, 102, 603–617.
Jacobs, J. and Otter, P. (2008), “Determining the Number of Factors and Lag Or-
der in Dynamic Factor Models: A Minimum Entropy Approach,” Econometric
Reviews, 27, 398–427.
Marcellino, M., Stock, J., and Watson, M. (2003), “Macroeconomic Forecasting
in the Euro Area: Country Specific versus Area-Wide Information,” European
Economic Review, 47, 1–18.
Pesaran, M. H. (2006), “Estimation and Inference in Large Heterogeneous Panels
with a Multifactor Error Structure,” Econometrica, 74, 967–1012.
Stock, J. and Watson, M. (1999), “Forecasting Inflation,” Journal of Monetary
Economics, 44, 293–335.
— (2002a), “Forecasting Using Principal Components from a Large Number of
Predictors,” Journal of the American Statistical Association, 97, 1167–79.
— (2002b), “Macroeconomic Forecasting Using Diffusion Indexes,” Journal of
Business & Economic Statistics, 20, 147–162.
— (2005), “Implications of Dynamic Factor Models for VAR Analysis,” NBER
Working Paper 11467.
Tiao, G. and Tsay, R. (1989), “Model Specification in Multivariate Time Series,”
Journal of the Royal Statistical Society, Series B, 51, 157–213.
18
Table 1: Rates of success of model selection criteria for r = 4, k = 2 andγ1 = γ2 = 0.5
SWP2 D1 ξ LR SWP2 D1 ξ LRψ = 2.5
N T = 50 T = 10020 0.0021 0.0081 0.5345 0.3708 0.0024 0.0057 0.2034 0.833850 0.0107 0.0007 0.5336 0.8697 0.0085 0.0155 0.3078 0.9464100 0.0096 0.3104 0.9125 0.9399 0.0537 0.1190 0.8569 0.9497150 0.0367 0.7794 0.9903 0.9436 0.3292 0.7072 0.9908 0.9469200 0.0648 0.9540 0.9987 0.9382 0.6074 0.9704 0.9999 0.9435N T = 150 T = 20020 0.0026 0.0076 0.1262 0.9339 0.0036 0.0105 0.0997 0.942350 0.0314 0.0174 0.1840 0.9456 0.0622 0.0220 0.1074 0.9506100 0.3353 0.1649 0.7423 0.9479 0.6269 0.1894 0.6025 0.9501150 0.6603 0.5049 0.9791 0.9454 0.9377 0.5918 0.9591 0.9443200 0.9327 0.9406 0.9995 0.9433 0.9900 0.8653 0.9991 0.9482
ψ = 1N T = 50 T = 10020 0.2772 0.3531 0.9863 0.8658 0.4644 0.4586 0.9818 0.951050 0.4876 0.6840 0.9990 0.9402 0.9167 0.8390 1.0000 0.9455100 0.9091 0.9963 1.0000 0.9371 0.9982 0.9955 1.0000 0.9470150 0.9789 0.9999 1.0000 0.9410 1.0000 1.0000 1.0000 0.9476200 0.9907 1.0000 1.0000 0.9389 1.0000 1.0000 1.0000 0.9426N T = 150 T = 20020 0.5367 0.5029 0.9833 0.9509 0.9345 0.9174 0.9846 0.945950 0.9832 0.8879 1.0000 0.9472 1.0000 0.9997 1.0000 0.9487100 1.0000 0.9991 1.0000 0.9460 1.0000 1.0000 1.0000 0.9478150 1.0000 1.0000 1.0000 0.9440 1.0000 1.0000 1.0000 0.9426200 1.0000 1.0000 1.0000 0.9440 1.0000 1.0000 1.0000 0.9486
ψ = 0.5N T = 50 T = 10020 0.7948 0.8430 0.8816 0.9388 0.9028 0.8950 1.0000 0.947750 0.9832 0.9899 0.9992 0.9377 1.0000 0.9990 1.0000 0.9452100 0.9999 1.0000 1.0000 0.9341 1.0000 1.0000 1.0000 0.9464150 1.0000 1.0000 1.0000 0.9402 1.0000 1.0000 1.0000 0.9466200 1.0000 1.0000 1.0000 0.9373 1.0000 1.0000 1.0000 0.9430N T = 150 T = 20020 0.9216 0.9127 1.0000 0.9469 0.9461 0.9234 1.0000 0.945950 1.0000 0.9991 1.0000 0.9469 1.0000 0.9997 1.0000 0.9488100 1.0000 1.0000 1.0000 0.9457 1.0000 1.0000 1.0000 0.9481150 1.0000 1.0000 1.0000 0.9446 1.0000 1.0000 1.0000 0.9425200 1.0000 1.0000 1.0000 0.9437 1.0000 1.0000 1.0000 0.9468
19
Table 2: Rates of success of model selection criteria for r = 4, k = 2 andγ1 = 0.2, γ2 = 0.8
SWP2 D1 ξ LR SWP2 D1 ξ LRψ = 2.5
N T = 50 T = 10020 0.0005 0.0118 0.6295 0.3687 0.0003 0.0049 0.4347 0.851150 0.0000 0.0029 0.4348 0.7852 0.0000 0.0035 0.2959 0.9485100 0.0000 0.1224 0.6504 0.9196 0.0001 0.0300 0.5567 0.9475150 0.0000 0.4455 0.8192 0.9334 0.0003 0.3868 0.8326 0.9451200 0.0000 0.7207 0.9102 0.9419 0.0025 0.8306 0.9515 0.9485N T = 150 T = 20020 0.0004 0.0051 0.3933 0.9372 0.0003 0.0072 0.3854 0.946450 0.0002 0.0070 0.2370 0.9484 0.0003 0.0072 0.1941 0.9478100 0.0005 0.0603 0.5297 0.9452 0.0031 0.0929 0.4883 0.9477150 0.0008 0.2468 0.8328 0.9470 0.0137 0.3587 0.8144 0.9502200 0.0126 0.7882 0.9590 0.9457 0.0323 0.6887 0.9574 0.9467
ψ = 1N T = 50 T = 10020 0.1064 0.2935 0.9747 0.8043 0.1527 0.3793 0.9810 0.944150 0.0659 0.4725 0.9784 0.9410 0.3071 0.7301 0.9939 0.9453100 0.2950 0.9780 0.9982 0.9432 0.5732 0.9808 1.0000 0.9443150 0.4498 0.9993 0.9995 0.9370 0.8849 1.0000 1.0000 0.9438200 0.5500 1.0000 1.0000 0.9404 0.9645 1.0000 1.0000 0.9476N T = 150 T = 20020 0.1892 0.4440 0.9894 0.9505 0.2026 0.4698 0.9927 0.946750 0.4946 0.8166 0.9983 0.9473 0.6212 0.8563 0.9990 0.9481100 0.8975 0.9944 1.0000 0.9451 0.9742 0.9978 1.0000 0.9482150 0.9800 0.9999 1.0000 0.9451 0.9992 1.0000 1.0000 0.9444200 0.9981 1.0000 1.0000 0.9453 0.9999 1.0000 1.0000 0.9474
ψ = 0.5N T = 50 T = 10020 0.5597 0.7894 0.9375 0.9241 0.7324 0.8589 0.9996 0.945050 0.7240 0.9726 0.9997 0.9394 0.9681 0.9956 1.0000 0.9458100 0.9596 0.9999 1.0000 0.9395 0.9992 1.0000 1.0000 0.9460150 0.9596 0.9999 1.0000 0.9333 0.9992 1.0000 1.0000 0.9408200 0.9959 1.0000 1.0000 0.9380 1.0000 1.0000 1.0000 0.9453N T = 150 T = 20020 0.7937 0.8842 1.0000 0.9488 0.8202 0.8912 1.0000 0.946150 0.9935 0.9967 1.0000 0.9489 0.9983 0.9988 1.0000 0.9449100 1.0000 1.0000 1.0000 0.9432 1.0000 1.0000 1.0000 0.9483150 1.0000 1.0000 1.0000 0.9452 1.0000 1.0000 1.0000 0.9440200 1.0000 1.0000 1.0000 0.9446 1.0000 1.0000 1.0000 0.9469
20
Table 3: Rates of success of model selection criteria for r = 3, k = 2 andγ1 = γ2 = 0.5
SWP2 D1 ξ LR SWP2 D1 ξ LRψ = 2.5
N T = 50 T = 10020 0.0009 0.0357 0.9396 0.8660 0.0014 0.0264 0.9988 0.541650 0.0001 0.0729 0.9943 0.6179 0.0075 0.0741 0.9986 0.1727100 0.0096 0.5861 0.9949 0.4696 0.0516 0.3151 0.9999 0.0721150 0.0311 0.9088 0.9962 0.4042 0.3085 0.8579 1.0000 0.0523200 0.0550 0.9856 0.9952 0.3751 0.5999 0.9910 1.0000 0.0375N T = 150 T = 20020 0.0030 0.0268 0.9999 0.3068 0.0021 0.0235 1.0000 0.161650 0.0298 0.0806 0.9990 0.0317 0.0598 0.0839 0.9998 0.0049100 0.3278 0.3718 1.0000 0.0075 0.6280 0.4123 1.0000 0.0004150 0.6676 0.7267 1.0000 0.0038 0.9357 0.7802 1.0000 0.0003200 0.9322 0.9767 1.0000 0.0036 0.9907 0.9480 1.0000 0.0000
ψ = 1N T = 50 T = 10020 0.2607 0.5445 0.3914 0.6147 0.4303 0.6249 0.7579 0.192350 0.4577 0.8509 0.8707 0.4213 0.9172 0.9358 0.9990 0.0600100 0.9068 0.9989 0.9499 0.3531 0.9977 0.9990 1.0000 0.0364150 0.9763 1.0000 0.9624 0.3273 1.0000 1.0000 1.0000 0.0309200 0.9903 1.0000 0.9715 0.3228 1.0000 1.0000 1.0000 0.0249N T = 150 T = 20020 0.5161 0.6702 0.8870 0.0581 0.5640 0.6824 0.9402 0.013250 0.9811 0.9544 1.0000 0.0044 0.9944 0.9638 1.0000 0.0004100 1.0000 0.9997 1.0000 0.0019 1.0000 1.0000 1.0000 0.0000150 1.0000 1.0000 1.0000 0.0021 1.0000 1.0000 1.0000 0.0000200 1.0000 1.0000 1.0000 0.0011 1.0000 1.0000 1.0000 0.0000
ψ = 0.5N T = 50 T = 10020 0.8072 0.9279 0.0502 0.4641 0.9370 0.9566 0.1733 0.090850 0.9847 0.9977 0.6433 0.3536 1.0000 0.9997 0.9878 0.0372100 0.9998 1.0000 0.8827 0.3187 1.0000 1.0000 0.9999 0.0272150 1.0000 1.0000 0.9316 0.3050 1.0000 1.0000 1.0000 0.0262200 1.0000 1.0000 0.9505 0.3064 1.0000 1.0000 1.0000 0.0216N T = 150 T = 20020 0.9607 0.9631 0.2989 0.0142 0.9675 0.9693 0.3726 0.001550 1.0000 0.9997 0.9997 0.0018 1.0000 0.9998 1.0000 0.0001100 1.0000 1.0000 1.0000 0.0009 1.0000 1.0000 1.0000 0.0000150 1.0000 1.0000 1.0000 0.0018 1.0000 1.0000 1.0000 0.0000200 1.0000 1.0000 1.0000 0.0008 1.0000 1.0000 1.0000 0.0000
21
Table 4: Selection criteria for the number of dynamic factors in the U.S. economy
k∗ λ ξ(k∗) LR(k∗) crit. val.1 0.0422 3.0625 1.5• 3.84152 0.1706 2.1048 23.7 9.48773 0.2237 1.2754 113.5 16.91904 0.7577 0.4991• 231.1 26.29625 0.8596 0.2567 629.6 37.65256 0.8837 0.1163 1081.8 50.9985
Reported are the selection criteria ξ(k∗) and LR(k∗) for differ-ent values of k∗ using m = 1. Note, that the critical value ofthe ξ(k∗) statistic is C−2+δ
NT =0.2873. The first column reportsthe eigenvalues of (12) with m = 1, the last columns reportsthe critical values of the LR–test. • indicates those criteriathat select the number of dynamic factors.
Table 5: Selection criteria ξ(k∗) based on different values of m
k∗ m=2 m=3 m=40 3.7484 3.6428 3.52211 2.8051 2.7212 2.61372 1.8827 1.8203 1.74503 1.1119 1.0731 1.05064 0.4218• 0.3980• 0.3883•
5 0.2244 0.2151 0.20956 0.1047 0.0981 0.0942
Reported are the selection criteria ξ(k∗) for differentvalues of k∗ and different lag lengths m. The criticalvalue of the ξ(k∗) statistic is C−2+δ
NT =0.2873. • indicatesthose criteria that select the number of dynamic factors.
22