Department of Economics Zongwu Cai University of …...Seminar Series Zongwu Cai University of Kansas “A New Semiparametric Quantile Panel Data Model: Theory and Applications ”

Department of Economics Seminar Series

Zongwu Cai University of Kansas

“A New Semiparametric Quantile Panel Data Model: Theory and Applications”

Friday, September 11, 2015

3:30 p.m. 212 Middlebush

A New Semiparametric Quantile Panel Data Model: Theory and Applications

Zongwu Cai

Department of Economics, University of Kansas, e-mail: [email protected] WISE, Xiamen University, [email protected]

ABSTRATC: In this paper, we propose a new semiparametric quantile panel data model with correlated random effects in which some of the coefficients are allowed to depend on some smooth economic variables while other coefficients remain constant. A three-stage estimation procedure is proposed to estimate both constant and functional coefficients based on the integrated quasi-likelihood approach and their asymptotic properties are investigated. We show that the estimator of constant coefficients is root-N consistent and the estimator of varying coefficients converges in a nonparametric rate. A Monte Carlo simulation is conducted to examine the finite sample performance of the proposed estimators. Finally, the proposed semiparametric quantile panel data model is applied to estimating the impact of foreign direct investment (FDI) on economic growth using the cross-country data from 1970 to 1999.

This is a join work with Dr. Linna Chen and Dr. Ying Fang.

Keywords: Correlated Random Effect; Foreign Direct Investment; Panel Data; Quantile Regression Model; Quasi-likelihood; Semi-parametric Model; Varying Coefficient Model.

mailto:[email protected]

mailto:[email protected]

A Quantile Panel Data Model with Partially Varying

Coefficients and Correlated Random Effects∗

Zongwu Caia,b, Linna Chena, and Ying Fangb

aDepartment of Economics, The University of Kansas, Lawrence, KS 66045, USAbWang Yanan Institute for Studies in Economics, MOE Key Laboratory of Econometrics, and

Fujian Key Laboratory of Statistical Science, Xiamen University, Xiamen, Fujian 361005, China

September 1, 2015

Abstract

In this paper, to estimate the impact of FDI on economic growth in host countries,

we propose a new semiparametric quantile panel data model with correlated random

effects, in which some of the coefficients are allowed to depend on some smooth economic

variables while other coefficients remain constant. A three-stage estimation procedure

based on quasi-maximum (local) likelihood estimation (QMLE) is proposed to estimate

both constant and functional coefficients and their asymptotic properties are investigated.

We show that the estimator of constant coefficients is√N consistent and the estimator

of varying coefficients converges in a nonparametric rate. A Monte Carlo simulation is

conducted to examine the finite sample performance of the proposed estimators. Finally,

using the cross-country data from 1970 to 1999, we find a strong empirical evidence of the

existence of the absorptive capacity hypothesis. Moreover, another new finding is that

FDI has a much stronger growth effects for countries with fast economic growth than for

those with slow economic growth.

Keywords: Correlated Random Effect; Foreign Direct Investment; Panel Data; Quantile

Regression Model; Quasi-likelihood; Semiparametric Model; Varying Coefficient Model.

∗Cai’s research is partially supported by the National Natural Science Foundation of China grant #71131008

(Key Project). Fang’s research is partially supported by the National Natural Science Foundation of China

grants #70971113, #71271179 and #71131008 (Key Project).

1

1 Introduction

It is well documented in the growth literature that foreign direct investment (FDI) plays an

important role in the economic growth process in host countries since FDI is often consid-

ered as a vehicle to transfer new ideas, advanced capitals, superior technology and know-how

from developed countries to developing countries and so on. However, the existing empirical

studies provide contradictory results on whether or not FDI promotes an economic devel-

opment in host countries.1 The recent studies in the literature concluded that the mixed

empirical evidences may be due to nonlinearities in FDI effects on the economic growth and

the heterogeneity across countries.

Indeed, it is well recognized by many economists in empirical studies that a standard linear

growth model may be inappropriate for investigating the nonlinear effect of FDI on economic

development. The nonlinearity in FDI effects is mainly due to the so called absorptive capac-

ity in host countries, the fact that host countries need some minimum conditions to absorb

the spillovers from FDI.2 Most existing literature to deal with the nonlinearity issue used

simply some parametric nonlinear models, for example, including an interacted term in the

regression or running a threshold regression. A parametric nonlinear model has the risk of

encountering the model misspecification problem. Misspecified models can lead to biased

estimation and misleading empirical results. Recently, Henderson, Papageorgiou and Parme-

ter (2011) and Kottaridi and Stengos (2010) adopted nonparametric regression techniques

into a growth model. However, due to the curse of dimensionality in a pure nonparametric

estimation, such applications are either restricted by the sample size problem or rely heavily

on the variable selection which is not an easy task.

1For example, Blomstrom and Persson (1983), Blomstrom, Lipsey and Zejan (1992), De Gregorio (1992),

Borensztein, De Gregorio and Lee (1998), De Mello (1999), Ghosh and Wang (2009), Kottaridi and Stengos

(2010) among others found positive effects of FDI on promoting the economic growth in various environments.

On the other hand, many studies including Haddad and Harrison (1993), Aitken and Harrison (1999), Lipsey

(2003), and Carkovic and Levine (2005) failed to find beneficial effects of FDI on the economic growth in

host countries. Grog and Strobl (2001) did a meta analysis of 21 studies using the data from 1974 to 2001

that worked on estimating FDI effects on productivity in host countries, of which 13 studies reported positive

results, 4 studies reported negative effects and the remaining reported inconclusive evidence.2Nunnemkamp (2004) emphasized the importance of the initial condition for host countries to absorb the

positive impacts of FDI, Borensztein, De Gregorio and Lee (1998) found that a threshold stock of human

capital in host countries is necessary for them to absorb beneficial effects of advanced technologies brought

from FDI, and Hermes and Lensink (2003), Alfaroa, Chandab, Kalemli-Ozcan and Sayek (2004) and Durham

(2004) addressed the local financial market conditions of a country’s absorptive capacity.

2

The heterogeneity among countries is another concern in cross-country studies. Grog and

Strobl (2001) found that whether a cross sectional or time series data had been used matters

for estimating the effect of FDI on the economic growth, because both the cross sectional and

time series model cannot control the country-specific heterogeneity. Recent literature focused

on using panel data to estimate growth models, which can control the country-specific un-

observed heterogeneity using individual effects. However, including individual effects which

only allows a location shift for each country, does not have the ability to deal with the hetero-

geneity effect of FDI on the economic growth across the countries. For example, some studies

found that the empirical results changed when the sample included developed countries. The

existing literature to handle this issue is to split sample into groups.3 Generally speaking,

splitting sample can lead to potential theoretical and empirical problems. First, regressing

on the split samples separately may lose other parts of information and degrees of freedom,

which may lead to inefficient estimation. Secondly, the applied researchers often split sample

without following the theoretical guideline of the selection of thresholds.

To deal with the aforementioned two issues (nonlinearities and heterogeneity) in a simul-

taneous fashion, we propose a partially varying-coefficient quantile panel data model with

correlated random effects to estimate the nonlinear effect of FDI on the economic growth

with heterogeneity. Different from the existing literature, we resolve the nonlinearity issue

by employing a partially varying-coefficient model which allows some of coefficients to be

constant but others, reflecting the effects of FDI on the economic growth, to depend on the

country’s initial condition. Compared to a full nonparametric estimation, our model setup

can achieve the dimension reduction and accommodate the well recognized economic theory

such as the absorptive capacity. In addition to using panel data with individual effects which

allows for location shifts for individual countries, we propose a semiparametric conditional

quantile regression model instead of the commonly used conditional mean models. A condi-

tional quantile model can provide more flexible structures than conditional mean models to

characterize heterogeneity among countries. For example, besides including individual effects

allowing country-specific heterogeneity, a conditional quantile model allows different growth

equations for different quantiles. Therefore, we can take the advantage of all sample informa-

3For example, Luiz and De Mello (1999) considered OECD and non-OECD samples and Kottaridi and

Stengos (2010) split the whole sample into high-income and middle-income groups.

3

tion to identify the effect of FDI on the economic growth without splitting sample according

to development stages. Moreover, estimating all quantiles can provide a whole picture of the

conditional distribution and avoid the possibly misleading conditional mean results. In other

words, using the quantile approach can characterize the different roles of FDI in the economic

growth for different types of countries.

The application of conditional quantile model to analyze economic and financial data has

a long history that can be traced to the seminal paper by Koenker and Bassett (1978, 1982);

see the book by Koenker (2005) for more details. Recently, many studies have focused on

nonparametric or semiparametric quantile regression models for either independently iden-

tically distributed (iid) data or time series data. For example, Chaudhuri (1991) studied

nonparametric quantile estimation and derived its local Bahadur representation, He, Ng and

Portnoy (1998), He and Ng (1999), and He and Portnoy (2000) considered nonparametric

estimation using splines, De Gooijer and Zerom (2003), Yu and Lu (2004), and Horowitz and

Lee (2005) focused on additive quantile models, and Honda (2004) and Cai and Xu (2008)

studied varying-coefficient quantile models for time series data. In particular, semiparametric

quantile models have attracted increasing research interests during the recent years due to

their flexibility. For example, He and Liang (2000) investigated the quantile regression of

a partially linear errors-in-variable model, Lee (2003) discussed the efficient estimation of a

partially linear quantile regression, and Cai and Xiao (2012) proposed a partially varying-

coefficient dynamic quantile regression model, among others.

Due to the fact that the approach of taking a difference, which is commonly used in con-

ditional mean panel data (linear) models to eliminate individual effects, is invalid in quantile

regression settings, even for linear quantile regression model, the literature on quantile panel

data models is relatively small. To the best of our knowledge, the paper by Koenker (2004)

is the first paper to consider a linear quantile panel data model with fixed effects, where the

fixed effects are assumed to have pure location shift effects on the conditional quantiles of

the dependent variable but the effects of regressors are allowed to be dependent on quantiles.

Koenker (2004) proposed two methods to estimate such a panel data model with fixed effects.

The first method is to solve a piecewise linear quantile loss function by using interior point

methods and the second one is the penalized quantile regression method, in which the quan-

tile loss function is minimized by adding L1 penalty on fixed effects. Recently, in a penalized

4

quantile panel data regression model as in Koenker (2004), Lamarche (2010) discussed how

to select the tuning parameter, which can control the degree of shrinkage for fixed effects,

whereas Galvao (2011) extended the quantile regression to a dynamic panel data model with

fixed effects by employing the lagged dependent variables as instrumental variables and by ex-

tending Koenker’s first method to Chernozhukov and Hansen (2006)’s quantile instrumental

variable framework. Finally, Canay (2011) proposed a simple two-stage method to estimate

a quantile panel data model with fixed effects. However, the consistency of the estimator in

Canay (2011) relies on the assumption of T going to infinity and the existence of an initial√NT -consistent estimator in the conditional mean model.

An alternative way to deal with individual effects in a panel data model is to treat them

as correlated random effects initiated by Chamberlain (1982, 1984) for the mean regression

model. Under the framework of Chamberlain (1982, 1984), to estimate the effect of birth

inputs on birth weight, Abrevaya and Dahl (2008) employed a linear quantile panel data

model with correlated random effects which are viewed as a linear projection onto some

covariates plus an error term. The identification of the effects of covariates only requires two-

period information. Furthermore, Gamper-Rabindram, Khan and Timmins (2008) estimated

the impact of piped water provision on infant mortality by adopting a linear quantile panel

data model with random effects where the random effects were allowed to be correlated with

covariates nonparametrically. The model can be estimated through a two-step procedure, in

which some conditional quantiles were estimated nonparametrically in the first step and in the

second step, the coefficients are estimated by regressing the differenced estimated quantiles

on the differenced covariates.

The motivation of this study is to examine the role of FDI in the economic growth process

based on the cross-country data from 1970 to 1999 by using the proposed partially varying-

coefficient quantile regression model for panel data with correlated random effects. Indeed,

this model includes the models in Lee (2003), Cai and Xu (2008), and Cai and Xiao (2012) as

special cases. In contrast to Koenker (2004), Galvao (2011), and Canay (2011) by requiring

that both N and T go to infinity in their asymptotics, our model requires only N going

to infinity with T possibly fixed. Actually, T ≥ 2 is required, and indeed, it is an impor-

tant assumption for identification; see Abrevaya and Dahl (2008) and Assumption A7 later.

Also, different from Abrevaya and Dahl (2008) and Gamper-Rabindram, Khan and Timmins

5

(2008), we use a partially varying-coefficient structure in the conditional quantile model to

provide more flexibility in model specification than a linear model. Based on this empirical

study, there some novel findings. For the countries with fast economic growth, there is a

strong evidence that the initial condition of GDP benefits to the effect of FDI on economic

growth when Ui is bigger than 8.2 for the host country. However, on the other hand, for

the countries with slow economic growth, the initial condition of GDP benefits to the effect

of FDI on economic growth when Ui is smaller than 8.2 for the host country. The detailed

analysis of this empirical example is reported in Section 4.

The rest of the paper is organized as follows. In Section 2, we introduce a partially

varying-coefficient quantile panel data model with correlated random effects and propose a

three-stage estimation procedure. Furthermore, the asymptotic properties of our estimators

are established. In Section 3, a simulation study is conducted to examine the finite sample

performance. Section 4 is devoted to reporting the empirical results of the cross-country

panel data growth model. Section 5 concludes.

2 Econometric Modeling

2.1 Model Setup

2.1.1 The Model

In this paper, we consider the following partially varying-coefficient panel data quantile model

with correlated random effects, in which there are both constant coefficients and varying

coefficients. Let Yit, a scalar dependent variable, be the observation on ith individual at time

t for 1 ≤ i ≤ N and 1 ≤ t ≤ T . The conditional quantile model is given by

Qτ (Yit | Uit,Xit, αi) = X ′it,1γτ +X ′it,2βτ (Uit) + αi, (1)

where Qτ (Yit | Uit,Xit, αi) is the τth quantile of Yit given Uit, Xit, and αi. Here, Xit,1 and

Xit,2 are regressors with K1×1 and K2×1 dimensions, respectively, Xit = (X ′it,1,X′it,2)

′, γτ

denotes a K1×1 vector of constant coefficients, βτ (Uit) denotes a K2×1 vector of functional

coefficients, Uit is an observable scalar smoothing variable4 , and αi is an individual effect. In

the quantile panel data literature, Koenker (2004) treated αi as a fixed effect and proposed

4For simplicity, we only consider the univariate case for the smoothing variable. The estimation procedure

and asymptotic results still hold for the multivariate case with much complicated notation.

6

a penalized quantile regression method which requires both N and T go to infinity. In this

paper, following Abrevaya and Dahl (2008) and Gamper-Rabindran, Khan and Timmins

(2008), we view the individual effect as a correlated random effect which is allowed to be

correlated with covariates Xi = (X ′i1, · · · ,X ′iT )′ and UitTt=1; that is,

αi = α(Xi, Ui1, · · · , UiT ) + vi, (2)

where α(·) is an unknown function of Xi and UitTt=1, and vi is a random error.

A fully nonparametric model of α(·) may lead to the problem of the curse of dimensionality

and become infeasible in practice. Compared with a linear projection in Chamberlain (1982)

and Abrevaya and Dahl (2008), an additive model with functional coefficients can accommo-

date more flexibility. Thus, we approximate the unknown function α(Xi, Ui1, · · · , UiT ) by a

functional-coefficient model5 such that

α(Xi, Ui1, · · · , UiT ) =T∑t=1

X ′itδt(Uit), (3)

where δ(Uit) is a K × 1 vector of unknown functional coefficients.

Finally, in the case of estimating FDI effect on the economic growth in our empirical studies,

the smoothing variable varies only across different individual units but keeps constant over

time periods.6 Therefore, in this paper, we focus on the simple case where Uit = Ui for all

1 ≤ t ≤ T . Model (1) can be rewritten as

Qτ (Yit | Ui,Xi, vi) = X ′it,1γτ +X ′it,2βτ (Ui) +

T∑t=1

X ′itδt(Ui) + vi. (4)

2.1.2 Model identification

The identification issue of panel data quantile model with fixed effects for large N and

short T has received increasing attentions in the recent literature. Canay (2011) discussed

the identification condition for T ≥ 2. The identification of the parameters of interest is

obtained from the conditional distribution of Yit using deconvolution. The key condition of

5As elaborated by Cai, Das, Xiong and Wu (2006) and Cai (2010), a functional-coefficient model can be a

good approximation to a general fully nonparametric model, g(X,Z) =d∑j=0

gj(Z)Xj = X ′g(Z).

6When Uit1 6= Uit2 6= Ui for any t1 6= t2, two estimation approaches can be employed. We can apply the

series estimation or adopt a single index method Ui = ω1Ui1 + · · · + ωTUiT using the iterative backfiiting

method proposed by Fan, Yao and Cai (2003).

7

the identification up to location is a conditional independence restriction which implies that

αi does not change across quantiles. Rosen (2012) considered a different set of identification

conditions for the same model. In addition to the usual conditional quantile restriction, Rosen

(2012) showed further that a weak conditional independence helps to identify the coefficients

of interest.

We consider a semiparametric quantile model with correlated random effects, which is a

generalization of Abrevaya and Dahl (2008). The identification of the model can be regarded

as a special case of Canay (2011). Two additional assumptions are required to identify the

model up to a location:

I1. vi is independently and identically distributed (iid) and independent of Ui,Xi.

I2. Qτ (Yit|Ui,Xi, vi) = Qτ (Yit|Ui,Xit, vi).

Remark 1. The independence restriction in Assumption I1 is a common assumption used

in the literature of correlated random effects; see Abrevaya and Dahl (2008). It restricts that

the quantiles of vi do not depend upon Xi. Assumption I2 allows for arbitrary forms of

heteroscedasticity only through Xit. Note that unfortunately, Assumption I2 rules out the

case of dynamic panel data models.

It is easy to see by Assumptions I1 and I2 that

Qτ (Yit|Ui,Xi, vi) = Qτ (Yit|Ui,Xit, vi) ≡ πτ (Ui,Xit, vi)

where πτ (Ui,Xit, vi) is a nonlinear unknown form. We can approximate πτ (Ui,Xit, vi) by a

semiparametric form as in (4). As argued in Cai (2010), such a semiparametric specification

of πτ (·) is flexible enough for many real applications.

2.2 Estimation Procedures

2.2.1 Pooling Regression Strategy

From model (4), one can observe that the conditional quantile effects ofXit on Yit are through

two channels: a direct effect γτ for constant coefficients or βτ (Ui) for varying coefficients,

and an indirect effect δt(Ui) working through the correlated random effects. Assuming that

T ≥ 2, to identify the direct effects γτ or βτ (Ui), one has to estimate at least two conditional

8

quantile models Qτ (Yit | Ui,Xi, vi) and Qτ (Yis | Ui,Xi, vi) given by

Qτ (Yit|Ui,Xi, vi) = X ′it,1[γτ + δ1t(Ui)] +X ′it,2[βτ (Ui) + δ2t(Ui)] +∑l 6=tX ′ilδl(Ui) + vi

and

Qτ (Yis|Ui,Xi, vi) = X ′it,1δ1t(Ui) +X ′it,2δ2t(Ui) +X ′is,1γτ +X ′is,2βτ (Ui) +∑l 6=tX ′ilδl(Ui) + vi,

respectively, where t 6= s, δ1t(Ui) is the vector which contains the first K1 components of

δt(Ui), and δ2t(Ui) is the vector which contains the last K2 components of δt(Ui). Hence, the

estimates of γτ and β(Ui) are respectively given by

γτ =∂Qτ (Yit | Ui,Xi, vi)

∂Xit,1− ∂Qτ (Yis | Ui,Xi, vi)

∂Xit,1,

and

βτ (Ui) =∂Qτ (Yit | Ui,Xi, vi)

∂Xit,2− ∂Qτ (Yis | Ui,Xi, vi)

∂Xit,2.

However, in order to avoid running two separating conditional quantile models, we adopt

Abrevaya and Dahl (2008)’s pooling regression strategy by stacking covariates. From model

(4), Qτ (Yit | Ui,Xi, vi) and Qτ (Yis | Ui,Xi, vi) can be expressed as

Qτ (Yit|Ui,Xi, vi) = X ′it,1γτ +X ′it,2βτ (Ui) +X ′i1δ1(Ui) + · · ·+X ′iTδT (Ui) + vi,

and

Qτ (Yis|Ui,Xi, vi) = X ′is,1γτ +X ′is,2βτ (Ui) +X ′i1δ1(Ui) + · · ·+X ′iTδT (Ui) + vi.

Hence, we treat

Y11...

Y1T...

Yi1...

YiT...

YN1

...

YNT

and

X ′11,1 X ′11,2 X ′11 · · · X ′1T...

...... · · ·

...

X ′1T,1 X ′1T,2 X ′11 · · · X ′1T...

...... · · ·

...

X ′i1,1 X ′i1,2 X ′i1 · · · X ′iT...

...... · · ·

...

X ′iT,1 X ′iT,2 X ′i1 · · · X ′iT...

...... · · ·

...

X ′N1,1 X ′N1,2 X ′N1 · · · X ′NT...

...... · · ·

...

X ′NT,1 X ′NT,2 X ′N1 · · · X ′NT

9

as the dependent variable and the right-side explanatory variables, respectively. This pooled

regression directly estimates (γ ′τ ,β′τ (Ui), δ

′1(Ui), · · · , δ′T (Ui))

′. We now consider the following

transformed model from (4),

Qτ (Ui,Zit, vi) = Z ′it,1γτ +Z ′it,2θτ (Ui) + vi, (5)

where θτ (Ui) = (β′τ (Ui), δ′1(Ui), · · · , δ′T (Ui))

′, Zit,1 denotes the corresponding variables in

the first column in the above design matrix, Zit,2 represents those entries in the remaining

columns, and Zit = (Z ′it,1,Z′it,2)

′.

2.2.2 Quasi-Likelihood Function

For a conditional quantile regression model, according to Koenker and Bassett (1978), the

estimation of parameters can be obtained by minimizing the following objective (loss) function

minθLKB(θ) =

T∑t=1

ρτ (yt − qτ (wt, θ)),

where qτ (wt, θ) is the regression function with unknown parameters θ, ρτ (x) = x(τ − Ix<0)

is called the check function, and IA is the indicator function of any set A. Komunjer (2005)

generalized the estimation of Koenker and Bassett (1978) by proposing a class of quasi-

maximum likelihood estimations (QMLEs), θ, obtained by solving

maxθLQ(θ) =

T∑t=1

ln lt(yt, qτ (wt, θ)),

where lt(·) is a period-t conditional quasi-likelihood. As pointed out by Komunjer (2005),

if lt(yt, qτ (wt, θ)) is taken to be C(yt, wt) exp(−ρτ (yt − qτ (wt, θ))), the QMLEs become the

estimators of Koenker and Bassett (1978).

We consider a class of QMLEs for conditional quantile like the model defined in (5), ob-

tained by solving the maximization of a quasi-likelihood function for the τth conditional

quantile

maxϑ

QLτ (ϑ) = maxϑ

N∑i=1

T∑t=1

ln lτ (Yit, qτ (Ui,Zit,ϑ)), (6)

where lτ (Yit, qτ (Ui,Zit,ϑ)) is a quasi-likelihood function for the τth conditional quantile on

individual i at time t and ϑ′ = (γ ′τ ,θ′τ (U1), · · · ,θ′τ (UN )). For simplicity, by assuming that

10

vi is iid as normal7 with mean zero and variance σ2, lτ (y, q) is generated from the integrated

quasi-likelihood function for the τth conditional quantile,

lτ (y, a1) =1√2πσ

∞∫−∞

qlτ (y, qτ (a1 + v))e−v2

2σ2 dv,

where qlτ (y, q) is the quasi-likelihood function for the τth conditional quantile. It is empha-

sized by Komunjer (2005) that different choices of qlτ (·, ·) affect the asymptotic theory of

the QMLE for quantile, similar to the case that different choices of likelihood function would

affect the asymptotic theory of QMLE for mean model when the object of interest is the

conditional mean. In this paper, for simplicity, we define qlτ (y, q) as

qlτ (y, q) = e−ρτ (y−q), (7)

This definition makes qlτ (·, ·) belongs to the so-called tick-exponential family defined by

Komunjer (2005). Substituting (7) into the integrated quasi-likelihood function for the τth

conditional quantile, we have

lτ (y, a) =1√2πσ

∞∫−∞

exp[−ρτ (a− v)− v2

2σ2]dv,

where a = y − a1. By a simple calculation,

lτ (a, σ) = e−ρτ (a)λτ (a, σ)(Ia≥0 + e−aIa<0). (8)

Thus,

ln lτ (a, σ) = −ρτ (a) + ln(λτ (a, σ)) + ln(Ia≥0 + e−aIa<0), (9)

where λτ (a, σ) = eτ2σ2

2 Φ( aσ − τσ) + e(τ−1)2σ2

2 Φ(− aσ + (τ − 1)σ)ea and Φ(·) is the standard

normal distribution function. Clearly, the last two terms in (9) can be regarded as a penalty

due to the randomness of vi. Also, it is easily to show that the penalty goes to zero when

σ goes to zero. When σ = 0, (9) is exactly same as the case without vi. Therefore, the

quasi-likelihood function QLτ (ϑ, σ) is given by

QLτ (ϑ, σ) =N∑i=1

T∑t=1

[−ρτ (ait) + ln(λτ (ait, σ)) + ln(Iait≥0 + e−aitIait<0)], (10)

7The normality assumption on vi here is just for simplicity and of course, it can be relaxed but the quasi-

likelihood function would be very complex. It would be very interesting to explore this issue in the future

research.

11

where ϑ is the vector of all the parameters to be estimated. Hence, for a given σ satisfying

the following equation

∂QLτ (ϑ, σ)

∂σ|σ=σ

= NTτ2σ +N∑i=1

T∑t=1

−φ(aitσ − τ σ

)+ (1− 2τ)σeait+

(1−2τ)σ2

2 Φ(− ait

σ + (τ − 1)σ)

Φ(aitσ − τ σ

)+ eait+

(1−2τ)σ2

2 Φ(− ait

σ + (τ − 1)σ) = 0, (11)

the QMLE of ϑ, ϑ, is obtained by

ϑ = (γ ′τ , θ′τ (U1), · · · , θ′τ (UN ))′ = arg max

ϑQLτ (ϑ, σ). (12)

Remark 2. We can simply estimate ϑ and σ by iterating (11) and (12) until convergence.

Given an initial value of σ, we can estimate ϑ by maximizing QLτ (ϑ, σ). Alternatively, σ

can be obtained by the following

σ2 =1

TVar

(Yit −Z ′it,1γ0.5 −Z ′it,2θ0.5(Ui)

)− 1

12NT 2

N∑i=1

T∑t=1

[Z ′it,1ˆ∂γτ∂τ|τ=0.5+Z

′it,2

ˆ∂θτ (Ui)

∂τ|τ=0.5]

2.

(13)

Then, we let σ = σ and iterate (12) and (13) until convergence.

2.2.3 Three-stage Estimation Procedure

To estimate the semiparametric model (5), similar to Cai and Xiao (2012), we propose a

three-stage estimation procedure to the panel data model.

At the first stage, we treat all coefficients as functional coefficients depending on Ui, such as

γτ = γτ (Ui). It is assumed throughout that γτ (·) and θτ (·) are both twice continuously dif-

ferentiable, and then we apply the local constant approximations to γτ (·) and the local linear

approximations to θτ (·) respectively. Hence, model (5) is estimated as a fully functional-

coefficient model and following Cai and Xu (2008), the localized quasi-likelihood function is

given by

maxγ0,θ0,θ1

N∑i=1

T∑t=1

[−ρτ (ait,1) + ln(λτ (ait,1, σ)) + ln(Iait,1≥0 + e−ait,1Iait,1<0)]Kh(Ui − u0), (14)

where ait,1 = Yit−Z ′it,1γ0−Z ′it,2θ0−Z ′it,2θ1(Ui−u0), γ0 = γτ (u0), θ0 = θτ (u0), θ1 = θτ (u0),

Kh(u) = K(u/h)/h, and K(·) is the kernel function. Note that A and A denote the first

order and second order partial derivatives of A throughout the paper.

12

Since γτ is a global parameter, in order to utilize all sample information to estimate γτ ,

at the second stage, we employ the average method to achieve the√N consistent estimator

of γτ , which is given by

γτ =1

N

N∑i=1

γτ (Ui). (15)

Theorem 1 (see later) shows that indeed, γτ is a√N consistent estimator.

Remark 3. First, it is worth pointing out that the well known profile least squares type of

estimation approach (Robinson (1988) and Speckman (1988)) for classical semiparametric

regression models may not be suitable to quantile setting due to lack of explicit normal equa-

tions. Secondly, the estimator γτ given in (15) has the advantage that it is easy to construct

and also achieves the√N -rate of convergence (see Theorem 1 later). In addition to this

simple estimator, other√N consistent estimators of γτ can be constructed. For example, to

estimate the parameter γτ without being overly influenced by the tail behavior of the distri-

bution of Ui, one might use a trimming function wi = IUi∈D with a compact subset D of R;

see Cai and Masry (2000) for details. Then, (15) becomes the weighted average estimator as

γw,τ =

[1

N

N∑i=1

wi

]−1 [1

N

N∑i=1

wiγτ (Ui)

].

Indeed, this type of estimator was considered by Lee (2003) for a partially linear quantile

regression model. To estimate γτ more efficiently, a general weighted average approach can

be constructed as follows

γw,τ =

[1

N

N∑i=1

W (Ui)

]−1 [1

N

N∑i=1

W (Ui)γτ (Ui)

],

where W (·) is a weighting function (a symmetric matrix) which can be chosen optimally by

minimizing the asymptotic variance; see Cai and Xiao (2012) for details. For simplicity, our

focus is on γτ given in (15).

At the last step, to estimate the varying coefficients, for a given√N -consistent estimator

γτ of γτ which may be obtained from (15), we plug γτ into model (5) and obtain the partial

residual denoted by Y ∗it = Yit − Z ′it,1γτ . Hence, the functional coefficients can be estimated

by using the local linear quantile estimation which is given by

maxθ0,θ1

N∑i=1

T∑t=1

[−ρτ (ait,2) + ln(λτ (ait,2, σ)) + ln(Iait,2≥0 + e−ait,2Iait,2<0)]Kh(Ui − u0), (16)

13

where ait,2 = Y ∗it − Z ′it,2θ0 − Z ′it,2θ1(Ui − u0). By moving u0 along the domain of Ui, the

entire estimated curve of the functional coefficient is obtained.

2.3 Asymptotic Theory

This section provides asymptotic results of γτ and βτ (u0) defined in Section 2.2.3. All proofs

are relegated to the appendices.

Firstly, define µj =∫∞−∞ u

jK(u)du and νj =∫∞−∞ u

jK2(u)du with j ≥ 0. Let h1 and h2 be

the bandwidths used at the first and third stages, respectively. Let e′1 = (IK∗1 ,0K∗1×K∗2 ) and

e′2 = (IK2 ,0K2×KT ), where K∗ = K∗1 + K∗2 , K∗1 = K1 and K∗2 = K2 + KT . Then, denote

fU (·) by the marginal density of U .

Secondly, we give the following notations and definitions which will be used in the rest of the

paper. Let gτ (a, σ) = ∂ ln(λτ (a, σ))/∂a 8, bit,1(u0) = Yit−Z ′it,1γτ−Z ′it,2θτ (u0) and bit,2(u0) =

Y ∗it − Z ′it,2θτ (u0), then denote mg(u0,Zit, σ) = E[gτ (bit,1(u0), σ)|u0,Zit], m∗g(u0,Zit,2, σ) =

E[gτ (bit,2(u0), σ)|u0,Zit,2], mg2(u0,Zit, σ) = E[g2τ (bit,1(u0), σ)|u0,Zit], m∗g2(u0,Zit,2, σ) =

E[g2τ (bit,2(u0), σ)|u0,Zit,2], mg1t(u0,Zi1,Zit, σ) = E[gτ (bi1,1(u0), σ)gτ (bit,1(u0), σ)|u0,Zi1,Zit],

m∗g1t(u0,Zi1,2,Zit,2, σ) = E[gτ (bi1,2(u0), σ)gτ (bit,2(u0), σ)|u0,Zi1,2,Zit,2], mg(u0,Zit, σ) =

E[gτ (bit,1(u0), σ)|u0,Zit], mg(u0,Zit, σ) = E[gτ (bit,1(u0), σ)|u0,Zit], and m∗g(u0,Zit,2, σ) =

E[gτ (bit,2(u0), σ)|u0,Zit,2].

Moreover, let

Ωτ,z(u0, σ) = τ2Ωz(u0)− 2τΩzg(u0, σ) + Ωzg2(u0, σ)

with Ωz(u0) = E(ZitZ′it|u0), Ωzg(u0, σ) = E(ZitZ

′itmg(u0,Zit, σ)|u0) and Ωzg2(u0, σ) =

E(ZitZ′itmg2(u0,Zit, σ)|u0),

Ωτ,z1t(u0, σ) = τ2Ωz1t(u0)− 2τΩz1tg(u0, σ) + Ωz1tg1t(u0, σ)

with Ωz1t(u0) = E(Zi1Z′it|u0), Ωz1tg(u0, σ) = E(Zi1Z

′itmg(u0,Zit, σ)|u0) and Ωz1tg1t(u0, σ) =

E(Zi1Z′itmg1t(u0,Zi1,Zit, σ)|u0),

Ωzg(u0, σ) = E[ZitZ′itmg(u0,Zit, σ)|u0],

Ωτ,z2(u0, σ) = τ2Ωz2(u0)− 2τΩz2g(u0, σ) + Ωz2g2(u0, σ)

8 ∂ ln(λτ (a,σ))∂a

= ea+(1−2τ)σ2

2 Φ(− a

σ+ (τ − 1)σ

)/[Φ(aσ− τσ

)+ ea+

(1−2τ)σ2

2 Φ(− a

σ+ (τ − 1)σ

)].

14

with Ωz2(u0) = E(Zit,2Z′it,2|u0), Ωz2g(u0, σ) = E(Zit,2Z

′it,2m

∗g(u0,Zit,2, σ)|u0) and Ωz2g2(u0, σ) =

E(Zit,2Z′it,2m

∗g2(u0,Zit,2, σ)|u0),

Ωτ,z1t,2(u0, σ) = τ2Ωz1t,2(u0)− 2τΩz1t,2g(u0, σ) + Ωz1t,2g1t(u0, σ)

with Ωz1t,2(u0) = E(Zi1,2Z′it,2|u0), Ωz1t,2g(u0, σ) = E(Zi1,2Z

′it,2m

∗g(u0,Zit,2, σ)|u0) and

Ωz1t,2g1t(u0, σ) = E(Zi1,2Z′it,2m

∗g1t(u0,Zi1,2,Zit,2, σ)|u0),

Ωz2g(u0, σ) = E[Zit,2Z′it,2m

∗g(u0,Zit,2, σ)|u0].

Consequently, we define the asymptotic biases and variances in the following theorems as

Bγ,τ (σ) = −µ2h21

2e′1E

Ω−1zg (Ui, σ)[2Ωzg(Ui, σ)

(0

θτ (Ui)

)+ Θ(Ui, σ)]

with Θ(Ui, σ) = Emg(Ui,Zit, σ)Zit[Z

′it,2θτ (Ui)]

2|Ui,

Bβ,τ (u0, σ) ≡ Bβ,τ (u0) = −µ2h22

2βτ (u0),

Σγ,τ (σ) = e′1EΩ−1zg (Ui, σ)[1

TΩτ,z(Ui, σ) +

T∑t=2

2(T − t+ 1)

T 2Ωτ,z1t(Ui, σ)]Ω−1zg (Ui, σ)e1,

and

Σβ,τ (u0, σ) =ν0e′2

fU (u0)Ω−1z2g(u0, σ)[

1

TΩτ,z2(u0, σ) +

T∑t=2

2(T − t+ 1)

T 2Ωτ,z1t,2(u0, σ)]Ω−1z2g(u0, σ)e2.

Remark 4. Note that g(a, σ) = Ia<0 when σ = 0, then mg(u0,Zit, σ), m∗g(u0,Zit,2, σ),

mg2(u0,Zit, σ), m∗g2(u0,Zit,2, σ), mg1t(u0,Zi1,Zit, σ) and m∗g1t(u0,Zi1,2,Zit,2, σ) all equal to

τ , moreover, mg(u0,Zit, σ), m∗g(u0,Zit,2, σ) and mg(u0,Zit, σ) equal to fY |U,Z(Z ′it,1γτ +

Z ′it,2θτ (u0)), fY ∗|U,Z2(Z ′it,2θτ (u0)) and fY |U,Z(Z ′it,1γτ + Z ′it,2θτ (u0)), respectively. Accord-

ingly, Ωτ,z(u0, σ) = τ(1− τ)Ωz(u0), Ωzg(u0, σ) = E[ZitZ′itfY |U,Z(Z ′it,1γτ +Z ′it,2θτ (u0))|u0] ≡

Ωzf (u0), Ωτ,z1t(u0, σ) = τ(1 − τ)Ωz1t(u0), Ωτ,z2(u0, σ) = τ(1 − τ)Ωz2(u0), Ωz2g(u0, σ) =

E[Zit,2Z′it,2fY ∗|U,Z2

(Z ′it,2θτ (u0))|u0] ≡ Ωz2f (u0), Ωτ,z1t,2(u0, σ) = τ(1− τ)Ωz1t,2(u0).

Therefore, the asymptotic bias Bγ,τ (σ) reduce to

Bγ,τ = −µ2h21e′1E

Ω−1zf (Ui)[2Ωzf (Ui)

(0

θτ (Ui)

)+ Θ(Ui)]

15

with Θ(Ui) = EZit[Z ′it,2θτ (Ui)]2fY |U,Z(Z ′it,1γτ +Z ′it,2θτ (Ui))|Ui, and the asymptotic vari-

ances Σγ,τ (σ) and Σβ,τ (u0, σ) reduce to

Σγ,τ = τ(1− τ)e′1EΩ−1zf (Ui)[1

TΩz(Ui) +

T∑t=2

2(T − t+ 1)

T 2Ωz1t(Ui)]Ω

−1zf (Ui)e1,

and

Σβ,τ (u0) =τ(1− τ)ν0e

′2

fU (u0)Ω−1z2f (u0)[

1

TΩz2(u0) +

T∑t=2

2(T − t+ 1)

T 2Ωz1t,2(u0)]Ω

−1z2f

(u0)e2.

The asymptotic bias of the varying coefficients is exactly same as the one of the case without

vi no matter σ is zero or not.

The following assumptions are necessary to establish the consistency and asymptotic nor-

mality of our estimators, although they might not be the weakest ones.9

Assumptions:

A1. The series Ui is iid. The series Zit is iid across individual i, but can be correlated

around t for fixed i. The series vi is iid N(0, σ2).

A2. The distribution of Y given U and Z has an everywhere positive Lebesgue density

fY |U,Z(·), which is bounded and satisfies the Lipschitz continuity condition.

A3. The kernel function K(·) is a symmetric bounded density with a bounded support

region.

A4. Assume that the functional coefficients θ(u0) are 2 times continuously differentiable

in a small neighborhood of u0. The marginal density smoothing variable U , fU (·), is contin-

uous with fU (u0) > 0. Assume all the variance-covariance matrix are positive-definite and

continuously differentiable in a neighborhood of u0.

A5. Assume that E(||Z||δ∗) <∞ with δ∗ > 4.

A6. Assume that bandwidth h1 → 0, h2 → 0, Nh1 → ∞ and Nh2 → ∞ as N → ∞.

Furthermore, Nh41 → 0.

A7. Assume that T ≥ 2.

Remark 5. Assumption A1 specifies that the data is iid across individual i, but can be

correlated around t for fixed i. This is a standard assumption in the panel data literature.

9The assumptions are imposed for a fixed u0 and a fixed τ , and the same assumptions are imposed on the

finite points of interest.

16

The normality assumption of vi can be relaxed by using some approximation approaches such

as Laplace approximation but the quasi-likelihood would be complex since there is no close

form of the quasi-likelihood like (8). We leave the generalization of this normally distributed

restriction to other types of distribution as future work. Assumption A6 requires that N go

to infinity, but T can be fixed. If the model is extended to the case of large T , an appropriate

mixing condition should be included. Assumption A7 excludes the case of T = 1 in order to

identify the coefficients of interest. This is not a severe assumption because this paper focus

on panel data. Assumption A2 does not exclude the dependence between error and covariates.

This kind of dependence will cause heteroscedasticity, which is indicated by the changing of

coefficients under different quantile levels. Assumption A3 is a standard assumption in the

nonparametric literature. Assumption A4 includes some smoothness conditions on functionals

involved.

As mentioned above, a√N consistent estimator of γτ at the second stage is constructed

by using the average method defined in (15). The following Theorem 1 states its asymptotic

normality result which can be obtained by following the U-statistic technique in Powell, Stock

and Stoker (1989).

Theorem 1. Suppose that Assumptions I1, I2 and A1-A7 hold, we have

√N(γτ − γτ +Bγ,τ (σ))

D→ N(0,Σγ,τ (σ)).

Results of Theorem 1 and Remark 4 show that the asymptotic normality of the estimate for

constant coefficients reduce to the one of the case without vi when σ = 0, because Bγ,τ (σ) and

Σγ,τ (σ) are exactly same as Bγ,τ and Σγ,τ , the corresponding asymptotic bias and variance of

the estimate for constant coefficients relating to the case without vi, when σ = 0. Therefore,

the estimator γτ is√N consistent and is asymptotically unbiased when the bandwidth h1

satisfies√Nh21 → 0, which implies that it requires under-smooth at the first stage. Clearly,

compare the above theorem with Theorem 1 in Cai and Xiao (2012), the asymptotic bias

term, Bγ,τ , is the same but the asymptotic variance, Σγ,τ , depends on T . However, when T

goes to infinite, some conditions on the observations (Yit,Xit, Ui), say mixing conditions,

are needed; see Cai and Xiao (2012) for details.

17

At the last stage, the partial residuals Y ∗it are used to estimate θτ (u0). The following

Theorem 2 states the asymptotic normality result of βτ (u0) which can be easily extracted

from the asymptotic normality result of θ0,τ (u0) because of βτ (u0) = e′2θ0,τ (u0).

Theorem 2. Suppose that Assumptions I1, I2 and A1-A7 hold, given the√N consistent

estimator of γτ , we have√Nh2[βτ (u0)− βτ (u0) +Bβ,τ (u0)]→ N(0,Σβ,τ (u0, σ)).

Results of Theorem 2 and Remark 4 show that the asymptotic normality of the estimate for

varying coefficients reduce to the one of the case without vi when σ = 0, because Σβ,τ (u0, σ)

is exactly same as Σβ,τ (u0), the corresponding asymptotic variance of the estimate for varying

coefficients relating to the case without vi, when σ = 0. Compared with Theorem 1 in Cai

and Xu (2008) and Theorem 2 in Cai and Xiao (2012), the asymptotic bias term in the

above theorem is the same but the asymptotic variance, Σβ,τ (u0), depends on T . Clearly,

the optimal bandwidth is h2,opt = O(N−1/5) by minimizing the mean squared error and the

asymptotic mean squared error is in the order of O(N−4/5). This means that the regular

bandwidth selection procedures can be applied here.

2.4 A Specification Test

After deriving the asymptotic results, we now turn to discussing statistical inferences such as

constructing confidence intervals and testing hypotheses. To make statistical inferences for

γτ and βτ (·) in practice, first one needs some consistent covariance estimators of Σγ,τ (σ) and

Σβ,τ (u0, σ). To this end, we need to estimate Ωzg(u0, σ), Ωτ,z(u0, σ), Ωτ,z1t(u0, σ), Ωz2g(u0, σ),

Ωτ,z2(u0, σ) and Ωτ,z1t,2(u0, σ) consistently. The estimation of Ωz2g(u0, σ), Ωτ,z2(u0, σ) and

Ωτ,z1t,2(u0, σ) is similar to Ωzg(u0, σ), Ωτ,z(u0, σ) and Ωτ,z1t(u0, σ). Hence, we here only focus

on the latter to save notations. Following Cai and Xu (2008), we define

Ωz(u0) = (NT )−1N∑i=1

T∑t=1

ZitZ′itKh(Ui − u0),

Ωz1t(u0) = (N(T − t))−1N∑i=1

T−t∑s=1

ZisZ′i(s+t)Kh(Ui − u0),

Ωzg(u0, σ) = (NT )−1N∑i=1

T∑t=1

ZitZ′itmg(u0,Zit, σ)Kh(Ui − u0)

18

and

Ωz1tg(u0, σ) = (N(T − t))−1N∑i=1

T−t∑s=1

ZisZ′i(s+t)mg(u0,Zi(s+t), σ)Kh(Ui − u0),

where

mg(u, z′1γτ − z′2θτ (u), σ) =

N∑i=1

T∑t=1

Kh(Ui − u,Zit − z)g(Yit − z′1γτ − z′2θτ (u), σ)

N∑i=1

T∑t=1

Kh(Ui − u,Zit − z)

.

It can be easily shown that Ωz(u0) = fU (u0)Ωz(u0) + op(1), Ωzg(u0, σ) = fU (u0)Ωzg(u0, σ) +

op(1), Ωz1t(u0) = fU (u0)Ωz1t(u0) + op(1) and Ωz1tg(u0, σ) = fU (u0)Ωz1tg(u0, σ) + op(1).

Similarly, we can get the consistent estimators of Ωzg2(u0, σ), Ωz1tg1t(u0, σ) and Ωzg(u0, σ).

Then, Ωτ,z(u0, σ) = τ2Ωz(u0) − 2τ Ωzg(u0, σ) + Ωzg2(u0, σ) and Ωτ,z1t(u0, σ) = τ2Ωz1t(u0) −

2τ Ωz1tg(u0, σ) + Ωz1tg1t(u0, σ). Finally, the consistent covariance estimators of Σγ,τ (σ) can

be respectively given by

e′1N−1

N∑i=1

Ω−1zg (Ui, σ)[1

TΩτ,z(Ui, σ) +

T∑t=2

2(T − t+ 1)

T 2Ωτ,z1t(Ui, σ)]Ω−1zg (Ui, σ)e1.

Therefore, the consistent estimate of Σβ,τ (u0, σ) can be constructed accordingly in an obvious

manner.

In empirical studies, it is of importance to test the constancy of the varying coefficients.

The null hypothesis H0 : βτ (uj) = βτ for ujqj=1 where ujqj=1 is a set of distinct points

within the domain of Ui. Cai and Xiao (2012) provided some comments on the choice of

ujqj=1 and q in practice. Under the null hypothesis, a simple and easily implemented test

statistics can be constructed as follows

TN =∑

1≤j≤q||√Nh2Σβ,τ (uj , σ)−1/2(βτ (uj)− βτ )||2 → χ2

qK2(17)

where χ2qK2

is a chi-squared random variable with qK2 degrees of freedom. Thus, the null is

rejected if TN is too large. Note that the proposed test statistic TN in (17) is different from

that in Cai and Xiao (2012). Of course, other types of test statistics may be constructed and

it would be warranted as future research topics.

19

3 Monte Carlo Simulations

In this section, we conduct Monte Carlo simulations to demonstrate the finite sample perfor-

mance of the proposed estimators for both constant and functional coefficients. We consider

the following data generating process

Qτ (Yit|Ui,Xi, vi) = γ0,τ +Xit,1γ1,τ +Xit,2βτ (Ui) +T∑t=1

Xit,2δt(Ui) + vi (18)

with T = 2, where the smoothing variable Ui is generated from iid U(−2.5, 2.5), Xit,1 andXit,2

are respectively generated from iid U(0, 3) and U(0, 2), vi is generated from iid N(0, σ2) with

σ = 0.1, and Yit is generated base on the Skorohod representation. The constant coefficients

are set by γ0,τ = 2 + τ and γ1,τ = −1.5 + τ , respectively. The functional coefficients are

defined as βτ (u) = 0.5 cos(2u) + u/3 + τ , δ1(u) = sin(1.5u), and δ2(u) = 1.5e−u2 − 0.75.

To measure the performance of γj,τ for 0 ≤ j ≤ 2 and βτ (·), we use the mean absolute

deviation errors (MADE) of the estimator, defined by

MADE(βτ (·)) =1

n0

n0∑l=1

|βτ (ul)− βτ (ul)|,

where uln0l=1 are the grid points within the domain of Ui, and

MADE(γj,τ ) = |γj,τ − γj,τ |

for 0 ≤ j ≤ 2.

We consider three sample sizes as N = 200, 500 and 1000. For a given sample size, we

repeat simulations 500 times to calculate the MADE values. We compare the estimation

results using different bandwidths, such as h1 = 5N−2/5 (under-smooth) and h2 = cN−1/5,

where c is chosen from 1.5, 1.7, 2, 2.2, 2.5, 2.7, 3.0, · · · . From the simulation results we

can conclude that the estimation of constant coefficients is not sensitive to the choice of the

bandwidth when the first step is under-smoothed, and the estimation of βτ (·) is quite stable

when the bandwidth selection is chosen within a reasonable range. The optional bandwidth

for estimating functional coefficient βτ (·) is about h2 = 2.7N−1/5.

The boxplots of 500 MADE values of γ0,τ , γ1,τ and βτ (·) are given in Figures 1 for γ0,τ ,

2 for γ1,τ , and 3 for βτ (·). From the boxplots, we can observe that the median and stan-

dard deviation of 500 MADE values are decreasing for all settings when the sample size is

20

increasing. This concludes that the proposed estimators for both constant and functional

coefficients are consistent.

0.0

0.1

0.2

0.3

0.4

0.5

MADE of γ0,0.15

− N=200 − − N=500 − − N=1000 −

0.0

0.1

0.2

0.3

0.4

0.5

0.6

MADE of γ0,0.5

− N=200 − − N=500 − − N=1000 −

0.0

0.1

0.2

0.3

0.4

0.5

MADE of γ0,0.75

− N=200 − − N=500 − − N=1000 −

Figure 1: Boxplot of MADE values of γ0,τ .

0.00

0.05

0.10

0.15

0.20

0.25

0.30

0.35

MADE of γ1,0.15

− N=200 − − N=500 − − N=1000 −

0.0

0.1

0.2

0.3

0.4

MADE of γ1,0.5

− N=200 − − N=500 − − N=1000 −

0.00

0.05

0.10

0.15

0.20

0.25

0.30

0.35

MADE of γ1,0.75

− N=200 − − N=500 − − N=1000 −

Figure 2: Boxplot of MADE values of γ1,τ .

21

0.0

0.1

0.2

0.3

0.4

MADE of β0.15(u)− N=200 − − N=500 − − N=1000 −

0.0

0.1

0.2

0.3

0.4

MADE of β0.5(u)− N=200 − − N=500 − − N=1000 −

0.0

0.1

0.2

0.3

0.4

MADE of β0.75(u)− N=200 − − N=500 − − N=1000 −

Figure 3: Boxplot of MADE values of βτ (u).

Also, the simulation results (the median and standard deviation, denoted by SD, in paren-

theses of 500 MADE values) for the estimators of both constant and functional coefficient are

summarized in Table 1. From Table 1, one can see clearly that the medians of 500 MADE

values for all settings decrease significantly as N increases. For example, when the sample

size increases from 200 to 1000, the medians of MADE values for γ0,0.15, γ1,0.15 and β0.15(·)

shrink quickly from 0.1089 to 0.0439, from 0.0683 to 0.0289, and from 0.2193 to 0.1214, re-

spectively. The standard deviations also shrink quickly when the sample size is enlarged. For

example, for γ0,0.15, the standard deviations shrink from 0.1080 to 0.0384, and for γ1,0.15 and

β0.15(·), they decrease from 0.0721 to 0.0285 and from 0.0577 to 0.0225, respectively. Similar

results can be observed at the median, τ = 0.5, and the upper quantile, τ = 0.75. This

is in line with our asymptotic theory and implies that our proposed estimators are indeed

consistent. Compared with the estimation of βτ (·), the shrinkage speed of the estimation

of γ1,τ is relatively fast, which is also consistent with the theoretical results in the previous

sections.

22

Table 1: The median and SD of the MADE values for γ0,τ , γ1,τ and βτ (·)

τ = 0.15 τ = 0.5 τ = 0.75

γ0,τ γ1,τ βτ (·) γ0,τ γ1,τ βτ (·) γ0,τ γ1,τ βτ (·)200 0.1089 0.0683 0.2193 0.1399 0.0848 0.2498 0.1238 0.0814 0.2320

(0.1080) (0.0721) (0.0577) (0.1146) (0.0757) (0.0555) (0.1078) (0.0765) (0.0559)

500 0.0629 0.0445 0.1550 0.0823 0.0544 0.1968 0.0782 0.0507 0.1751

(0.0594) (0.0411) (0.0313) (0.0732) (0.0518) (0.0397) (0.0677) (0.0440) (0.0356)

1000 0.0439 0.0289 0.1214 0.0613 0.0398 0.1603 0.0514 0.0337 0.1411

(0.0384) (0.0285) (0.0225) (0.0495) (0.0354) (0.0273) (0.0465) (0.0297) (0.0241)

4 Modeling the Effect of FDI on Economic Growth

4.1 Empirical Models

The existing literature presented contradictory empirical evidences on whether or not FDI can

promote the economic growth in host countries. Recent studies in the literature tried to find

the sources of the mixed empirical conclusions and some studies concluded that the reason

may be due to nonlinearities in FDI effects on the economic growth and the heterogeneity

among countries. For example, in the growth literature like Kottaridi and Stengos (2010),

the following is a typical model in empirical studies

yit = αi + β1(FDI/Y )it + β2 log(DI/Y )it + β3nit + β4hit + εit, (19)

where yit denotes the growth rate of GDP per capita in the country or region i during the

period t, αi is the individual effect used to control the unobserved country-specific hetero-

geneity, nit is the logarithm of population growth rate, hit is the human capital, and εit is

random error. Moreover, the FDI and DI in (19) refer to foreign direct investment and do-

mestic investment respectively and Y represents the total output. Hence, (FDI/Y )it denotes

the average ratio between the FDI and the total output during the period t in country i and

(DI/Y )it is defined in the same fashion for the domestic investment. To allow the possible

joint effect of FDI and human capital, some literatures considered to add an interacted term

between FDI and human capital into the empirical growth model, then (19) becomes

yit = αi + β1(FDI/Y )it + β2 log(DI/Y )it + β3nit + β4hit + β5((FDI/Y )it × hit) + εit, (20)

23

which is indeed a nonlinear parametric model; see Kottaridi and Stengos (2010) and the

references therein.

Since the majority of the literature realized that the effect of FDI on the economic growth

depends on the absorptive capacity in host countries and the initial GDP per capita is one

of the most important indicators to reflect the initial conditions and the absorptive capacity

in the host country; see Hansen (2000), Nunnemkamp (2004), and among others, we hereby

propose a partially varying-coefficient model which allows the effect of FDI on the economic

growth to depend on the initial GDP per capita in the host country. Hence, our first empirical

econometric model is the conditional mean model, given by

yit = αi+β1(Ui)(FDI/Y )it+β2 log(DI/Y )it+β3nit+β4hit+β5((FDI/Y )it×hit) + εit, (21)

where Ui is the logarithm of initial GDP per capita in country i and β1(Ui) is the varying

coefficient over the logarithm of initial GDP per capita Ui. Therefore, model (21) has an

ability to characterize how the FDI may have different effects on the economic growth under

the different initial conditions.

As we discussed in the introduction, the conditional mean model in (21) is usually not

sufficiently enough to control the heterogeneity among countries. The existing literature

dealt with the aforementioned issue by simply looking at sub-samples. Instead, in this paper,

we propose to adopt quantile regression approach to investigate the impact of FDI on the

economic growth. Our method is capable of dealing with heterogeneity among countries by

allowing different quantiles to have different empirical growth equations, and at the same

time, we can avoid splitting the sample. Different from the mean model, another advance

of considering the quantile model is that one can see how the FDI effects differently on the

economic growth in the different groups of countries, say the economy fast growing countries

(upper quantile) and the economy slowly growing countries (lower quantile) conditional on

controlled covariates.

By assuming that the εit in (21) takes a linearly heteroscedastic form as εit = (X ′itϕ)ε∗it; see

Koenker and Bassett (1978), where Xit includes all regressors in (21) and ε∗it is independent

of all covariates, but given i, ε∗it is allowed to be correlated around t, then we can obtain the

24

following conditional quantile model,

Qτ (yit | Ui,Xit, αi) = αi + β1,τ (Ui)(FDI/Y )it + β2,τ log(DI/Y )it

+β3,τnit + β4,τhit + β5,τ ((FDI/Y )it × hit), (22)

which can be regarded as a special case of model (1). Imposing the correlated random effect

assumption in (3), we can derive the conditional quantile regression model in (5) for studying

this empirical example. Therefore, the three-stage estimation procedures described in Section

2 can be applied here.

4.2 The Data and Empirical Results

Our data set includes 95 countries or regions from 1970 to 1999. To smooth the yearly

fluctuations in aggregate economic variables, we take five-year averages by following the

convention in the empirical growth literature; see Maasoumi, Racine and Stengos (2007),

Durlauf, Kourtellos and Tan (2008), and Kottaridi and Stengos (2010). Hence, there are six

periods totally. The population growth is computed by the average annual growth rate in

each period, the human capital is measured as mean years of schooling in each period, and

the domestic investment refers to the average of the domestic gross fixed capital formation

measured by the US dollars in 2000 constant values. We measure the initial GDP by the GDP

per capita of each country in the beginning year of each decade in constant 2000 US dollars.10

All the above data are available to be downloaded from World Development Indicators (WDI).

The FDI flows, in constant 2000 US dollars, are taken from United Nations Conference on

Trade and Development (UNCTAD). The full list of countries and regions can be found in

Table 5 in Appendix A.

Firstly, we consider the classical linear regression model in (20). Table 2 presents corre-

sponding estimation results, including coefficient estimates, standard deviations, t-values and

p-values from Column 2 to Column 5. The estimate of the FDI effect, denoted by β1, is about

0.56, which is positive and significant with a p-value of 0.027. On average, the linear condi-

tional mean model reports a mild positive effect of FDI on promoting the economic growth.

Compared to the growth effect of FDI, Table 2 reports a larger effect of domestic investments

on the economic growth, which is about 2.72 and highly significant with the p-value of 0.009.

10We combine three decades, from 1970 to 1979 (69 countries), from 1980 to 1989 (93 countries), and from

1990 to 1999 (95 countries), and then obtain a panel of 514 observations with N = 257 and T = 2.

25

The effect of population growth (β3) is also positive and significant, with an estimate of 0.65.

However, other estimates (the effect of human capital and the effect of the interacted term

between human capital and FDI) are not significant.

Table 2: Empirical results of a linear conditional mean model in (20)

Mean Model Coefficient Standard Deviation t-value p-value

β1 0.5589 0.2513 2.2248 0.0270 ∗

β2 2.7180 1.0374 2.6206 0.0093 ∗∗

β3 0.6483 0.2575 2.5183 0.0124 ∗

β4 -0.0290 0.4269 -0.0682 0.9458

β5 -0.0359 0.0375 -0.9561 0.3401

Next, we move to the partially varying-coefficient conditional mean model in (21). Com-

pared to the linear model in (20), we now allow the effect of FDI to depend on the initial

conditions. Figure 4 and Table 3 present the corresponding estimation results. The solid

line in Figure 4 represents the nonparametric estimate of the varying coefficient β1(·) along

various values of initial GDP, and the shaded area is the corresponding 90% pointwise con-

fidence intervals with the bias ignored. The nonparametric estimate shows a mild but clear

pattern that the growth effect of FDI increases as the initial GDP improves, which is in line

with the hypothesis of the absorptive capacity. The range of the estimated values of the

varying coefficient is between 0.9 and 1.6 for different initial GDPs, much larger than 0.56,

the estimated value of the linear model. Table 3 reports the estimates of constant coefficients

in (21), which are quite different from the corresponding estimation results in Table 2. For

example, in Table 3, the estimate of β2 is now 3.81 in stead of 2.72. The impact of popula-

tion growth rate on the economic growth now becomes to be significantly negative with an

estimate of −1.18. Moreover, both the coefficients of human capital and the interacted term

between FDI and human capital become significant in Table 3. The estimate of the impact

of human capital is positive with a value of 0.17 and the estimate of the interacted term is

−0.18. We attribute the different estimation results to the existence of nonlinearity in the

regression model.

26

Table 3: Constant coefficients of a partial linear conditional mean model in (21)

Mean Model Coefficient Standard Deviation t-value p-value

β2 3.8100 0.1321 28.8326 0.0000 ∗∗∗

β3 -1.1838 0.3357 -3.5268 0.0004 ∗∗∗

β4 0.1728 0.0268 6.4519 0.0000 ∗∗∗

β5 -0.1753 0.0112 -15.5394 0.0000 ∗∗∗

7.0 7.5 8.0 8.5 9.0

0.8

1.0

1.2

1.4

1.6

1.8

log(Initial GDP)

Figure 4: Estimated curve of functional coefficient β1(·) in model (21) together with the

pointless 90% confidence interval with the bias ignored.

Finally, we consider the partially varying-coefficient quantile model in (22). We conduct a

constancy test as in Section 3.3 to testing whether the coefficient β1,τ (·) does not vary with

the initial GDP at different quantiles and whether the coefficients βj,τ where j = 2, · · · , 5

do not vary with the initial GDP at different quantiles. The testing results are summarized

in Table 4. It turns out that the p-values are 0.0000, 0.0000, and 0.0000 for 0.15, 0.50, and

0.75 quantiles, respectively. The test results strongly reject the null of constancy of β1,τ (·).

All these results verify the existence of the heterogeneity among countries and regions with

different development stages.

27

Table 4: p-values of testing constancyτ β1,τ β2,τ β3,τ β4,τ β5,τ

0.15 0.0000 0.9999 0.9999 0.9999 0.9999

0.50 0.0000 0.9999 0.9999 0.9999 0.9999

0.75 0.0000 0.9999 0.9999 0.9999 0.9999

Figure 5 presents the estimates of all four constant coefficients βj,τ for 2 ≤ j ≤ 5 under

different quantile levels as τ = 0.15, 0.25, 0.35, 0.45, · · · , 0.75, 0.85. The horizontal axis

represents different quantiles and the vertical axis measures the estimated value of βj,τ . The

curves in solid line denote the estimates under different quantiles and the areas in dark

gray color are corresponding 90% confidence intervals. The horizontal solid lines denote the

estimates under conditional mean model and the areas in light gray color are corresponding

90% confidence intervals. Except the estimated values of β3,τ in the upper left panel in Figure

5, most estimated values are outside the 90% confidence intervals of the conditional mean

estimates, implying that indeed, βj,τ changes over τ and the conditional mean model might

be inadequate to catch the heterogeneity effect. We observe that the estimated values of β2,τ

increase with τ when τ is bigger than 0.35 but the estimated values of β5,τ decrease with

τ when τ is in the range from 0.25 to 0.75. The estimated values of β4,τ decrease with τ

when τ < 0.5 and increase with τ when τ > 0.5. Moreover, the estimated values of β2,τ

and β4,τ are all positive but the estimated values of β3,τ and β5,τ are all negative. Hence,

generally speaking, we find a clear evidence that domestic investments and human capitals

have positive effects on the economic growth, while the effects of domestic investments are

larger and increase more significantly in countries or regions with better economic growth

performance than those with poor growth performance and the effects of human capitals show

a U-shape across the countries or regions from poor growth performance to better economic

growth performance.

28

0.2 0.3 0.4 0.5 0.6 0.7 0.8

3.0

3.5

4.0

Tau

beta_2

0.2 0.3 0.4 0.5 0.6 0.7 0.8

−1.

5−

1.0

−0.

5

Tau

beta_3

0.2 0.3 0.4 0.5 0.6 0.7 0.8

0.15

0.25

0.35

Tau

beta_4

0.2 0.3 0.4 0.5 0.6 0.7 0.8

−0.

24−

0.20

−0.

16−

0.12

Tau

beta_5

Figure 5: Estimated values of constant coefficients βj,τ in model (22) for 2 ≤ j ≤ 5 and βj in

model (21) and their corresponding 90% confidence intervals.

The nonparametric estimates of functional coefficient β1,τ (·) with the upper (τ = 0.75, in

the dashed line) and lower (τ = 0.15, in the solid line) quantiles are demonstrated in Figure

6. The horizontal axis measures different values of log of initial GDP, Ui, and the vertical

axis measures the nonparametric estimated values of β1,τ (·). The shaded areas represent the

90% confidence intervals of β1,τ (·), where the higher order bias is ignored. We observe that

the estimated values of β1,τ (·) at the upper quantile significantly higher than those at the

lower quantile uniformly over the values of initial GDPs, which is on line with the testing

results in Table 4. In general, our empirical findings support the hypothesis of absorptive

capacity. The initial conditions really matter for host countries to benefit from adopting

foreign direct investments. At the upper quantile, the estimated values of β1,τ (·) generally

increase with the value of initial GDPs, and furthermore, the tendency of increase speeds up

when Ui > 8.2. However, at the lower quantile, although the estimated curve has an overall

positive slope, it becomes almost flat when Ui is larger than 8.2 for host countries.

29

7.0 7.5 8.0 8.5 9.0

0.5

1.0

1.5

2.0

log(Initial GDP)

Tau=0.75Tau=0.15

Figure 6: Estimated curves of functional coefficient β1,τ (·) in model (22) for τ = 0.15 (solid

line) and τ = 0.75 (dashed line) and their concresponding 90% pointwise confidence intervals

with the bias ignored.

5 Conclusion

Quantile panel data models have gained a lot of attentions in the literature during recent

years. In this paper, we propose a partially varying-coefficient quantile panel data model

with correlated random effects. Compared to quantile panel data models with fixed effect,

our estimation assumes only large N and short (fixed) T , while the latter requires that both

N and T go to infinity. In our semiparametric model, we allow some coefficients to vary with

other economic variables while others keep constant. This novel semiparametric quantile

panel data model is applied to estimating the impact of FDI on the economic growth.

There are several issues still worth of further studies. First, it is reasonable to allow for

cross sectional dependence in the current model. In the literature of conditional mean models,

some methods have been developed to deal with cross sectional dependence, for example,

using the factor structure or the interactive effect. However, due to the nature of conditional

quantile model, it is not obvious to extend these under the quantile setup. Second, it is

also interesting to address a dynamic structure and endogeneity issue in conditional quantile

30

panel data models. We leave these important issues as future research topics.

Appendix A: Table of Countries and Regions

Table 5: Countries and regions in the empirical data setAlgeria Australia Austria Bahrain

Bangladesh Barbados Belgium Benin

Bolivia Botswana Brazil Cameroon

Canada Central African Republic Chile China

Colombia Congo, Rep. Costa Rica Cyprus

Denmark Dominican Republic Ecuador Egypt, Arab Rep.

El Salvador Fiji Finland France

Gambia Germany Ghana Greece

Guatemala Guyana Honduras Hong Kong SAR, China

Hungary Iceland India Indonesia

Iran, Islamic Rep. Ireland Israel Italy

Jamaica Japan Jordan Kenya

Korea, Rep. Lesotho Malawi Malaysia

Mali Malta Mauritius Mexico

Mozambique Nepal Netherlands New Zealand

Nicaragua Niger Norway Pakistan

Panama Papua New Guinea Paraguay Peru

Philippines Poland Portugal Rwanda

Senegal Sierra Leone Singapore South Africa

Spain Sri Lanka Sudan Swaziland

Sweden Switzerland Syrian Arab Republic Thailand

Togo Trinidad and Tobago Tunisia Turkey

Uganda United Kingdom United States Uruguay

Venezuela, RB Zambia Zimbabwe

Appendix B: Proof of Theorem 1

In order to establish the asymptotic theory of γτ in Theorem 1, the local Bahadur representa-

tion for the estimators obtained from the first stage should be derived. At first, we introduce

the following additional notations and definitions: H = diag(1K∗ , h11K∗2 )(K∗+K∗2 )×(K∗+K∗2 )

and G =

(IK∗

0K∗2×K∗1 , Uih1IK∗2

)(K∗+K∗2 )×K∗

, where Uih1 = (Ui − u0)/h1. Following Cai and

31

Xu (2008) and Cai and Xiao (2012), we have

√Nh1H

γτ (u0)− γτ (u0)

θ0,τ (u0)− θτ (u0)

θ1,τ (u0)− θτ (u0)

=Ω−1(u0, σ)√Nh1TfU (u0)

N∑j=1

T∑t=1

GZjtψτ (ajt,1, σ)K(Ujh1) + op(1),

where Ω(u0, σ) = diag(Ωzg(u0, σ), µ2e′0Ωzg(u0, σ)e0), e

′0 = (0K∗2×K∗1 , IK∗2 ), and ψτ (a, σ) =

τ − gτ (a, σ). In particular, we can obtain

√Nh1

(γτ (u0)− γτ (u0)

θ0,τ (u0)− θτ (u0)

)=

Ω−1zg (u0, σ)√Nh1TfU (u0)

N∑j=1

T∑t=1

Zjtψτ (ajt,1, σ)K(Ujh1) + op(1),

(23)

which is useful for establishing the asymptotic results for our estimators.

For any u0, (23) can be rewritten as

√Nh1

(γτ (u0)− γτ (u0)

θ0,τ (u0)− θτ (u0)

)' h1√

Nh1T

N∑j=1

T∑t=1

B−1(u0, σ)Z(u0,Zjt, σ)

+h1√Nh1T

N∑j=1

T∑t=1

B−1(u0, σ)Zjt[ψτ (ajt,1, σ)− ψτ (bjt,1(u0), σ)]Kh(Uj − u0),

where B(u0, σ) = fU (u0)Ωzg(u0, σ), Z(u0,Zjt, σ) = Zjtψτ (bjt,1(u0), σ)Kh(Uj − u0) and

bjt,1(u0) = Yjt − Z ′jt,1γτ − Z ′jt,2θτ (u0) for Uj in a small neighborhood of u0. In particu-

lar,

γτ (u0)− γτ (u0) '1

NT

N∑j=1

T∑t=1

e′1B−1(u0, σ)Z(u0,Zjt, σ)

+1

NT

N∑j=1

T∑t=1

e′1B−1(u0, σ)Zjt[ψτ (ajt,1, σ)− ψτ (bjt,1(u0), σ)]Kh(Uj − u0)

≡ 1

NT

N∑j=1

T∑t=1

e′1B−1(u0, σ)Z(u0,Zjt, σ) +BN (u0, σ)

32

holds uniformly for all u0 under Assumption A1-A4. Thus,

γτ − γτ =1

N

N∑i=1

[γτ (Ui)− γτ (Ui)]

=1

N2

N∑i=1

N∑j=1

1

T

T∑t=1

e′1B−1(Ui, σ)Z(Ui,Zjt, σ) +

1

N

N∑i=1

BN (Ui, σ)

=2

N2

∑1≤i<j≤N

e′1B−1(Ui, σ)

1

T

T∑t=1

Z(Ui,Zjt, σ) +1

N

N∑i=1

BN (Ui, σ)

=1

N2

∑1≤i<j≤N

[e′1B−1(Ui, σ)

1

T

T∑t=1

Z(Ui,Zjt, σ) + e′1B−1(Uj , σ)

1

T

T∑t=1

Z(Uj ,Zit, σ)]

+1

N

N∑i=1

BN (Ui, σ)

=N − 1

2NUN + BN ,

where BN = 1N

N∑i=1

BN (Ui, σ) and UN = 2N(N−1)

∑1≤i<j≤N

pN (ξi, ξj , σ) with

pN (ξi, ξj , σ) = e′1B−1(Ui, σ)

1

T

T∑t=1


1

T

T∑t=1

Z(Uj ,Zit, σ),

and ξi = (Ui,ZZZi) indicates all the information for i. Define rN (ξi, σ) = E[pN (ξi, ξj , σ)|ξi],

θN (σ) = E[rN (ξi, σ)] = E[pN (ξi, ξj , σ)], and UN = θN (σ) + 2N

N∑i=1

[rN (ξi, σ) − θN (σ)]. The

following two lemmas are useful to prove Theorem 1 and their detailed proofs are relegated

to Appendix D.

Lemma 1. Under the assumptions in Theorem 1, we have

(i) rN (ξi, σ) = e′1Ω−1zg (Ui, σ) 1

T

T∑t=1

Zitψτ (bit,1(Ui), σ)(1 + o(1)),

(ii) θN (σ) = −µ2h21e′1EΩ−1zg (Ui, σ)[2Ωzg(Ui, σ)

(0

θτ (Ui)

)+ Θ(Ui, σ)]+ o(h21),

(iii) V ar[rN (ξi, σ)] = Σγ(σ) + o(h1).

Lemma 2. Under the assumptions in Theorem 1, we have

BN = µ2h21e′1EΩ−1zg (Ui, σ)[2Ωzg(Ui, σ)

(0

θτ (Ui)

)+ Θ(Ui, σ)]+ o(h21).

33

Proof of Theorem 1: First, note that E[||pN (ξi, ξj , σ)||2] = O(h−1) = O[N(Nh1)−1] →

o(N) if and only if Nh1 → ∞ as h1 → 0. Lemma 3.1 in Powell, Stock and Stoker (1989)

gives that√N(UN − UN ) = op(1). Then the result follows from Lemma 1, Lemma 2 and the

Lindeberg-Levy central limit theorem.

Appendix C: Proof of Theorem 2

The asymptotic distribution of βτ (u0) can be easily extracted from the asymptotic distri-

bution of θ0,τ (u0) because of βτ (u0) = e′2θ0,τ (u0). Similar to the proof of Theorem 1, the

local Bahadur representation for the estimators obtained from the third stage should be

derived at first. We introduce the following additional notations and definitions: H2 =

diag(1K∗2 , h21K∗2 )2K∗2×2K∗2 and G2 =

(IK∗2

Uih2IK∗2

)2K∗2×K∗2

, where Uih2 = (Ui−u0)/h2. For

a given√N consistent estimator γτ of γτ , we can obtain that

√Nh2H2

(θ0,τ (u0)− θτ (u0)

θ1,τ (u0)− θτ (u0)

)=

Ω−1(u0, σ)√Nh2TfU (u0)

N∑i=1

T∑t=1

G2Zit,2ψτ (ait,2, σ)K(Uih2)+op(1),

where Ω(u0, σ) = diag(Ωz2g(u0, σ), µ2Ωz2g(u0, σ)). Similar to (23), we have

√Nh2

(θ0,τ (u0)− θτ (u0)

)=

Ω−1z2g(u0, σ)√Nh2TfU (u0)

N∑i=1

T∑t=1

Zit,2ψτ (ait,2, σ)K(Uih2) + op(1),

which is useful to establish the asymptotic result for θ0,τ (u0). The above equation can be

rewritten as

√Nh2(θ0,τ − θτ (u0)) '

Ω−1z2g(u0, σ)√Nh2TfU (u0)

N∑i=1

T∑t=1

Zit,2ψτ (ait,2, σ)K(Uih2)

=Ω−1z2g(u0, σ)√Nh2TfU (u0)

N∑i=1

T∑t=1

Zit,2[ψτ (ait,2, σ)− ψτ (bit,2(Ui), σ)]K(Uih2)

+Ω−1z2g(u0, σ)√Nh2TfU (u0)

N∑i=1

T∑t=1

Zit,2ψτ (bit,2(Ui), σ)K(Uih2)

≡ BN + ΨN ,

where bit,2(Ui) = Y ∗it − Z ′it,2θτ (Ui). We will show that the first term BN determines the

asymptotic bias and the second term ΨN gives the asymptotic normality.

34

First, note that the first-order conditional moment of ψτ (bit,2(Ui), σ) is

E(ψτ (bit,2(Ui), σ)|Ui,Zit,2) = E(τ − g(bit,2(Ui), σ)|Ui,Zit,2)

= τ − E[g(bit,2(Ui), σ)|Ui,Zit,2]

≡ τ −m∗g(Ui,Zit,2, σ).

Then, the second-order conditional moments are

E(ψ2τ (bit,2(Ui), σ)|Ui,Zit,2) = E[τ2 − 2τg(bit,2(Ui), σ) + g2(bit,2(Ui), σ)|Ui,Zit,2]

= τ2 − 2τE[g(bit,2(Ui), σ)|Ui,Zit,2] + E[g2(bit,2(Ui), σ)|Ui,Zit,2]

≡ τ2 − 2τm∗g(Ui,Zit,2, σ) +m∗g2(Ui,Zit,2, σ),

and

E(ψτ (bi1,2(Ui), σ)ψτ (bit,2(Ui), σ)|Ui,Zi1,2,Zit,2)

= E[τ2 − τg(bi1,2(Ui), σ)− τg(bit,2(Ui), σ) + g(bi1,2(Ui), σ)g(bit,2(Ui), σ)|Ui,Zi1,2,Zit,2]

= τ2 − τE[g(bi1,2(Ui), σ)|Ui,Zi1,2]− τE[g(bit,2(Ui), σ)|Ui,Zit,2]

+E[g(bi1,2(Ui), σ)g(bit,2(Ui), σ)|Ui,Zi1,2,Zit,2]

≡ τ2 − τm∗g(Ui,Zi1,2, σ)− τm∗g(Ui,Zit,2, σ) +m∗g1t(Ui,Zi1,2,Zit,2, σ).

Thus,

E(ΨN ) =Ω−1z2g(u0, σ)√Nh2fU (u0)

NE[Zit,2ψτ (bit,2(Ui), σ)K(Uih2)]

=Ω−1z2g(u0, σ)√Nh2fU (u0)

NEE[Zit,2ψτ (bit,2(Ui), σ)|Ui]K(Uih2) = 0

with E[Zit,2ψτ (bit,2(Ui), σ)|Ui] = 0, since we know that bit,2(Ui) is the maximizer of the

35

corresponding likelihood function, and

V ar(ΨN )

=Ω−1z2g(u0, σ)

h2T 2f2U (u0)V ar

T∑t=1

Zit,2ψτ (bit,2(Ui), σ)K(Uih2)Ω−1z2g(u0, σ)

=Ω−1z2g(u0, σ)

h2T 2f2U (u0)TV ar[Zit,2ψτ (bit,2(Ui), σ)K(Uih2)]

+

T∑t=2

2(T − t+ 1)Cov((Zi1,2ψτ (bi1,2(Ui), σ)K(Uih2),Zit,2ψτ (bit,2(Ui), σ)K(Uih2))Ω−1z2g(u0, σ)

=ν0

TfU (u0)Ω−1z2g(u0, σ)[Ωτ,z2(u0, σ) +

T∑t=2

2(T − t+ 1)

TΩτ,z1t,2(u0, σ)]Ω−1z2g(u0, σ),

since

E[Zit,2Z′it,2ψ

2τ (bit,2(Ui), σ)K2(Uih2)]

= EZit,2Z ′it,2E[ψ2τ (bit,2(Ui), σ)|Ui,Zit,2]K2(Uih2)

= EZit,2Z ′it,2[τ2 − 2τm∗g(Ui,Zit,2, σ) +m∗g2(Ui,Zit,2, σ)]K2(Uih2)

≡ ν0h2fU (u0)[τ2Ωz2(u0)− 2τΩz2g(u0, σ) + Ωz2g2(u0, σ)](1 + o(1))

≡ ν0h2fU (u0)Ωτ,z2(u0, σ)(1 + o(1)),

and

E[Zi1,2Z′it,2ψτ (bi1,2(Ui), σ)ψτ (bit,2(Ui), σ)K2(Uih2)]

= EZi1,2Z ′it,2E[ψτ (bi1,2(Ui), σ)ψτ (bit,2(Ui), σ)|Ui,Zi1,2,Zit,2]K2(Uih2)

= EZi1,2Z ′it,2[τ2 − τm∗g(Ui,Zi1,2, σ)− τm∗g(Ui,Zit,2, σ) +m∗g1t(Ui,Zi1,2,Zit,2, σ)]K2(Uih2)

≡ ν0h2fU (u0)[τ2Ωz1t,2(u0)− 2τΩz1t,2g(u0, σ) + Ωz1t,2g1t(u0, σ)](1 + o(1))

≡ ν0h2fU (u0)Ωτ,z1t,2(u0, σ)(1 + o(1)).

Let QN = 1NT

N∑i=1

T∑t=1Zit,2ψτ (bit,2(Ui), σ)K(Uih2). Using the Cramer-Wold device, for any

d ∈ RK∗2 , define ZN,it =√

h2T d′Zit,2ψτ (bit,2(Ui), σ)K(Uih2), then we have

√Nh2d

′QN =1√NT

N∑i=1

T∑t=1

ZN,it =1√NT

N∑i=1

Z∗N,i,

where Z∗N,i =T∑t=1

ZN,it, which is iid across i. Hence, it follows by the Lindeberg-Levy central

limit theorem that the asymptotic normality holds.

36

Next, we move to work on the first term BN . Note that

bit,2(Ui)− ait,2 = Z ′it,2θτ (u0) +Z ′it,2θτ (u0)h2Uih2 −Z ′it,2θτ (Ui) = −h22

2Z ′it,2θτ (u)U2

ih2 ,

then

g(bit,2(Ui), σ)− g(ait,2, σ) = g(ait,2 + ζ(bit,2(Ui)− ait,2), σ)(bit,2(Ui)− ait,2)

= −h22

2g(ait,2 − ζ

h222Z ′it,2θτ (u)U2

ih2 , σ)Z ′it,2θτ (u)U2ih2

and

[g(bit,2(Ui), σ)− g(ait,2, σ)]2

= 2[g(ait,2 + ζ(bit,2(Ui)− ait,2), σ)− g(ait,2, σ)]g(ait,2 + ζ(bit,2(Ui)− ait,2), σ)(bit,2(Ui)− ait,2)

= 2[g(ait,2 + η(ait,2 + ζ(bit,2(Ui)− ait,2)− ait,2), σ)(ait,2 + ζ(bit,2(Ui)− ait,2)− ait,2)]

g(ait,2 + ζ(bit,2(Ui)− ait,2), σ)(bit,2(Ui)− ait,2)

= 2ζg(ait,2 + ηζ(bit,2(Ui)− ait,2), σ)g(ait,2 + ζ(bit,2(Ui)− ait,2), σ)(bit,2(Ui)− ait,2)2

= O(h42).

Thus,

E[ψτ (ait,2, σ)− ψτ (bit,2(Ui), σ)|Ui,Zit,2]

= −h22

2E[g(ait,2 − ζ

h222Z ′it,2θτ (u)U2

ih2 , σ)Z ′it,2θτ (u)U2ih2 |Ui,Zit,2]

= −h22

2E[g(bit,2(Ui), σ)|Ui,Zit,2]Z ′it,2θτ (Ui)U

2ih2(1 + o(1))

≡ −h22

2m∗g(Ui,Zit,2, σ)Z ′it,2θτ (Ui)U

2ih2(1 + o(1))

and

E[ψτ (ait,2, σ)− ψτ (bit,2(Ui), σ)]2|Ui,Zit,2 = O(h42).

Let Bit = Zit,2[ψτ (ait,2, σ)− ψτ (bit,2(Ui), σ)]K(Uih2), we have

E(Bit) = −h22

2E[Zit,2m

∗g(Ui,Zit,2, σ)Z ′it,2θτ (Ui)U

2ih2K(Uih2)](1 + o(1))

= −h22

2EE[Zit,2Z

′it,2m

∗g(Ui,Zit,2, σ)|Ui]θτ (Ui)U

2ih2K(Uih2)(1 + o(1))

= −h22

2E[Ωz2g(Ui, σ)θτ (Ui)U

2ih2K(Uih2)](1 + o(1))

= −h32

2fU (u0)µ2Ωz2g(u0, σ)θτ (u0)(1 + o(1))

37

and E[B2it] = o(h42) and similarly, we have E[Bit1Bit2 ] = o(h42). Finally, we show that

E(BN ) =Ω−1z2g(u0, σ)√Nh2fU (u0)

NEZit,2[ψτ (ait,2, σ)− ψτ (bit,2(Ui), σ)]K(Uih2)

= −Ω−1z2g(u0, σ)√Nh2fU (u0)

Nh322fU (u0)µ2Ωz2g(u0, σ)θτ (u0)(1 + o(1))

=µ2h

22

√Nh2

2θτ (u0)(1 + o(1)).

and V ar(BN ) = o(h32). This completes the proof of Theorem 2.

Appendix D: Proofs of Lemmas

Proof of Lemma 1: For any Ui, note that

E[ψτ (bjt,1(Ui), σ)|ξj ] = τ − E[g(bjt,1(Ui), σ)|ξj ]

' τ − Eg(bjt,1(Uj), σ) + g(bjt,1(Uj), σ)Z ′jt,2[θτ (Uj)− θτ (Ui)]

+1

2g(bjt,1(Uj), σ)Z ′jt,2[θτ (Uj)− θτ (Ui)]2|ξj

≡ τ −mg(Uj ,Zjt, σ)−mg(Uj ,Zjt, σ)Z ′jt,2[θτ (Uj)− θτ (Ui)]

−1

2mg(Uj ,Zjt, σ)Z ′jt,2[θτ (Uj)− θτ (Ui)]2

= τ −mg(Uj ,Zjt, σ)−mg(Uj ,Zjt, σ)Z ′jt

(0

θτ (Uj)− θτ (Ui)

)−1

2mg(Uj ,Zjt, σ)Z ′jt,2[θτ (Uj)− θτ (Ui)]2,

since bjt,1(Ui)− bjt,1(Uj) = Z ′jt,2[θτ (Uj)− θτ (Ui)]. Hence, we have

E[Z(Ui,Zjt, σ)|ξi]

= EZjtψτ (bjt,1(Uj), σ)Kh(Uj − Ui)|ξi+ Eψτ bjt,1(Uj), σ)[bjt,1(Uj)− bjt,1(Ui)]Kh(Uj − Ui)|ξi

+Eψτ (bjt,1(Uj), σ)[bjt,1(Uj)− bjt,1(Ui)]2Kh(Uj − Ui)|ξi+ o(h21)

= −E[mg(Uj ,Zjt, σ)ZjtZ′jt

(0


)Kh(Uj − Ui)|ξi]

−1

2Emg(Uj ,Zjt, σ)ZjtZ ′jt,2[θτ (Uj)− θτ (Ui)]2Kh(Uj − Ui)|ξi+ o(h21)

≡ −I1 −I22

+ o(h21),

since EZjtψτ (bjt,1(Uj), σ)Kh(Uj−Ui)|ξi = EE[Zjtψτ (bjt,1(Uj), σ)|Uj ]Kh(Uj−Ui)|ξi = 0.

38

Then, we obtain that

I1 = E[mg(Uj ,Zjt, σ)ZjtZ′jt

(0


)Kh(Uj − Ui)|ξi]

= EE[mg(Uj ,Zjt, σ)ZjtZ′jt|Uj ]

(0

θτ (Ui)(Uj − Ui)− θτ (Ui)(Uj − Ui)2

)Kh(Uj − Ui)|ξi+ o(h21)

≡ E[Ωzg(Uj , σ)

(0

θτ (Ui)(Uj − Ui)− θτ (Ui)(Uj − Ui)2

)Kh(Uj − Ui)|ξi] + o(h21)

=

∫[Ωzg(Uj , σ) + uh1Ωzg(Uj , σ)]

(0

uh1θτ (Ui) + (uh1)2

2 θτ (Ui)

)K(u)[fU (Ui) + uh1fU (Ui)]du+ o(h21)

= µ2h21B(Ui, σ)

(0

12 θτ (Ui) + fU (Ui)

fU (Ui)θτ (Ui)

)+ µ2h

21fU (Ui)Ωzg(Ui, σ)

(0

θτ (Ui)

)+ o(h21),

and

I2 = Emg(Uj ,Zjt, σ)ZjtZ ′jt,2[θτ (Uj)− θτ (Ui)]2Kh(Uj − Ui)|ξi

= Emg(Uj ,Zjt, σ)Zjt[Z′jt,2θτ (Uj)]

2(Uj − Ui)2Kh(Uj − Ui)|ξi+ o(h21)

= EEmg(Uj ,Zjt, σ)Zjt[Z′jt,2θτ (Uj)]

2|Uj(Uj − Ui)2Kh(Uj − Ui)|ξi+ o(h21)

≡ E[Θ(Uj , σ)(Uj − Ui)2Kh(Uj − Ui)|ξi] + o(h21)

= µ2h21fU (Ui)Θ(Ui, σ) + o(h21).

It follows that

rN (ξi, σ) = E[e′1B−1(Ui, σ)

1

T

T∑t=1


1

T

T∑t=1

Z(Uj ,Zit, σ)|ξi]

= E[e′1B−1(Ui, σ)

1

T

T∑t=1

Z(Ui,Zjt, σ)|ξi] + E[e′1B−1(Uj , σ)

1

T

T∑t=1

Z(Uj ,Zit, σ)|ξi]

=1

T

T∑t=1

e′1

∫B−1(u, σ)Zitψτ (bit,1(u), σ)Kh(Ui − u)fU (u)du

=1

T

T∑t=1

e′1B−1(Ui, σ)Zitψτ (bit,1(Ui), σ)fU (Ui)(1 + o(1))

=1

T

T∑t=1

Zitψτ (bit,1(Ui), σ)(1 + o(1))

39

and furthermore, we obtain that

θN (σ) = E[pN (ξi, ξj , σ)] = 2E[e′1B−1(Ui, σ)Z(Ui,Zjt, σ)]

= −µ2h21E[e′1

(0

θτ (Ui) + 2 fU (Ui)fU (Ui)θτ (Ui)

)] + 2E[e′1Ω

−1zg (Ui, σ)Ωzg(Ui, σ)

(0

θτ (Ui)

)]

+E[e′1Ω−1zg (Ui, σ)Θ(Ui, σ)]+ o(h21)

= −µ2h21e′1EΩ−1zg (Ui, σ)[2Ωzg(Ui, σ)

(0

θτ (Ui)

)+ Θ(Ui, σ)]+ o(h21).

Similar to the derivation of V ar(ΨN ), it follows that

V ar[rN (ξi)] = e′1EΩ−1zg (Ui, σ)[1

TΩτ,z(Ui, σ)

+T∑t=2

2(T − t+ 1)

T 2Ωτ,z1t(Ui, σ)]Ω−1zg (Ui, σ)e1 + o(h1)

≡ Σγ(σ) + o(h1).

Therefore, Lemma 1 is established.

Proof of Lemma 2: Similar to Lemma 1 (ii), we show that

BN = µ2h21e′1EΩ−1zg (Ui, σ)[2Ωzg(Ui, σ)

(0

θτ (Ui)

)+ Θ(Ui, σ)]+ o(h21).

The lemma is established due to the fact that V ar(BN ) = O( 1N h

21).

40

Appendix E:

Derivation of (8)

1√2πσ

∞∫−∞

exp[−ρτ (a− v)− v2

2σ2]dv

=1√2πσ

a∫−∞

exp[−τ(a− v)− v2

2σ2]dv +

∞∫a

exp[−(τ − 1)(a− v)− v2

2σ2]dv

=1√2πσe−τa

a∫−∞

exp[− 1

2σ2(v2 − 2τσ2v)]dv + e−(τ−1)a

∞∫a

exp[− 1

2σ2(v2 − 2(τ − 1)σ2v)]dv

= e−τa+τ2σ2

2 Φ(aσ− τσ

)+ e−(τ−1)a+

(τ−1)2σ2

2 [1− Φ(aσ− (τ − 1)σ

)]

= e−τa+τ2σ2

2 Φ(aσ− τσ

)+ e−(τ−1)a+

(τ−1)2σ2

2 Φ(− a

σ+ (τ − 1)σ

)=

e−τa[eτ2σ2

2 Φ(aσ − τσ

)+ ea+

(τ−1)2σ2

2 Φ(− a

σ + (τ − 1)σ)] , when a ≥ 0

e−(τ−1)a[e−a+τ2σ2

2 Φ(aσ − τσ

)+ e

(τ−1)2σ2

2 Φ(− a

σ + (τ − 1)σ)] , when a < 0

= e−ρτ (a)Ia≥0[eτ2σ2

2 Φ(aσ− τσ

)+ ea+

(τ−1)2σ2

2 Φ(− a

σ+ (τ − 1)σ

)]

+e−ρτ (a)Ia<0[e−a+ τ2σ2

2 Φ(aσ− τσ

)+ e

(τ−1)2σ2

2 Φ(− a

σ+ (τ − 1)σ

)]

= e−ρτ (a)[eτ2σ2

2 Φ(aσ− τσ

)+ e

(τ−1)2σ2

2 Φ(− a

σ+ (τ − 1)σ

)ea](Ia≥0 + e−aIa<0)

≡ e−ρτ (a)λτ (a, σ)(Ia≥0 + e−aIa<0)

Derivation of (11)

Differentiation maxσ

N∑i=1

T∑t=1

ln(λτ (ait, σ)) with respect to σ leads to

N∑i=1

T∑t=1

∂λτ (ait,σ)∂σ

λτ (ait, σ)|σ=σ = 0.

By definition of λτ (a, σ) = eτ2σ2

2 Φ(aσ − τσ

)+ e

(τ−1)2σ2

2 Φ(− a

σ + (τ − 1)σ)ea,

∂λτ (a, σ)

∂σ= τ2σe

τ2σ2

2 Φ(aσ− τσ

)+ (τ − 1)2σea+

(τ−1)2σ2

2 Φ(− a

σ+ (τ − 1)σ

)+(− a

σ2− τ)e

τ2σ2

2 φ(aσ− τσ

)+ (

a

σ2+ τ − 1)ea+

(τ−1)2σ2

2 φ(− a

σ+ (τ − 1)σ

)= τ2σλτ (a, σ) + (1− 2τ)σea+

(τ−1)2σ2

2 Φ(− a

σ+ (τ − 1)σ

)− e

τ2σ2

2 φ(aσ− τσ

).

41

Thus,

NTτ2σ +N∑i=1

T∑t=1

−φ(aitσ − τ σ

)+ (1− 2τ)σeait+

(1−2τ)σ2

2 Φ(− ait

σ + (τ − 1)σ)

Φ(aitσ − τ σ

)+ eait+

(1−2τ)σ2

2 Φ(− ait

σ + (τ − 1)σ) = 0.

Derivation of (13)

Assume Yit is generated based on Skorohod representation,

Yit = Z ′it,1γ(Vit) +Z ′it,2θ(Vit, Ui) + vi

where Vit is distributed uniformly from 0 to 1 conditional on all the observarions. Then,

Yit−Z ′it,1γ0.5−Z ′it,2θ0.5(Ui) = Z ′it,1[γ(Vit)−γ0.5]+Z ′it,2[θ(Vit, Ui)−θ0.5(Ui)]+vi. Therefore,

by delta method, we have

V ar

(1

NT

N∑i=1

T∑i=1

(Yit −Z ′it,1γ0.5 −Z ′it,2θ0.5(Ui))

)

= V ar

(1

NT

N∑i=1

T∑i=1

Z ′it,1[γ(Vit)− γ0.5] +Z ′it,2[θ(Vit, Ui)− θ0.5(Ui)]

)+ V ar

(1

NT

N∑i=1

T∑i=1

vi

)

≈ 1

N2T 2

N∑i=1

T∑i=1

Z ′it,1∂γτ∂τ|τ=0.5 +Z ′it,2

∂θτ (Ui)

∂τ|τ=0.52V ar (Vit) +

1

NV ar (vi)

=1

12N2T 2

N∑i=1

T∑i=1

Z ′it,1∂γτ∂τ|τ=0.5 +Z ′it,2

∂θτ (Ui)

∂τ|τ=0.52 +

1

NV ar (vi) .

References

Abrevaya, J., Dahl, C., 2008. The effects of birth inputs on birth weight: evidence fromquantile estimation on panel data. Journal of Business & Economic Statistics 26,379-397.

Aitken, B., Harrison, A., 1999. Do domestic firms benefit from direct foreign investment?Evidence from Venezuela. American Economic Review 89, 605-618.

Alfaro, L., Chanda, A., Kalemli-Ozcan, S., Sayek, S., 2004. FDI and economic growth: Therole of local financial markets. Journal of International Economics 64, 89-112.

Blomstrom, M., Lipsey, R., Zejan, M., 1992. What explains developing country growth?Working Paper No. 4132, NBER.

Blomstrom, M., Persson, H., 1983. Foreign investment and spillover efficiency in an un-derdeveloped economy: evidence from the Mexican manufacturing industry. WorldDevelopment 11, 493-501.

42

Borensztein, E., De Gregorio, J., Lee, J-W., 1998. How does foreign direct investment affecteconomic growth? Journal of International Economics 45, 115-135.

Cai, Z., 2010. Functional coefficient models for economic and financial data. In Ferraty, F.,Romain, Y. (Eds), Handbook of Functional Data Analysis, Oxford: Oxford UniversityPress, 166-186.

Cai, Z., Das, M., Xiong, H., Wu, X., 2006. Functional coefficient instrumental variablesmodels. Journal of Econometrics 133, 207-241.

Cai, Z., Masry, E., 2000. Nonparametric estimation of additive nonlinear ARX time series:Local linear fitting and projection. Econometric Theory 16, 465-501.

Cai, Z., Xiao, Z., 2012. Semiparametric quantile regression estimation in dynamic modelswith partially varying coefficients. Journal of Econometrics 167, 413-425.

Cai Z., Xu, X., 2008. Nonparametric quantile estimations for dynamic smooth coefficientmodels. Journal of the American Statistical Association 103, 1595-1608.

Canay, I., 2011. A simple approach to quantile regression for panel data. EconometricsJournal 14, 368-386.

Carkovic, M., Levine, R. 2005. Does foreign direct investment accelerate economic growth?In: Moran, T., Graham, E., Blomstrom, M. (Eds), Does Foreign Investment PromoteDevelopment? Washington: Peterson Institute Press.

Chamberlain, G., 1982. Multivariate regression models for panel data. Journal of Econo-metrics 18, 5-46.

Chamberlain, G., 1984. Panel data. In: Griliches, Z., Intriligator, M. (Eds), Handbook ofEconometrics Vol. II, Amsterdam: North-Holland, 1247-1318.

Chaudhuri, P., 1991. Global nonparametric estimation of conditional quantile functions andtheir derivatives. Journal of Multivariate Analysis 39, 246-269.

Chernozhukov, V., Hansen, C., 2006. Instrumental quantile regression inference for struc-tural and treatment effect models. Journal of Econometrics 132, 491-525.

Durham, J.B., 2004. Absorptive capacity and the effects of foreign direct investment andequity foreign portfolio investment on economic growth. European Economic Review48, 285-306.

De Mello, R., 1999. FDI-led growth: Evidence from time series and panel data. OxfordEconomic Papers 51, 133-151.

De Gooijer, J.G., Zerom, D., 2003. On additive conditional quantiles with high-dimensionalcovariates. Journal of the American Statistical Association 98, 135-146.

De Gregorio, J., 1992. Economic growth in Latin America. Journal of Development Eco-nomics 39, 58-84.

43

Durlauf, S.N., Kourtellos, A., Tan, C., 2008. Are any growth theories robust? EconomicJournal 118, 329-346.

Fan, J., Yao, Q., Cai, Z., 2003. Adaptive varying-coefficient linear models. Journal of theRoyal Statistical Society, Series B 65, 57-80.

Fan, J., Yao, Q., Tong, H., 1996. Estimation of conditional densities and sensitivity measuresin nonlinear dynamical systems. Biometrica 83, 189-206.

Galvao, A., 2011. Quantile regression for dynamic panel data with fixed effects. Journal ofEconometrics 164, 142-157.

Gamper-Rabindran, S., Khan, S., Timmins, C., 2008. The impact of piped water provisionon infant mortality in Brazil: a quantile panel data approach. NBER Working PaperNo. 14365.

Ghosh, M., Wang, W., 2009. Does FDI accelerate economic growth? The OECD experiencebased on panel data estimation for the period 1980-2004. Global Economy Journal 9,1-21.

Grog, H., Strobl, E., 2001. Multinational companies and productivity spillovers: an meta-analysis. Economic Journal 111, 723-739.

Haddad, M., Harrison, A., 1993. Are there positive spillovers from direct foreign investment?Evidence from panel data for Morocco. Journal of Development Economics 42, 51-74.

Hansen, B., 2000. Sample Splitting and Threshold Estimation. Econometrica 68, 575-603.

He, X., Liang, H., 2000. Quantile regression estimates for a class of linear and partiallylinear errors-in-variable models. Statistica Sinica 10, 129-140.

He, X., Ng, P., 1999. Quantile splines with several covariates. Journal of Statistical Planningand Inference 75, 343-352.

He, X., Ng, P., Portnoy, S., 1998. Bivariate quantile smoothing splines. Journal of the RoyalStatistical Society, Series B 60, 537-550.

He, X., Portnoy, S., 2000. Some asymptotic results on bivariate quantile splines. Journal ofStatistical Planning and Inference 91, 341-349.

Henderson, D., Papageorgiou, C., Parmeter, C., 2011. Growth Empirics without ParametersThe Economic Journal 122, 125-154.

Hermes, N., Lensink, R., 2003. Foreign direct investment, financial development and eco-nomic growth. Journal of Development Studies 40, 142-163.

Honda, T., 2004. Quantile regression in varying coefficient models. Journal of StatisticalPlanning and Inference 121, 113-125.

Horowitz, J., Lee, S., 2005. Nonparametric estimation of an additive quantile regressionmodel. Journal of the American Statistical Association 100, 1238-1249.

44

Koenker, R., 2004. Quantile regression for longitudinal data. Journal of Multivariate Anal-ysis 91, 74-89.

Koenker, R., 2005. Quantile Regression. Cambridge University Press, New York.

Koenker R., Bassett G., 1978. Regression quantiles. Econometrica 46, 33-50.

Koenker R., Bassett G., 1982. Robust tests for heteroscedasticity baeds on regression quan-tiles. Econometrica 50, 43-61.

Komunjer I., 2005. Quasi-maximum likelihood estimation for conditional quantiles. Journalof Econometrics 128, 137-164.

Kottaridi, C., Stengos, T., 2010. Foreign direct investment, human capital and non-linearities in economic growth. Journal of Macroeconomics 32, 858-871.

Lamarche, C., 2010. Robust penalized quantile regression estimation for panel data. Journalof Econometrics 157, 396-408.

Lee, S., 2003. Efficient semiparametric estimation of partially linear quantile regressionmodel. Econometric Theory 19, 1-31.

Lipsey, R. 2003. Foreign direct investment, growth, and competitiveness in developingcountries. In: Cornelius, P. (Eds.), The Global Competitiveness Report, 2002-2003.New York: Oxford University Press.

Luiz, R., De Mello, J., 1999. Foreign direct investment-led growth: Evidence from timeseries and panel data. Oxford Economic Papers 51, 133-151.

Maasoumi, E.J., Racine, J.S., Stengos, T., 2007. Growth and convergence: A profile ofdistribution dynamics and mobility. Journal of Econometrics 136, 483-508.

Nunnenkamp, P., 2004. To what extent can foreign direct investment help achieve interna-tional development goals? The World Economy, 27, 657C677.

Powell, J., Stock, J., Stoker, T., 1989. Semiparametric estimation of index coefficients.Econometrica57, 1403-1430.

Robinson, P.M., 1988. Root-N -consistent semiparametric regression. Econometrica 56,931-954.

Rosen, A., 2012. Set Identification via quantile restrictions in short panels, Journal ofEconometrics 166, 127-137.

Speckman, P., 1988. Kernel smoothing partial linear models. The Journal of RoyalStatistical Society, Series B 50, 413-426.

Yu, K., Lu, Z., 2004. Local linear additive quantile regression. Scandinavian Journal ofStatistics 31, 333-346.

45

Documents

Department of Economics Zongwu Cai University of …...Seminar Series Zongwu Cai University of Kansas “A New Semiparametric Quantile Panel Data Model: Theory and Applications ”