54
Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP) (WTO) 1

Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

  • Upload
    others

  • View
    2

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

Trade Flows and Trade Policy Analysis

October 2013

Dhaka, Bangladesh

Witada Anukoonwattaka Cosimo Beverelli

(ESCAP) (WTO)

1

Page 2: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

Selected econometric methodologies and STATA

applications

2

Page 3: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

Content

a) Classical regression model • Linear prediction

• Ordinary least squares estimator (OLS)

• OLS in STATA

• Interpretation of coefficients

b) Introduction to panel data analysis: fixed effects • Definition and advantages

• Fixed effects versus random effects models

• Fixed effects (FE) estimator

• Alternative: Brute force OLS (dummy variable estimator)

• FE and dummy variable estimator estimation in STATA

c) Binary dependent variable models i. Cross-section

ii. Panel data

iii. Binary dependent variable models in STATA

3

Page 4: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

a) Classical regression model

• Linear prediction

• Ordinary least squares estimator (OLS)

• OLS in STATA

• Interpretation of coefficients

4

Page 5: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

5

Linear prediction

1. Starting from an economic model and/or an economic intuition, the purpose of regression is to test a theory and/or to estimate a relationship

2. Regression analysis studies the conditional prediction of a dependent (or endogenous) variable 𝑦 given a vector of regressors (or predictors or covariates) 𝒙, 𝐸[𝑦|𝒙]

3. The classical regression model is:

• A stochastic model: 𝑦 = 𝐸 𝑦 𝒙 + 휀, where ε is an error (or disturbance) term

• A parametric model: 𝐸 𝑦 𝑥 = 𝑔(𝒙, 𝛽), where 𝑔(∙) is a specified function and 𝛽 a vector of parameters to be estimated

• A linear model in parameters: 𝑔(∙) is a linear function, so: 𝐸 𝑦 𝒙 = 𝒙′𝛽

Page 6: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

6

Ordinary least squares (OLS) estimator

• With a sample of N observations (𝑖 = 1,… ,𝑁) on 𝑦 and 𝒙, the linear regression model is:

𝑦𝑖 = 𝑥′𝑖𝛽 + 휀𝑖

where 𝑥𝑖 is a 𝐾 × 1 regression vector and 𝛽 is a 𝐾 × 1 parameter vector (the first element of 𝑥𝑖 is a 1 for all 𝑖)

• In matrix notation, this is written as 𝑦 = 𝑋𝛽 + 휀

• OLS estimator of 𝛽 minimizes the sum of squared errors:

휀𝑖2 = 휀′휀 = (𝑦 − 𝑋𝛽)′(𝑦 − 𝑋𝛽)

𝑁

𝑖=1

which (provided that 𝑋 is of full column rank 𝐾) yields:

𝛽 𝑂𝐿𝑆 = (𝑋′𝑋)−1𝑋′𝑦 = 𝑥𝑖𝑥′𝑖𝑖

−1

𝑥𝑖𝑦𝑖𝑖

• This is the best linear predictor of 𝑦 given 𝒙 if a squared loss error function 𝐿 𝑒 = 𝑒2 is used (where 𝑒 ≡ 𝑦 − 𝑦 is the prediction error)

Page 7: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

Ordinary least squares (OLS) estimator

7

Page 8: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

8

OLS in STATA

• Stata’s regress command runs a simple OLS regression

• Regress depvar indepvar1 indepvar2 …., options

• Always use the option robust to ensure that the covariance estimator can handle heteroskedasticity of unknown form

• Usually apply the cluster option and specify an appropriate level of clustering to account for correlation within groups

• Rule of thumb: apply cluster to the most aggregated level of variables in the model

• Example: In a model with data by city, state, and country, cluster by country

Page 9: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

9

Interpretation of coefficients

• Economists are generally interested in marginal effects and elasticities

• Consider the model:

𝑦 = 𝛽𝑥 + 휀

• 𝛽 =𝜕𝑦

𝜕𝑥 gives the marginal effect of 𝑥 on 𝑦

• If there is a dummy variable D, the model is:

𝑦 = 𝛽𝑥 + 𝛿𝐷 + 휀

• 𝛿 =𝜕𝑦

𝜕𝐷 gives the difference in 𝑦 between the observations for which

𝐷 = 1 and the observations for which 𝐷 = 0

• Example: if 𝑦 is firm size and 𝐷 = 1 if the firm exports (and zero otherwise), the estimated coefficient on 𝐷 is the difference in size between exporters and non-exporters

Page 10: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

10

Interpretation of coefficients (ct’d)

• Often, the baseline model is not a linear one, but is based on exponential mean:

𝑦 = exp (𝛽𝑥)휀

• This implies a log-linear model of the form: ln y = 𝛽𝑥 + ln (휀)

• 100 ∗ 𝛽 is the semi-elasticity of 𝑦 with respect to 𝑥 (percentage change in 𝑦 following a marginal change in 𝑥)

• If the log-linear model contains a dummy variable: ln y = 𝛽𝑥 + 𝛿𝐷 + ln (휀)

• The percentage change (𝑝) in 𝑦 from switching on the dummy is equal to

exp 𝛿 − 1

• You can do better and estimate 𝑝 = exp [𝛿 −1

2𝑣𝑎𝑟 𝛿 ] − 1, which is

consistent and (almost) unbiased

Page 11: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

11

Interpretation of coefficients (ct’d)

• In many applications, the estimated equation is log-log:

ln 𝑦 = 𝛽 ln 𝑥 + 휀

• 𝛽 is the elasticity of 𝑦 with respect to 𝑥, i.e. percentage change in 𝑦 following a unit percentage increase in 𝑥

• Notice that dummies enter linearly in a log-log model, so their interpretation is the one given in the previous slide

Page 12: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

12

ln 𝑦 = 𝛽0+ 𝛽1𝑥 + 𝑢 Log-level Regression 𝑙𝑛 𝑦 = 𝛽0 + 𝛽1𝑥 + 𝑢

Interpretation of coefficients (ct’d)

Page 13: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

b) Introduction to panel data analysis

• Definition and advantages

• Fixed effects (FE) versus random effects (RE) models

• Fixed effects estimator (within transformation)

• Alternative to estimate FE: Brute force OLS (dummy variables estimator)

• Fixed effects estimation in STATA (within transformation)

• Brute force OLS (dummy variables) estimation in STATA

13

Page 14: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

Definition

• Panel data are repeated observations on the same cross section

• Example: a cross-section of 𝑁 firms observed over 𝑇 time periods

14

Firm

Page 15: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

15

Advantages

• There are three advantages of panel data:

1. Increased precision in the estimation

2. Possibility to address omitted variable problems

3. Possibility of learning more about dynamics of individual behavior

• Example: in a cross-section of firms, one may determine that 20% are exporting, but panel data are needed to determine whether the same 20% export each year

Page 16: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

Econometric techniques for panel data analysis

16

1. Pooled OLS 𝑦𝑖𝑡 = 𝛼 + 𝛽𝑥𝑖𝑡 + 휀𝑖𝑡

2. Fixed effects (within transformation) 𝑦𝑖𝑡 = 𝛼𝑖 + 𝛽𝑥𝑖𝑡 + 휀𝑖𝑡

3. Random effects

𝑦𝑖𝑡 = 𝛼 + 𝛽𝑥𝑖𝑡 + 휀𝑖𝑡+ 𝑢𝑖

Same intercept

Different intercept for each individual

Variations across individuals are random

Page 17: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

17

Fixed effects (FE) versus random effects (RE) model

• We start from a panel data model with observable variables 𝑥𝑖𝑡 that vary over time 𝑡 and unobserved cross-section effects 𝛼𝑖:

𝑦𝑖𝑡 = 𝛼𝑖 + 𝑥′𝑖𝑡𝛽 + 휀𝑖𝑡 , 𝑖 = 1,… ,𝑁, 𝑡 = 1, … , 𝑇

• The individual-specific effects 𝛼𝑖 are random variables that capture unobserved heterogeneity

• The fixed and random effects model differ in their assumptions regarding the unobserved individual effect 𝛼𝑖

• RE model: 𝛼𝑖 are not correlated with 𝑥𝑖𝑡: Cov(𝑥𝑖𝑡, 𝛼𝑖)=0

• 𝛽 𝑅𝐸 are efficient and consistent if assumption holds; otherwise not consistent as the error term will be correlated with the 𝑥𝑖𝑡

• Estimation by Generalized Least Squares (GLS) (not covered as rarely used)

• FE model: 𝛼𝑖 are correlated with 𝑥𝑖𝑡

• 𝛽 𝐹𝐸 efficient if FE assumption holds; consistent under both FE and RE assumptions

• Typically used in practice

Page 18: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

18

Fixed effects estimator (within transformation)

• Take the model: 𝑦𝑖𝑡 = 𝛼𝑖 + 𝑥′𝑖𝑡𝛽 + 휀𝑖𝑡

• Take the individual average over time:

𝑦 𝑖 = 𝛼𝑖 + 𝑥 ′𝑖𝛽 + 휀 𝑖

• Subtracting the two equations we obtain: 𝑦𝑖𝑡 − 𝑦 𝑖 = (𝑥𝑖𝑡−𝑥 𝑖)′𝛽 + (휀𝑖𝑡−휀 𝑖)

• OLS estimation of this equation gives the within-estimator (also called

fixed effects estimator) 𝛽 𝐹𝐸

• 𝛽 𝐹𝐸 measures the association between individual-specific deviations of regressors from their individual-specific time averages and individual-specific deviations of the dependent variable from its individual-specific time average

Page 19: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

19

Fixed effects estimator (ct’d)

• There are two potential problems for statistical inference: heteroskedasticity and autocorrelation

• Correct statistical inference must be based on panel-robust sandwich standard errors

• Stata command: vce(cluster id) or robust cluster(id), where id is your panel variable

• For instance, if you observe firms over time, your id variable is the firm identifier

• You can also use panel bootstrap standard errors, because under the key assumption that observations are independent over 𝑖, the bootstrap procedure of re-sampling with replacement over 𝑖 is justified

• Stata command: vce(bootstrap, reps(#)) where # is the number of pseudo-samples you want to use

Page 20: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

20

Fixed effects estimator (ct’d)

• Applying the within-transformation seen above, we do not have to worry about the potential correlation between 𝛼𝑖 and 𝑥′𝑖𝑡

• As long as 𝐸 휀𝑖𝑡 𝑥𝑖𝑡 , … , 𝑥𝑖𝑡 = 0 (strict exogeneity) holds, 𝛽 𝐹𝐸 is consistent

• Note: strict exogeneity implies that the error term has zero mean conditional on past, present and future values of the regressors

• In words, fixed effects gives consistent estimates in all cases in which we suspect that individual-specific unobserved variables are correlated with the observed ones (and this is normally the case…)

• The drawback of fixed effect estimation is that it does not allow to identify the coefficients of time-invariant regressors (because if 𝑥𝑖𝑡 = 𝑥𝑖, 𝑥𝑖𝑡 − 𝑥 𝑖 = 0)

• Example: it is not possible to identify the effect of foreign ownership on export values if ownership does not vary over time

Page 21: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

21

Alternatives to the FE estimator: Brute force OLS (Dummy variables estimator)

• Brute force OLS does not apply the within transformation but estimates the individual-specific effects 𝛼𝑖 by including 𝑁 dummy variables in the model.

• The within transformation and the dummy variables approach result in the

same estimated coefficients, i.e. 𝛽 𝐹𝐸

• However, while 𝛽 𝐹𝐸 are consistent with fixed 𝑇 as 𝑁 → ∞ , 𝛼 𝑖 are not estimated consistently in short panels

• The information for the estimation of 𝛼 𝑖 increases as 𝑇 → ∞, but does not increase as 𝑁 → ∞

• In the case of OLS, estimation of 𝛼 𝑖 does not affect the consistency of 𝛽 𝐹𝐸 . However, in many non-linear panel data models such as the probit model, estimation of the incidental parameter 𝛼 𝑖 leads to inconsistent estimates of

𝛽 𝐹𝐸 (incidental parameters problem)

Page 22: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

22

Fixed effects estimation in STATA (within transformation)

• A variety of commands are available for estimating fixed effects regressions

• The most efficient method is the fixed effects regression (within estimation), xtreg

• Stata’s xtreg command is purpose built for panel data regressions

• Use the fe option to specify fixed effects

• Make sure to set the panel dimension before using the xtreg command, using xtset

• For example:

• xtset countries sets up the panel dimension as countries

• xtreg depvar indepvar1 indepvar2 …, fe runs a regression with fixed effects by country

• Hint: xtset cannot work with string variables, so use (e.g.) egen countries = group(country) to convert string categories to numbers

Page 23: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

23

Fixed effects (within) estimation in STATA (ct’d)

• As with regress, always specify the robust option with xtreg

• xtreg, robust will automatically correct for clustering at the level of the panel variable (firms in the previous example)

• Note that xtreg can only include fixed effects in one dimension. For additional dimensions, enter the dummies manually

Page 24: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

24

Brute force OLS (dummy variables) estimation in STATA

• The fixed effects can enter as dummies in a standard regression

• Regress depvar indepvar1 indepvar2 … dum1 dum2 …., options

• Specify dum* to include all dummy variables with the same stem

• Stata automatically excludes one dummy if a constant is retained in the model

• Possibilities to create dummies:

• Quietly tabulate country, gen(c_)

• Will produce c_1, c_2, etc. automatically

• Then regress depvar indepvar1 indepvar2 … c_*, vce(cluster …)

• Or you can use the i.varname command to creates dummies

• xi: regress depvar indepvar1 indepvar2 … i.country, vce(cluster …)

Page 25: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

25 Source: Oscar Torres-Reyna

Page 26: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

26 Source: Oscar Torres-Reyna

Page 27: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

c) Binary dependent variable models

i. Binary dependent variable models in cross-section

ii. Binary dependent variable models with panel data

iii. Binary dependent variable models in STATA

27

Page 28: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

i. Binary dependent variable models in cross-section

• Binary outcome

• Latent variable

• Linear probability model (LPM)

• Probit model

• Logit model

• Marginal effects

• Odds ratio in logit model

• Maximum likelihood (ML) estimation

• Rules of thumb

28

Page 29: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

29

Binary outcome

• In many applications the dependent variable is not continuous but qualitative, discrete or mixed:

• Qualitative: car ownership (Y/N)

• Discrete: education degree (Ph.D., University degree,…, no education)

• Mixed: hours worked per day

• Here we focus on the case of a binary dependent variable

• Example with firm-level data: exporter status (Y/N)

Page 30: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

30

Binary outcome (ct’d)

• Let 𝑦 be a binary dependent variable:

𝑦 = 1 𝑤𝑖𝑡ℎ 𝑝𝑟𝑜𝑏𝑎𝑏𝑖𝑙𝑖𝑡𝑦 𝑝0 𝑤𝑖𝑡ℎ 𝑝𝑟𝑜𝑏𝑎𝑏𝑖𝑙𝑖𝑡𝑦 1 − 𝑝

• A regression model is formed by parametrizing the probability 𝑝 to depend on a vector of explanatory variables 𝒙 and a 𝐾 × 1 parameter vector 𝛽

• Commonly, we estimate a conditional probability:

𝑝𝑖 = Pr 𝑦𝑖 = 1 𝒙 = 𝐹(𝒙𝑖′𝛽) (1)

where 𝐹(∙) is a specified function

Page 31: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

31

Intuition for 𝐹(∙): latent variable

• Imagine we wanted to estimate the effect of 𝒙 on a continuous variable 𝑦∗

• The “index function” model we would like to estimate is:

𝑦𝑖∗ = 𝒙𝑖′𝛽 − 휀𝑖

• However, we do not observe 𝑦∗ (latent variable) but only the binary variable 𝑦

𝑦 = 1 𝑖𝑓 𝑦∗ > 00 𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒

Page 32: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

32

Intuition for 𝐹(∙): latent variable (ct’d)

• we need a link function F(𝑦𝑖) that takes an observed dichotomous 𝑦𝑖 and gives us an unobserved continuous 𝑦𝑖

• There are two ways of interpreting 𝑦𝑖∗:

1. Utility interpretation: 𝑦𝑖∗ is the additional utility that individual 𝑖 would

get by choosing 𝑦𝑖 = 1 rather than 𝑦𝑖 = 0

2. Threshold interpretation: 휀𝑖 is a threshold such that if 𝒙𝑖′𝛽 > 휀𝑖, then 𝑦𝑖 = 1

• The parametrization of 𝑝𝑖 is:

𝑝𝑖 = Pr 𝑦 = 1 𝒙 = Pr 𝑦∗ > 0 𝑥 = Pr [ 𝒙′𝛽 − 휀 > 0 𝑥

= Pr 휀 < 𝒙′𝛽 = 𝐹[𝒙′𝛽]

where 𝐹(∙) is the CDF of 휀

Page 33: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

33

Linear probability model (LPM)

• The LPM does not use a CDF, but rather a linear function for 𝐹(∙)

• Therefore, equation (1) becomes:

𝑝𝑖 = Pr 𝑦𝑖 = 1 𝒙 = 𝒙𝑖′𝛽

• The model is estimated by OLS with error term 휀𝑖

• From basic probability theory, it should be the case that 0 ≤ 𝑝𝑖 ≤ 1

• This is not necessarily the case in the LPM, because 𝐹(∙) in not a CDF (which is bounded between 0 and 1)

• Therefore, one could estimate predicted probabilities 𝑝 𝑖 = 𝒙𝑖′𝛽 that are negative or exceed 1

• Moreover, 𝑉 휀𝑖 = 𝒙𝑖′𝛽(1 − 𝒙𝑖′𝛽) depends on 𝒙𝑖

• Therefore, there is heteroskedasticity (standard errors need to be robust)

• However, LPM provides a good guide to which variables are statistically significant

Page 34: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

LPM (in a nutshell)

• Models probability as a linear function of X

- simple to estimate and to interpret

- inference is the same as for multiple regression

(need heteroskedasticity-robust standard errors)

• Disadvantages:

- Does it make sense that the probability should be linear in X?

- Predicted probabilities can be <0 or >1!

• These disadvantages can be solved by using a nonlinear

probability model: probit and logit regression

34

Page 35: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

35

example

Page 36: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

36

Page 37: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

37

Probit Logit

𝑭(∙) is the CDF of the normal distribution

Odds Ratio (OR) of Y

Log OR of Y

Logit function : log[OR(y)] = log[y/(1-y)]

This translates back to y = 𝑒𝒙′𝛽 (1 + 𝑒𝒙

′𝛽)

𝑭(∙) is the CDF of the logistic distribution

𝒚∗

Page 38: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

38

Odds ratio in logit model

• In logit, odds ratios may be useful: giving simple interpretation for binary covariate

• The odds ratio OR ≡ 𝑝/(1 − 𝑝) is the probability that 𝑦 = 1 relative to the probability that 𝑦 = 0

• An odds ratio of 2 indicates, for instance that the probability that 𝑦 = 1 is twice the probability that 𝑦 = 0

• For the logit model:

𝑝 = 𝑒𝒙′𝛽 (1 + 𝑒𝒙

′𝛽)

OR = 𝑝/(1 − 𝑝) = 𝑒𝒙′𝛽

ln 𝑂𝑅 = 𝒙′𝛽

(the log-odds ratio is linear in the regressors)

• 𝛽𝑗 is a semi-elasticity

• If 𝛽𝑗 = 0.1, a one unit increase in regressor 𝑗 increases the odds ratio by a

multiple 0.1 Ex. Consider foreign ownership. If Coefficient is 1.24. OR= exp(1.24 ∗ 1) = 3.46

• Foreign-owned firms are nearly 4 times more likely to export than when compared to local firms.

Interpretation of coefficients (ct’d)

Page 39: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

Logit and Probit(ct’d)

39

• The probit model arises if 𝐹(∙) is the CDF of the normal distribution, Φ ∙

• So Φ 𝒙′𝛽 = 𝜙 𝑧 𝑑𝑧𝑥′𝛽

−∞,

where 𝜙 ∙ ≡ Φ′ ∙ is the normal pdf

• The logit model arises if 𝐹(∙) is the CDF of the logistic distribution, Λ(∙)

• So Λ 𝒙′𝛽 =𝑒𝒙

′𝛽

1−𝑒𝒙′𝛽

Source: Hun Myoung Park (2010)

Page 40: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

Predicted probabilities from estimated probit and logit models usually are very close

40

Page 41: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

41

Marginal effects

• For the model 𝑝𝑖 = Pr 𝑦𝑖 = 1 𝒙 = 𝐹 𝒙𝑖′𝛽 − 휀𝑖, the interest lies in estimating the marginal effect of the 𝑗’th regressor on 𝑝𝑖:

𝜕𝑝𝑖

𝜕𝑥𝑖𝑗= 𝐹′ 𝒙𝑖′𝛽 𝛽𝑗

• In the LPM model, 𝜕𝑝𝑖

𝜕𝑥𝑖𝑗= 𝛽𝑗

• In the probit model, 𝜕𝑝𝑖

𝜕𝑥𝑖𝑗= 𝜙 𝒙𝑖′𝛽 𝛽𝑗

• In the logit model, 𝜕𝑝𝑖

𝜕𝑥𝑖𝑗= Λ 𝒙′𝛽 [1 − Λ 𝒙𝑖′𝛽 ]𝛽𝑗

Odd Ratio (OR)

Page 42: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

The marginal impact of changing a variable is not constant, but depending on βi and all variables in the equation.

42

Marginal effects (ct’d)

Pr

(d_e

xpo

rt=1

)

lnsales lnsales

Pr

(d_e

xpo

rt=1

)

To know ME of Xi you have to choose values for all Xj. • Typical options are to set all variables to their means or their medians • Another approach is fixing all xj and let xi vary get predicted probability

of all observations --> average the predicted probability

Page 43: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

43

Maximum likelihood (ML) estimation

• Since 𝑦𝑖 is Bernoulli distributed (𝑦𝑖 = 0, 1), the density (pmf) is:

𝑓 𝑦𝑖 𝑥𝑖 = 𝑝𝑖

𝑦𝑖(1 − 𝑝𝑖)1−𝑦𝑖

Where 𝑝𝑖 = 𝐹(𝒙𝑖′𝛽)

• Given independence over 𝑖’s, the log-likelihood is:

ℒ 𝑁 𝛽 = 𝑦𝑖 ln 𝐹 𝒙𝑖′𝛽 + (1 − 𝑦𝑖) ln (1 − 𝐹 𝒙𝑖′𝛽 )

𝑁

𝑖=1

• There is no explicit solution for 𝛽 𝑀𝐿𝐸, but if the log-likelihood is concave (as in probit and logit) the iterative procedure usually converges quickly

• There is no advantage in using the robust sandwich form of the VCV matrix unless 𝐹(∙) is mis-specified

• If there is cluster sampling, standard errors should be clustered

Page 44: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

44

Rules of thumb

• The different models yield different estimates 𝛽

• This is just an artifact of using different formulas for the probabilities

• It is meaningful to compare the marginal effects, not the coefficients

• At any event, the following rules of thumb apply:

𝛽 𝐿𝑜𝑔𝑖𝑡 ≅ 1.6 𝛽 𝑃𝑟𝑜𝑏𝑖𝑡

(or 𝛽 𝐿𝑜𝑔𝑖𝑡 ≅ (𝜋

3) 𝛽 𝑃𝑟𝑜𝑏𝑖𝑡)

• The differences between probit and logit are negligible if the interest lies in the marginal effects averaged over the sample

Page 45: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

ii. Binary dependent variable models with panel data

• Individual-specific effects binary models

• Fixed effects logit

For more detail, see for ex. Sӧderbom (2009)

45

Page 46: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

46

Individual-specific effects binary models

• With panel data (each individual 𝑖 is observed 𝑡 times), the natural extension of the cross-section binary models is:

𝑝𝑖𝑡 = Pr 𝑦𝑖𝑡 = 1 𝑥𝑖𝑡 , 𝛽, 𝛼𝑖 =

𝐹(𝛼𝑖 + 𝒙′𝑖𝑡𝛽) 𝑖𝑛 𝑔𝑒𝑛𝑒𝑟𝑎𝑙

Λ(𝛼𝑖 + 𝒙′𝑖𝑡𝛽) 𝑓𝑜𝑟 𝐿𝑜𝑔𝑖𝑡 𝑚𝑜𝑑𝑒𝑙

Φ(𝛼𝑖 + 𝒙′𝑖𝑡𝛽) 𝑓𝑜𝑟 𝑃𝑟𝑜𝑏𝑖𝑡 𝑚𝑜𝑑𝑒𝑙

• Random effects estimation assumes that 𝛼𝑖~𝑁(0, 𝜎2𝛼)

Page 47: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

47

Individual-specific effects binary models (ct’d)

• Fixed effect estimation is not possible for the probit model because there is an incidental parameters problem

• 𝛽 are common parameters, i.e. they are common to all observations

• 𝛼𝑖 are incidental parameters, their estimation depends on fixed 𝑇 observations

• Estimating 𝛼𝑖 (𝑁 of them) along with 𝛽 leads to inconsistent estimators of both 𝛼𝑖 and 𝛽; the shorter 𝑇 the larger the inconsistency

• Unconditional fixed-effects probit models may be fit with the “probit” command with indicator variables for the panels. However, unconditional fixed-effects estimates are biased probit depvar indepvar1 indpevar2 dum1 dum2 …

• However, fixed effects estimation is possible with logit, using a conditional MLE that uses a conditional density (which describes a subset of the sample, namely individuals that “change state”)

Page 48: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

48

Fixed effects logit

• A conditional ML can be constructed conditioning on 𝑦𝑖𝑡 = 𝑐𝑡 , where 0 < 𝑐 < 𝑇

• The functional form of Λ(∙) allows to eliminate the individual effects and to obtain consistent estimates of 𝛽

• Notice that it is not possible to condition on 𝑦𝑖𝑡 = 0𝑡 or on 𝑦𝑖𝑡 = 𝑇𝑡

• Observations for which 𝑦𝑖𝑡 = 0𝑡 or 𝑦𝑖𝑡 = 𝑇𝑡 are dropped from the likelihood function

• That is, only the individuals that “change state” at least once are included in the likelihood function

Example

• T = 3

• We can condition on 𝑦𝑖𝑡 = 1𝑡 (possible sequences {0,0,1}, {0,1,0} and 1,0,0 or on 𝑦𝑖𝑡 = 2𝑡 (possible sequences {0,1,1}, {1,0,1} and 1,1,0 )

• All individuals with sequences {0,0,0} and {1,1,1} are not considered

Page 49: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

iii. Binary dependent variable models in Stata

• Limited dependent variable models in cross section

• Panel data applications

49

Page 50: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

50

Binary dependent variable models in cross section

• Probit

• Probit depvar indepvar1 indepvar2 …, options

• Logit – two commands providing identical results:

• Logit depvar indepvar1 indepvar2 …, options

• Logistic depvar indepvar1 indepvar2 …, options

• For logit, chose option “or” to show odds ratios instead of coefficients. For logistic, enter “logit” after the estimation to get the coefficients

• Generally speaking, results from the probit and logit models are quite close. Except in special cases, there is no general rule to prefer one over the other

Page 51: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

51

Panel data applications

• Probit and logit can both be estimated with random effects:

• To obtain probit and logit results with random effects by “id”:

• xtset id

• xtprobit depvar indepvar1 indepvar2 …, re

• xtlogit depvar indepvar1 indepvar2 …, re

• Logit models can be consistently estimated with fixed effects, and should be preferred to probit in panel data settings

• To obtain logit results with fixed effects by “id”:

• xtset id

• xtlogit depvar indepvar1 indepvar2 …, fe

• The “conditional logit” (clogit) estimation should be preferred, however, because it allows for clustered-robust standard errors

Page 52: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

Logit (odds ratio)

52

OR>1 positive effects OR<1 Negative effects

Ex. Being a foreign-owned firm, the OR increase by 3.5 time (78:22) Ex. 1% higher in labor productivity. The OR decreased by 0.45 times (1-0.55) times (35:65)

(Prob>Chi2)<0.05 the model fits the data very well.

Amount of variance of y explained by x

Equivalent to t-values and two-tail p- values

Page 53: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

Logit (marginal effects)

53

The AVERGE predicted probability to export is 41% for a foreign owned firm ,while it is 24% for a local firm.

Page 54: Trade Flows and Trade Policy Analysis Selected econometric... · Trade Flows and Trade Policy Analysis October 2013 Dhaka, Bangladesh Witada Anukoonwattaka Cosimo Beverelli (ESCAP)

Conditional logit (odds ratio)

54

A warning that you may need to check your data, because with some analyses there should be no more than one positive outcome. In this case that isn’t a problem, i.e. there is no reason that firms can’t be in exporting multiple points in time.

17 groups were either all exporting firm or no-exporting firm at all. Fixed-effects models are looking at the determinants of within-subject variability. If there is no variability within a subject, there is nothing to examine.