STATE DEPENDENCE IN MARKOV LABOR FORCE MODELS by Steven … · 1 . Introduction The use of Markov models in economics and other social sciences is well established. Such models have

STATE DEPENDENCE IN MARKOV LABOR FORCE MODELS

by Steven Tynan

offered as partial fulfillment

of the requirements

for a Bachelor of Arts degree

in MMSS

1984

1 . Introduction

The use of Markov models in economics and other social sciences is

well established. Such models have been used for some time as models

of labor force behavior. Recent work in developing models based on an

assumed Poisson-type job offer generating function has been done by

Flinn and Heckman (1982a,b). Recently, Burdett, et al. (1981), Plinn

and Heckman (1983), Tuma and Robins (1980), and many others have used

Markov models to analyze specific features of the labor market.

The independence of the probability of exiting one's current labor

state with respect to both work history and time already spent in the

state is associated with the Poisson assumption of Markov models.

These assumptions contradict recent theories about the job search

process. For example, theory suggests that a worker's past is very

important in determining his ability to get a job. Theory also

suggests that the length of a worker's current labor force spell

affects his probability of remaining there.

Heckman and Borjas (1980) examine the possibility of allowing three

types of non-Markov state dependence to enter into models that are

similar in structure to the Markov models. The Heckman and Borjas

paper presents and adapts a two-state model with Markov origins. The

empirical work done at the end of the paper is called "tentative" by

the authors. It is based on a study of 122 young men and serves

primarily as an illustration of techniques rather than a test of the

validity of state dependence.

Originally this paper was intended to review the work done in this

area and related concepts, such as heterogeneity, and then to retest

for state dependence in labor force state participation using

Denver/Seattle income maintenance experiments (DIME/SIME) data. This

data source shows shows labor force state participation histories for

about 4800 families and has been used by Tuma and Robins (1980),

Burdett, et al. (1981), and Weiner (1982), among others.

Unfortunately, computer problems prevented the use of the DIME/SIME

data, so this paper is largely a review of the work done in the area

of state dependence.

The model presented in Heckman and Borjas (1980) uses two states,

employment and unemployment, thereby grouping the states of

unemployment and out of the labor force into a single non-employment

state. Plinn and Heckman (1983) have shown that there is reason to

believe that the two states are significantly different from each

other. This paper, therefore, utilizes a three-state model with

features that appear in the model in Burdett, et al. (1981), while

maintaining the important features of the Heckman and Borjas model.

Those features and some others are discussed below. It is

important to understand the underlying assumptions leading to the

Poisson-like feature in Markov economic models, so this paper outlines

the development of a single state job offer generating model, through

a two-state model, to a three-state model of labor force state choice

based on stationary and stochastic characteristics.

Current labor theories are a fundamental source of controversy

about the applicability of Markov models of labor. A section after

the models makes a quick, but necessary overview of some of these

theories. The next section defines and discusses four types of state

dependence, including three not allowed in Markov models, and examines

ways to modify models to accommodate for them.

In the next two sections, the problems of heterogeneity in

variables and censoring, or truncation of data, are described and

discussed. The section after that discusses the nature of the

DIME/SIME data that was to be used. In a final section, problems that

would have been encountered in trying to use the Heckman and Borjas

methods with the DIME/SIME data are discussed.

2. A one-state model of the search

To motivate and facilitate more complicated labor force models

later it will be beneficial to examine a model of job search in which

an unemployed worker is viewed as receiving job offers generated by a

Poisson process. A model of this type is presented in Lippman and

McCall (1976) and summarized in Flinn and Heckman (1982b). We will

use that summarized model as a starting point. The one-state model is

presented here roughly following the structure of Flinn and

Heckman (1982b).

A Poisson process

In this and subsequent models, workers (or agents) are assumed to

maximize income. Job search is examined from the viewpoint of an

unemployed worker who incurs an instantaneous cost c of job search.

The agent is assumed to receive job offers generated by a Poisson

process with parameter a independent of the level of c (c > 0). The

probability that the agent gets a job offer in the marginal time

interval At is afit + o(At), where o(At) has the property

lim o(^t} _—> 0. (2.1)

Itô At

Property (2.1) says that the probability of two or more job offers

in the time interval At is negligible. Consider that the job offer

generating Poisson process has a sample space Q on which there is a

probability measure f. There is a non-negative integer ff+(ir) for each

realization w e Q at any time after the start of the process

(i.e. % > Q) . Here ^ . will represent the number of arrivals of job

offers in the interval [0,t].

As an assumed Poisson process, the arrival process in this model

exhibits two familiar properties. First, the number of arrivals

t+tit ~ "t ^n ^ e interval (t,t + fit] depends only on j\t and not on t.

Second, the number of arrivals during (tjt + 2Js1s] (i.e. Jfft.4-) is

independent of , the number of arrivals during [0,t]. Alternatively

stated, the job search process is assumed to. be a continuous

time-parameter process with stationary and independent increments.

The assumptions of stationarity and independence lie at the heart of

the question of the effects of employment history on future

employment. One justification for the Poisson wage arrival assumption

can be found in Burdett and Mortensen (1978).

Using the terminology and reasoning of Cinlar (1975) we can restate

the probability of a job offer in the marginal increment fit as

follows,

lim .— = a (2.2) Z|t-»0 £t

where f{S^ = 1 } is the probability that one job offer will arrive in

marginal time interval £t and a is the Poisson parameter defined

above.

In these terms, the assumption that no more than one offer arrives

in £ft is stated as,

P{N.t > 2} lim <LL- = Q> ( 2 # 3 )

A W O fit

Proofs for (2.2) and (2.3) and additional characteristics of a

Poisson models can be found in Cinlar (1975)•

The search

Each job offer has an associated wage offer. In this model wage

offers are seen as independent realizations of F(x), a wage

distribution common to all agents. F(x) is a known absolutely

continuous distribution with a finite mean.

Wage offers are only available once, so that a worker can not

choose to accept an offer after he has turned it down. Jobs and

workers last forever and employed workers do not participate in the

job search process.

Letting V be the value to an unemployed worker of continuing his

job search, we can show that V consists of three parts plus a

negligible component o(£lt) .

V = _ c 4 t + i l z ^ l v + a ^ E max[x/r;V] + o(4t) , for V>0, 1+rfit 1+rflt 1+rflt

(2.4) = 0 otherwise.

The three non-negligible components of the value of search equation

are: 1) the discounted cost of search in interval At, 2) the product

of the probability of not receiving a job offer (1 - afct) times the

discounted value of search at end of time interval At, and J>) the

product of the probability of being offered a job (a$t) times the

discounted expected value of the more valuable of two options. Those

options are to accept a job offer with present value x/r or to reject

the offer and continue the search with value V.

The first, second, and third non-negligible components correspond

to the first, second, and third terms, respectively, on the right of

equation (2.4) above. The second value of search equation above (V=0

otherwise) indicates that there can he no negative value for search.

An agent for whom V=0 is defined as out of the labor force. In this

one-state model, stationarity demands that an agent may not re-enter

the labor force once he has left.

Robbins (1970) shows that E(|x|)<oo is a sufficient condition for

the existence of an optimal reservation wage policy. That reservation

wage is represented by rV, which is implicitly determined in the

equation

o

c + rV = (a/r) $ (x-rV) dF(x) , for V>0. (2.5) rV

Equation (2.5) [Lippman and McCall (1976)] is obtained by collecting

the terms in (2.4) and passing to the limit. The agent accepts any

offered wage x>rV, so the probability that a wage offer is refused is

P( rV).

An unemployment spell $ a ¥ m exceed length ta contingent upon two

events. The first prerequisite event is that the agent receives j

offers in time interval tu. This event is assumed to happen with

probability

P[j offers | tu] = (atu);]'e-atu/j! , with a>0. (2.6)

The second prerequisite event is that all of the j offers are turned

down by the agent. The discussion of equation (2.5) showed that this

probability is [P(rV)]J.

With independent arrival times and wage offers, the probability

that unemployment spell fu is longer than tu is

P[Tu>ta] = $ ((atu)J/j!)e-atu'[P(rV)]^ = e-a( 1-E( rV)) V ( 2 > ? )

0

This is the product of the two previous probabilities summed over j.

Equation (2.7) is called the "survivor function" in Elinn

and Heckman (1982b). The density of t is clearly

f(tu) = a(l-P(rV))ea^1-F^rV^ta.

To find the density of accepted wage offers (ie. transfers out of

unemployment) it is important to note that this density is the density

of wage offers given that those offers are greater than rV. The

associated random variable is the truncated wage offer random variable

with rV as the lowest point. This density is therefore

f(x|x>rV) = -1^1— , x>rV. (2.8) 1-P(rV)

Because wages offers assumed to be distributed independently of

their arrival times, the product of the densities of spell durations

tj. and of accepted wages x gives the joint density of the two random

variables. That joint density is

g(ta,x) = |a(l-F(rV))expl-a(l-F(rV))ta}} f U )

1-F( rV) (2.9)

= (a-exp{-a(l-F(rV))ta})f(x) , x>rV.

A hazard rate is defined to be the conditional density of exit

times from a state given the amount of time already spent in the

state. The assumptions of this stationary search model generate a

duration model with constant hazard rate

hu(tu) = h(ta)= -d In p[Tu>tu]/dtu = aO-F(rV)). (2.10)

High hazard rates are accompanied by rapid movement from the

unemployment state. Cases of constant hazard rates are very important

within the scope of this paper. Constant hazard rates [ dh( t )/dt =0]

signal the absence of duration dependence.

3. Increasing the number of states

A natural next step given the assumptions and structure of the

Poisson process model of job search above is the presentation of a

Markov model of labor state participation. Models of employment,

unemployment, and labor force non-participation utilizing Markov

assumptions have been widely presented. [see eg. Burdett, et

al. (1981), Plinn and Heckman (1982a,b,1983), Heckman and Borjas

(1980), Tuma and Robins (1980)] Plinn and Heckman (1982b) develop

two-state and multi-state models of sequential job search which serve

as intermediary models below. Those models help to develop the

theoretical link between the Poisson model above and the Markov choice

model to be presented here.

The essential extension made in the two-state model upon the

one-state model above is the expansion of value function (2.4) above

to reflect the agent's two options of employment and unemployment.

The inclusion into the model of the choice between employment and

unemployment is reflected in the addition of the subscripts a or e on

the value variable V. Hence, the value to an agent of a job with a

flow rate of output gn and a termination rate § is defined as V-»(x) .

It is assumed here that one-half of the flow rate of output (x) goes

to the employer while the other half (again x) goes to the worker.

The value to the worker of remaining unemployed and continuing the job

search is now called V ¥u

The value of employment, Ve(x), is the sum of three components plus

negligible component o(&t) . The components are the flow of x over

interval Z\t, the value of a job times the probability of keeping it,

and the value times the probability of unemployment. All three

components are discounted at rate r. The equation for the value of

employment is therefore,

1 Ve(x) = _{x6t + 0-sfit)V (x) + s£tVu} + o(4t). (3-1)

1 +r£t

This becomes V (x)=(x + sVn)/(r+s) when the terms are collected and

At is passed to 0.

The agent is assumed to be indifferent between a job with value

Ve(x*)=Vu and unemployment. Therefore x* is the reservation wage and

has the value

Ve(x*) = (x* + sVa)/(r+s) = Va, (3-2) so

x- = rVu.

The equation for the value of unemployment is similar to value

equation (2.4-) above, with a few changes. With the introduction into

the model of an employment state, equation (2.4) becomes

1+rflt 1+rflt 1+rflt

The three non-negligible terms on the right of (3«3) above again

represent the discounted worth to the agent of 1) the cost of job

search, 2) the value times the probability of continued unemployment,

and 3) the probability of a potential job match times the expected

maximum of two values. Here the options in three are taking a job

with value Ve(x) or remaining unemployed with value Vu.

Note that a no longer represents the Poisson parameter of a job

offer generating function, but is now the rate of arrival of

encounters between agents and potential employers. This rate is

assumed to be a differentiable function of L, the number of agents.

It is assumed that a rises with L, so a=g(L), g(0)=0, and g'(l)>0 for

all L20.

Collecting the terms of equation (3.3) and passing to the limit

yields

rVu+c = a 5 (Ve(x)-Vu)dF(x) rV a °°

r+s rV (x-rVu)dF(x) , for Vu>0. (3-4)

If &s§(i) is n0"t sufficiently large to insure that Yu>0, fie., if

a<c(r+s)/ j xdF(x)] job search does not take place. 0

As in the one-state model, the hazard rate for exit from

unemployment is fca(tu) = a(l-F(rVu). Of course, the hazard rate for

employment, fre(te), is the termination rate s.

The next step from this model to a three-state Markov model of

labor state participation is straight forward. Appendix B of Flinn

and Heckman (1982b) shows the expansion of the one- and two-state

models to include a third state of out of the labor force.

4. 3-state Markov model

Having assumed, for the sake of argument, the underlying one-state

Poisson model of job search, it is straight forward to model labor

force participation state choice as a Markov decisions process. For

the structure of the three-state labor state participation model

below, I am indebted to both a two-state Markov model appearing in

Heckman and Borjas (1980) and a three-state model appearing in

Burdett, et al. (1981) .

Consider an agent who is either employed, unemployed or out of the

labor force at time %. Initially, we will assign fn(t) , where

nteN={e, o, or u} , as the respective probabilities of the agent being

in a given state at time t. As in the models above, a central feature

of this model is the Poisson assumption that the probability of

changing state more than once in a marginal time interval At is

negligible.

Burdett, et al. formalize the idea that underlying the agents'

observed choices of labor state are stationary as well as stochastic

time varying characteristics associated with individual agents. Each

agent is identified by a stationary characteristic vector x (§ X.

[Note that this vector x is not to be confused with the wage offer,

scalar x, in the one-state model. The vector might, however, include

the agents wage or expected wage as a stationary variable, for

example, as ^ in x= (x1 ,x2, . . . ,xk) ] If characteristics are not

stationary as assumed or if differences in these characteristics

between agents are overlooked, the problem of heterogeneity arises.

This problem will be discussed below.

Also associated with each agent at any given time t is vector

y(t) €t T of stochastic time varying characteristics, labeled

"disturbances" by Burdett, et al. The disturbances are paired, many

to one, with one of the time varying state choices. That is, each

y(t) is only one pair (y(t),n(t)), while each of the three n(t)'s

might be in many. X and T are real vector spaces.

Choice of a state at time t is determined by the current values of

x and y(t). Since x(t) is assumed to be stationary, changes in n(t)

result only from changes in y(t). Since this is a Markov model, the

probability of such a change is assumed to be conditional on the

current state and the stationary vector, but it is thought to be

independent of the current y(t). The probability that an agent

transfers from one state to another in interval lt is therefore equal

to the conditional probability that the agent' s disturbance will be in

the subset of disturbances for which the second state is preferred at

t + At, given n(t) and x.

The probability that the disturbance of a worker with vector x in

state n at time t will be in a subset A at t + fit (ie.

P[y(t + /it) = Aj n(t) = n and x]) can be called ^n(A|x,£.t), stressing

the link between state choice and disturbance. Assuming that this

probability divided by At has a limit as 4t—>0, that limit can be

written to show that it is the instantaneous probability of a change

in disturbance times the distribution of new disturbances given a

change.

fVA|x,dt) iim _ = ft ( x ) U (dy;x) (4 .1 ) £t->0 fit

/I

The expected time to the first change is 1/h (x) and P (dy;*) is the

new disturbance distribution.

Given this instantaneous probability of a change to state n

associated with disturbance subset A, it is necessary to restrict A

such that it corresponds with the optimal state choice m(t) € N.

Am(x) = {y€Y|Vm(x,y)=max Vn(x,y)! , mCN, (4-2)

where Vn(x,y) , n£N, is a stationary characteristic contingent utility

indicator.

The event that an agent changes from any optimal state n at t to

any new optimal state m at t + fit requires that his disturbance

changes to an element of A^, given x and n. The probability of this

event is therefore

Pm(^tfx) = P{n(t + £t)=m| n(t) = n and x}

= P{y(t + £t)4Am(x)| n(t) = n and x} (4.3)

= ^ m(Aj x,6t) , m^n.

Dividing this probability by ^t and passing to the limit gives the

state to state transition rates

a ( x ) = l i r a ™ 4 W 0 4 t _ (4.4)

= f\ n^x) J X U y j x )

for all n and m Q N optimal at times t and t + 4t, respectively, where

ri^m. The transition rate is shown as the product of the incremental

probability of changes in y(t) (h n(x)H) times the probability that

the resulting y makes the state m optimal.

The transition rate ^ ( x ) above should be interpreted such that

fj^jC^t^) = a^Cx^t + o(t) where o(t) has property (2.1) from the

Poisson model above. In the context of testing for state dependence,

"the a (x)!s are determined by firm hiring and firing practices

relative to worker characteristics. With the Markov assumption still

intact, the rates depend solely on the states in question.

The probability of finding a worker in a given state m at time

t + £t is the sum over all n of the products of switching from the

original state to m in fit times the probability of being in the

original state at t. For example, the probability of being employed

after the time increment is

Pe(t+4t,x)=(aee(x)^t)Pe(t,x) + (aae(x)^t)Pu(t,x) + (aoe(x)4t)P0(t,x)

+ o(t). (4-5)

Note in equation (4-5) that the probability of remaining in the

same state from time ^ "to $ ± £% is (ann(x)(fit + l) +o(t)). This

property is important when explaining pure Markov state dependence,

the idea that those already in a state are the ones most likely to be

there at a later date if the time increment is short enough.

Continuing on with the example of employment above, we see that the

change in the probability of employment over <Qt is

Pe(t+6t,x)-Pe(t)=(aee(x)£t)Pe(t,x) + (aue(x)/St)Pu(t,x)

+ (aoe(x)/H)P0(t,x)+o(t) . (4-6)

Dividing by $t and passing to the limit gives

dPe(t,x)/dt = aee(x)Pe(t,x) + aue(x)Pu(t,x) + aoe(x)PQ( t ,x) . (4-7)

In general, the probability of being in state n after interval$t and

the first derivative of the probability are

Pn(t+fct,x) = (ann(x)At + l)Pn(t,x) + (amn(x)5t)Pm(t,x)

+ (aln(x)4t)P1(t,x) + o(t) (4.8) and

P;(t,x) = ann(x)pn(t,x) + amn(x)Pm(t,x) + aln( x)Px( t ,x) (4-9)

for distinct 1, m, and n in N.

Since Pe(t,x) + Pu(t,x) + P0(t,x) = 1, Pg(t,x) + P^t.x) +

P (t,x) = 0. In other words, because the entire population is

accounted for by the three states, any change in the probability of

inhabiting one state is made up for by an equal and opposite change

divided among the other two probabilities. Therefore (temporarily

ignoring x)

(aee+aeu+aeo)pe^t) + aue+auu+auo)pu^t)+(aoe+aou+aoo)po(t) = 0- (4-10)

Prom the discussion above, it can be seen that for non-degenerate

states, iLnn(x)<0 and anm(x)>0, n^m. Therefore, ann(x) + a ^ x ) +

an-j_(x) = 0 for all possible combinations of distinct n, m, and 1 in N.

The hazard rate associated with any state is the conditional density

of exit times given the amount of time already spent in the state. In

this Markov model the hazard rate associated with any state is the

constant escape rate,

hn(T'|x) = an(x) = ? anm(x) , n,m£N. (4-11)

The resulting duration in n is tn, a random variable with the

conditional distribution function

P{tn<Tn|x} = 1 - exp{-Tnan(x)}. (4-12)

The unconditional probability distribution at t is the solution to the

differential equation system given in (4.9).

5• Theories of state dependence

Fundamental to the notion of state dependence in labor market

models is an economic theory generally known as scar theory. In order

to discuss state dependence below, it is essential to quickly focus

some general ideas about scar theory. Ellwood (1982) and Heckman and

Borjas (1980) present some very general and short ideas about theories

that suggest state dependence. The ideas behind the theories,

nevertheless, are very important.

Ellwood's discussion of scar theory is centered on the loss of work

experience during teenage unemployment. The ideas are easily related,

however, to general questions of work experience.

Human capital theory assumes that workers, especially early in

their careers, acquire experience or education. This acquisition is

viewed as an investment in "human" capital. This early investment

leads to concave aggregate age-earnings profiles. Loss of early

employment opportunities shifts back human capital investment and

delays increased earnings. These delayed earnings must be discounted

and total lifetime earninge might be truncated by a short career.

Unemployment later in life also involves some loss of investment.

The greatest amount of investment is undertaken by the young, however,

and later unemployment is not assumed to he as costly.

Dual lahor market theory also suggests some degree of state

dependence. Its theorists hypothesize that the main signals of

ability available to employers are education and work record. Agents

who are not in school and who show long or frequent spells out of a

job are assumed to have poor labor force attachment. These people are

poor risks.

The mirror image of scar theory which suggests the presence of

state dependence in employment is brought about by job- or

firm-specific human capital investment. Over their tenures at a

certain job, workers are assumed to develop skills which are valuable

to their employers. This suggests that the longer an agent is

employed, the more likely he is to remain in the same job. Unemployed

workers who have experience are also more attractive to employers with

similar jobs available. This suggests that their chances of being

hired are dependent upon previous occurrences of hiring.

6. Four types of state dependence

The models above exhibit two familiar key features that should be

stressed again. First, the length of time already spent in a given

spell has no effect on the rate of exit from that spell. This feature

is due to the exponential parameterization of the model and is, in

fact, unique to exponential models. Second, the labor market history

of an individual is inconsequential in determining his rates of exit

from spells. Effectively, these features preclude any structural

dependence on the state occupied, except that the model allows an

agent to remain in a state more easily than he can switch states in a

small interval.

Heckman and Borjas (1980) discuss and label four types of

structural state dependence that might exist in sequential labor state

participation models. The four types are pure Markov state

dependence, which is allowed in the Markov model above, and duration,

occurrence, and lagged-duration dependence, which are not. This

section will discuss these four types and some alterations which

Heckman and Borjas suggest to accommodate for the final three types.

Pure Markov dependence

Section 4 above has already alluded to the idea of pure Markov

state dependence. Consequently, and because it does not create major

theoretical difficulties, this discussion will be very short.

Continuous-time discrete state Markov models allow the probability

of entering a state n to differ from the probability of remaining in

that state n. Taken to the extreme, this suggests that for very small

fit's, the probability of changing states goes to zero. Prom the

probabilities given above it is clear that

lim a (x)(dt > 0 and lim (a ( x ) 4 t + 1) > 1 (6 .1 ) Awo™ d w o

for all distinct n and m in IT. Markov state dependence also allows

the probability of entering one state, say e, from another, say u, to

differ from the probability of entering first state from a third, say

o.

Duration dependence

The idea of duration dependence captures the notion that the length

of time spent in a state during a given spell will affect transition

from that spell. In the topic of labor state participation, this

encompasses two ideas. First, each unit of time spent in an

employment spell might diminish the chances of being fired. Second,

for either non-employment state, continued lack of a job might affect

transition rates either positively, due to decreasing reservation

wage, or negatively, due to decreasing attractiveness of the worker to

employers.

The exclusion of duration dependence is a result of the exponential

functional form assigned to exit time distributions in the model

above. Exponential distributions uniquely ignore time spent in the

spell in determining exit times. The model can be adjusted to allow

or to test for duration dependence by replacing equation (4-12) above

with some conditional distribution function f (T|X) where T is the

time in the state before termination. Pn(T|x) has conditional density

fn(T|x), is not affected by previous spells, and cannot be

exponential.

Recalling the concept of a hazard rate function, we see that the

hazard rate is

fn(T|x) h (T|x) = — (6.2)

1-Fn(T|x)

With these hazard functions for states, it is possible to derive the

distribution functions and densities which are

"T

Fn(Tlx) = 1-exp{- Jh (u|x)du} (6.3) and

fn(T|x) = hn(T|x)'exp-{ hn(u|x)du} , n£ N. (6.4)

c

If 4hn(Tlx)/dT>0, rf=N, the hazard rate increases with T, so we say

positive duration dependence exists. If the derivative is negative

(or 0), negative (or no) duration dependence exists.

Occurrence dependence

The idea behind what Heckman and Borjas call occurrence dependence

is that each spell spent in a given state by an agent and the total

number of those spells might affect the agent' s future probability of

re-entering the state once he has left it. The theoretical basis for

such state dependence in labor markets is found in scar theory as

discussed above and in the concept of human capital.

Since allowing occurrence without duration dependence removes only

the restriction against work history affecting exit times, time

already spent in the current spell is still assumed not to affect exit

times. Exponential distribution functions can still be used when

allowing for occurrence dependence alone.

To modify the model above to allow occurrence dependence, it is

necessary only to distinguish between the escape rates a^(x) for each

spell based upon the number h of previous spells. This can be

accomplished in either of two ways depending upon the underlying

assumptions. If only spells in state a are thought to affect a (x),

the hazard rate for an agent with h previous spells of state m can be

designated &!}(x). If previous spells in any state in N can affect the

&n(x)'s, designate the hazard rates as a^1*I(x) , where h, i, and j are

the number of spells in 1, m, and n. For simplicity we will assume

the first case. If i£(x) a** (x) for h^h1 , occurrence dependence is

said to exist. If ar(x) > a**~'(x), for all h, increased incidence of

a state decreases the expected waiting time of returning to the state.

Lagged duration dependence

If the concepts of occurrence dependence and duration dependence

are combined, they suggest that the total time previously spent in

certain spells affects the current probability of exiting a state.

This is called lagged duration dependence. To introduce lagged

duration dependence into the model it is necessary to restate the

hazard function as a function of past and present state durations. If

there is no current state duration dependence, the derivative for time

spent in the present spell is zero. The lagged duration hazard rate

is

hn(Tjjij|x) = gn(T^,...,T

1e,Ti,...,T

1a,TJ,...,T

10!x), n£N (6.5)

If duration, occurrence, and lagged duration dependence are allowed in

model, the hazard function above can be written as

h h i j ( T h i j J x ) . g h i j ( . } > n 4 N ( 6 . 6 )

7- Heterogeneity

The problem of heterogeneity has been widely discussed in the

literature cited for this paper. [see Ellwood (1982), Plinn and

Heckman (1982a,b), Heckman and Borjas (1980), Salant (1977)] Broadly

interpreted, the term "heterogeneity" refers to any differences in

observed or unobserved variables, either among members of a population

or for an individual over time. These differences in variables can be

further classified as "pure heterogeneity" or "state dependent

heterogeneity" as is done in Heckman and Borjas (1980). In its more

precise usage, the term "heterogeneity" is commonly used to refer to

pure heterogeneity in unobserved components.

Pure vs. State Dependent Heterogeneity

The major problem due to heterogeneity in a sample population is

caused by the inability to always distinguish between pure

heterogeneity and state dependent heterogeneity. The essential

distinction between the definitions of these two terms lies in the

cause of the changes in observed or unobserved variables.

Differences in variables are termed as "pure heterogeneity" if

changes in those variables are not determined by the outcomes of

participation in any specific state of the labor pool (ie.

employment, unemployment or out of the labor force). While purely

heterogeneous characteristics may change over time, their changes are

not a function of state participation. As exogenous variables to the

state selection process, these components affect the process, but they

are not affected by it.

Pure heterogeneity, as defined in the previous paragraph, creates a

problem that has been called "the Mover-Stayer problem". Basically,

the problem is that pre-existing differences among agents might

falsely indicate that a worker's participation in a state affects his

escape rate from that state. These differences must therefore be

controlled for. A major conceptual problem arises in trying to

control for this heterogeneity. Salant (1977) discusses this problem.

Theories of job search predict a constant or decreasing reservation

wage for individual job seekers. The escape rates for these agents

should therefore remain constant or rise. This seemingly plausible

assumption seems, however to contradict empirical evidence of a

declining aggregate escape rate.

The apparent contradiction above can be explained by assuming the

presence of pure heterogeneity. In fact, differences among agents in

the determinants of a reservation wage are necessary to prevent the

distribution of wages from collapsing to a single wage.

The assumption of pure heterogeneity among workers explains the

contradictions discussed above. Search theory predicts a constant or

rising escape rate among individuals, but pure heterogeneity allows

groups of individuals to have different constant rates. In any state

there will be agents with high escape rates and agents with low escape

rates. Those with high rates are termed "Movers" while those with low

rates are termed "Stayers". As time passes, the Movers will, on

average, leave more quickly while Stayers will generally remain in the

state. Thus, while each individual might have a constant escape rate,

the aggregate escape rate falls over time. Failure to account for

pure heterogeneity in unobserved variables across agents will lead to

an over-estimation of duration dependence.

Heterogeneity in unobserved variables that is caused by or affected

by the state selection process (ie. state dependent heterogeneity)

can cause problems in the detection of duration dependence as well.

Spells in a given state may affect different agents differently. For

example, unemployment might inflict different costs on one worker than

it does on another. In such a case, it is difficult to measure

comparative losses. Such changes in variables, however, are affected

by the state selection process, and in turn, affect it. Unlike

variables exhibiting pure heterogeneity, variables with state

dependent heterogeneity have joint distributions which are partially

determined by the history of the process.

Even if there is no state dependence in observed variables, it is

possible to find it in unobserved variables. In any event, failure to

correctly control for heterogeneity in dynamic models can seriously

bias the estimation of state dependence.

Functional forms of job search model parameters

The theoretical solution to the problem of heterogeneity in the

estimation of state dependence in dynamic job search models involves

assigning functional forms to the parameters of the model. In

principle, the parameters of the search model are functions of

observed or unobserved variables. Parameters associated with

observables can be estimated. Unobservables must be assigned

distributions with parameters that can be estimated and then

integrated out.

In practice, the problem of functional form selection arises when

trying to use the method outlined in the previous paragraph. Economic

theory commonly does not suggest functional forms that might be

applicable to a given parameter. Aside from certain qualitative

guidelines, the selection of functional forms for unobservables seems

to be quite arbitrary.

It is obvious that heterogeneity and the associated problem of

functional form selection are important problems in the study of state

dependence. In fact, Plinn and Heckman (1982b) report that an

unpublished paper by Heckman and Singer suggests that estimates of

parameters for unobservables are very sensitive to functional form

choice. The Heckman and Singer paper does, however, suggest a

non-parametric estimator of unobservable variable distribution

functions to partially diminish the arbitrariness of functional form

selection. Flinn and Heckman (1982b) demonstrate this method of

controlling for heterogeneity, first in observables and then in

unobservables, for the one-state search model above. Heckman and

Borjas (1980) use a simpler method of parameterizing the hazard

functions. Their analysis deals with log-normal and Weibull

distributions and suggests guidelines for more general

parameterizations.

For their analysis, Heckman and Borjas assumed that the explanatory

variables in vectors x and y are spell-specific, remaining constant

throughout a spell. This assumption helps make the DIME/SIME data

especially well suited to the Heckman and Borjas methods at least from

the standpoint of observahles. As Weiner (1980) points out, each

DIME/SIME spell has an associated vector of variables measured at the

beginning of the spell. The assumption might seem artificial, but for

many variables such as education or age, initial values are most

important. Especially for employment spells, most variables are most

important at the beginning of the spell for making hiring decisions.

The procedure for controlling for heterogeneity

Following the techniques used by Heckman and Borjas, each

stochastic state-determining vector y(t) explained above is separated

into an exogenous explanatory vector z(t) and an unmeasured

explanatory vector v(t). These vectors actually depend on the states

of origin ,n, and destination, m, and might depend on some or all

previous spells. Their true representations should therefore be

zniJ(Te'---'Tl'Tu'*--'Tu'To'---'To) a n d vnmJ(,)' b u t f o r s i mP l i c i ty we

will refer to them as z and v, with only those labels that are needed

for clarity. Stationary characteristics are still contained in vector

x, but extreme care must be taken in classifying a characteristic's

stationarity.

One example in Heckman and Borjas uses these vectors to

parameterize the hazard function assuming a Weibull distribution.

With this assumption the hazard is distributed as

hn(Tn|x) = a n y n ( T n ) ^ -1 - (7.1)

With this distribution,Y n>1 signifies positive duration dependence,

V n<1 signifies negative, and "7^=1 signifies the lack of duration

dependence with an exponential distribution as in Markov models.

Introducing the possibilities of occurrence and lagged-duration

dependence, the hazard is distributed as

^./JkllogT^+i 4 n(k)logii-kl. (7.2)

The < / (k), p&N, are parameters which equal 0 if waiting time

distributions for the current spell are unaffected by visits to state

P-

To parameterize the Weibull hazard rate to reflect heterogeneity in

z and v, write

ahi^ = exp-{BhiJzhiJ + vhiJ) (7-3) n r 'rn n n ; w y'

Where p is a parameter vector with the appropriate dimension. One

extreme treatment of unmeasured pure heterogeneity sets v as a vector

of constants for all combinations of spell counts hij and for all

states n. The other extreme treatment lets the v's be mutually

independent for each work history and in any current state. A

treatment that is more intuitively appealing sets vj| = crj1*' 'f where

c is a spell-specific factor loading coefficient vector and y is &

person-specific vector of constants.

Heckman and Borjas also show that in general, characteristics can

be allowed to change within a spell while still remaining stationary

for small, discrete time periods. To allow this non-stationarity

within spells, write the hazard for a spell starting at calendar time

X [h^CTJjiJ)] as

h S i J C T S- z S ( T S + ' t : } - T S " 1 - • • • » T i » T i » • • • » T l » T i i i . v S 1 J ^ S + " ^ ) • ( 7 - 4 )

The density function for exit times from the hth spell of n, Tz, given

the unobservables at all times within the spell fv •'(a +T )» a<I 1 is

h^^Têxpf- ^ hhij(a)du|. ((7-5)

Breaking the spell down into periods in which characteristics are

constant, we can replace the integral with a finite sum to obtain

likelihoods conditional on unobservables. The unobservables can then

be integrated out.

8. Censoring

Miller (1981) discusses three classes of censored observations. A

censored observation is one which contains only partial information

about the random variable in question. In the context that is used

here, censoring results from a time truncation of the period over

which labor force spells are observed.

Miller's initial classification of censored spells determines

whether we observe only partial information due to a time limit or due

to a limit on the number of observations. Miller classifies these

causes as Type I and Type II, respectively. Miller's third class of

censoring, random censoring, contains the type of spells with which we

are concerned. Spells in this class might be truncated due not only

to a pre-set ending time for the experiment, but also due to agents

leaving the sample before the end of the experiment. In the DIME/SIME

data, some mid-experiment truncation resulted from interview refusals,

expulsion due to fraud, and relocation out of the Denver or Seattle

areas.

The distinction between right and left censoring is also discussed

briefly in Miller's book. Right censored spells are truncated such

that an ending time is not observed. Eor left censored spells, no

starting time is observed and the spell is joined in progress. It is

clear that the DIME and SIME experiments caught some spells already in

progress and left some spells still in progress. When selecting a

span of the experiment from which to obtain observations, however, it

is advisable to pick a period near the end of the experiment. In this

way, a labor market history of at least two or three years can be

associated with each spell. Since the spells in question took place

after the surveys had been conducted for a while, their starting times

can probably be observed. We need to be concerned, then, primarily

with random right censored observations.

Formally, S.. ,Sp, • • • ,S_ are the iid censoring times associated with

spell lengths f, ,T2,.-.,Tn (not necessarily iid). For each spell, we

see either the true or the censored length and we know if the spell is

censored or not. That is, we observe (S1 ,I1 ),...,(Dk,Ik) where

Di = minCT^Si) = T± A S±. (8.1)

I± = l(Tij<Si) = 0 if censored (ie., Ti>Si)

= 1 if not (ie., T^Sj) . (8.2)

With this type of random censoring it becomes crucial when

estimating parameters to make the assumption that §. and S are

independent of each other. For this to be so, attrition must be

random and not a result of the labor state selection process. This

assumption seems reasonable for all three causes of attrition;

expulsion, interview refusal, and relocation. It must be pointed out,

however, that there is a slight possibility that interview refusals

(and a very slight possibility that mobility) is related to the

agent's job status. Even if there is a relationship between state and

attrition, however, it is even less likely that the censoring is

related to the length of the spell.

Maximum likelihood estimation with random censoring

For reasons discussed in the final section of this paper, I feel

that the method of maximum likelihood is the best estimation method

for the problem at hand. Miller explains the use of maximum

likelihood estimators in his second chapter.

Each uncensored spell (ditl) has a likelihood function defined by

the density f(dA). The likelihood for a censored spell (dÔ) is

defined by the survivor function

S(d.) = 1-FUi) = P{T>di}. (8.3)

The likelihood for a spell that might or might not be censored is

therefore

LUî.) = f(d.)i S U . )1 " 1 , (8.4)

so the likelihood for the sample as a whole is

n

J^Kdîi) =Tf f(di)"^S(di), (8.5)

where ' and " are products over uncensored and censored spells,

respectively. Recall that this likelihood function assumes the

independence of true and censored spell lengths.

If J| = (p. , . . . ,p )' is a vector of parameters, maximizing L(jB) over

all J3 is equivalent to solving for p in the likelihood equations

sp.iog Kp) = ^^ . log igCdîi) = i | i o g fe(d i)4-nj1°« V d i ) = 0.

Solving generally requires iteration on a computer. Miller (1981)

gives several methods of solution.

9• DIME/SIME data

A test of state dependence using the methods of Heckman and Borjas

requires individual state duration data. A history of continuous

spells of employment, unemployment, and non-participation must he

acquired or constructed for each individual in the survey. The

National Longitudinal Survey of Young Men (NLS) used by Heckman and

Borjas provides such histories. Unfortunately, the sample size used

by Heckman and Borjas was necessarily very small to avoid

heterogeneity and spell censoring.

The Public Use Files of the Denver and Seattle Income Maintenance

Experiments (DIME/SIME) provide the information needed to construct

labor market histories for a much larger sample of workers. It is

likely that the increased reliability of estimates due to the greater

sample size would far outweigh the added problems brought about by

increased diversity in the sample. The addition of heterogeneity and

censoring must be controlled for in the manners discussed earlier.

It is also important not to over-emphasize the extent of the added

diversity in the new sample. It must be remembered that while the

young men in the NLS sample may seem similar to each other, they are

not totally homogeneous. Introducing and controlling for an element

of heterogeneity might, in fact, lead to more accurate estimates than

assuming that similar workers are homogeneous.

The primary purpose of the DIME and SIME surveys was to monitor

the effects of negative income tax on labor supply, participation in

transfer programs, and marital stability. Eour years of panel data

sets were compiled into the Public Use Piles. In Denver, about 2800

families were surveyed between January, 1971 and December, 1974* In

Seattle, about 2000 families were surveyed from January, 1971 through

December, 1973. Each of the families that remained in the program was

surveyed at regular intervals throughout the test periods.

At the outset of the experiments, participants had to meet a series

of requirements. Weiner (1982) says that these requirements were not

as strictly enforced after the experiment was underway. At the

outset, enrolled families had to have either two heads or one head

with at least one dependent. According to Weiner, this requirement

was later slackened. The heads of these families were required to be

between 18 and 58 and able to work.

Pre-experiment earnings were restricted to below $9000 for

single-head families (based on a family size of four) and to below

$11,000 for double-head families. Both Burdett, et al. (1981) and

Weiner (1982) note that these single year earnings restrictions might

not exclude as many higher-income families as might be expected.

Transitory downward fluctuations in income allowed some families with

normally higher incomes to participate. The exclusion of middle

income families resulting in estimates that show "higher rates of

leaving employment and lower rates of entering employment than the

population at large" noted in Tuma and Robins (1980) might not be as

serious as it seems

For each spell, the length of the spell and a vector of individual

characteristics measured at the beginning of the spell were recorded.

The characteristics were education, race, age, number of children,

marital status, wage, liquid assets, and skill level. Wages were

recorded only for spells of employment, but expected wages can be

estimated for the other two states by regressing against some of the

other variables. Since wages were reported for different years, they

must be deflated. Liquid assets included stocks, cash, checking

accounts, equity in homes or cars, and so on. Skill level was

classified as either skilled or unskilled and was determined by the

worker's anticipated ability to find work in a spot auction market.

Since transition between states undoubtedly depends not only upon

individual characteristics, but also on cyclical forces, an attempt

must be made to correct for any non-stationarity in the environment.

Denver experienced little fluctuation in their economy over the sample

period, so we need to be concerned only with corrections for Seattle.

Weiner (1982) suggests the use of two local cyclical measures: the

Conference Board's monthly (standardized months) help-wanted ad index

and the Employment Security Department of Washington's Seattle/Everett

SMSA unemployment rate.

Attrition from the sample resulted due to interview refusals,

expulsion due to fraud, or relocation. Depending upon the techniques

used to select spells for observation, it might be necessary to

correct for this censoring as well as the censoring of spells that

were still in progress at the end of the experiment.

Finally, since sixty percent of the families in these studies were

given negative income tax treatments, their histories are probably not

representative of society at large. These "test" families should be

excluded. Those families in the control groups still provide a large

sample of labor state histories. Even when the DIME/SIME data is

limited to that from the control families, it provides a larger sample

to test the methods and findings of Heckman and Borjas (1980) than the

NLS data.

10. Problems in matching the data to the methods

Undoubtedly, a number of problems would be encountered while trying

to use the methods of Heckman and Borjas to test for state dependence

in DIME/SIME spells. A number of these problems have already been

discussed in the contexts of previous sections. I am sure that I

haven't even thought of a good many others. Three conceptual problems

that I feel are very important, however, will he discussed here.

Those problems are the selection of the correct variable to measure,

selection of an assumed distribution for unobservables, and selection

of a method of estimation.

Selecting observation variables

The problem of variable selection can take two forms. First, there

is the problem of selecting a sufficient number of appropriate

explanatory variables. A second problem is the question of what

exactly to measure to determine the durations of past or current

spells. These measurements are then used to estimate the parameters

important in determining state dependence.

The first problem is a moot question with respect to the DIME/SIME

data. Explanatory variables have already been observed, so the only

question is whether to use all of them. Obviously, it is wise to use

as many as possible without over-complicating the analysis. The

important question, then, is what to use as an indicator of a person's

work history.

Possible indicators of an agent's work record are the number of

spells, the length of individual spells, or the total length of time

spent in a given state 1, m, or n. The number of previous visits to a

state can be misleading because it fails to say anything about time

spent in the spell. An agent might have very few spells of

unemployment and therefore seem to be able to hold a job, when in

actuality, he has a few very long unemployment spells.

Average lengths of individual spells, on the other hand, fail to

capture the number of transitions. A person with short unemployment

spells might be thought to be employed most of the time, when his

unemployment spells are really so short because he has so many. The

best duration variable to measure, then, is total time in a given

state.

Selecting distributions for unobservables

As has been said above, the problem of assigning distributions to

unobservables for the purpose of integration has very arbitrary

solutions. Nevertheless, a distribution must be assigned. The

solutions listed here are very tentative and undoubtedly will lead to

some error. Some structure is needed, however, and hopefully the

error will be minimized.

There is nothing to indicate that most unobservables will not be

distributed about a mean with decreasing frequency of observation with

increasing distance from the mean in either direction. That is to

say, it seems harmless to assume a distribution that at least looks

somewhat like a Normal distribution. For some values of a and , the

Weibull distribution demonstrates a bell-shaped curve of this type.

Considering the Markov-exponential history of the model (a special

case of the Weibull) it is appealing to think of the variables as

Weibull-distributed.

If evidence should exist that contradicts the assumption of a

Weibull distribution, it is reasonable to assume a Normal or

log-Normal distribution Procedures that can be used under either

assumption are well documented in the literature.

Selection of an estimation method

A final problem area deals with the choice of a method to estimate

parameters given the observations of an experiment like DIME/SIME.

Certain properties of the regression procedure make it unsuitable for

use with the DIME/SIME data. I would use the maximum likelihood

method mentioned above.

Regression procedures have two very serious limitations which are

noted in Heckman and Borjas (1980). First, regressions using censored

observations result in serious bias. Maximum likelihood estimators,

as it has been shown, are well suited for censored data. Second,

regression techniques make it difficult to introduce explanatory-

variables. Since the DIME/SIME data contains a significant number of

censored spells and explanatory variables must be introduced to

decrease the possibility of spurious, heterogeneity dependence,

maximum likelihood methods must be chosen over regression.

11. Conclusion

Heckman and Borjas (1980) develop some very important procedures to

test for state dependence in labor force transitions. They tested

these methods with a very small study of young men's labor force state

participation. Their test found no duration or occurrence dependence

after controlling for heterogeneity. The test was, however, really

just a demonstration of the procedures. A true test remains to be

done on a large sample such as the DIME/SIME data.

REFERENCES

Burdett, Kenneth, et al. A Markov Model of Employment, Unemployment, and Labor Force Participation; Estimates from the DIME Data, The Center Tor Mathematical ~STudies Irf Economics and Management Science, Northwestern University Discussion Paper No. 483, May 1981 .

Burdett, Kenneth and Dale Mortensen. "Labor Supply under Uncertainty," Research in Labor Economics, Vol. 2 (1978), pp. 109-157-

Cinlar, Erhan. Introduction to Stochastic Processes. Englewood Cliffs, N.J.: Prentice-Hall, Inc., 1975-

Plinn, Christopher J. and James J. Heckman. "Models for the Analysis of Labor Force Dynamics," Advances in Econometrics, Vol. 1 (1982a), pp.35-95. See Erratum in Vol. 2.

Plinn, Christopher J. and James J. Heckman. "New Methods for Analyzing Structural Models of Labor Force Dynamics," Journal of Econometrics, Vol. 18, No. 1 (1982b), pp. 115-168.

Plinn, Christopher J. and James J. Heckman. "Are Unemployment and Out of the Labor Force Behaviorally Distinct States?" Journal of Labor Economics, Vol. 1, No. 1 (Jan. 1983), pp. 28-42.

Ellwood, David T. "Teenage Unemployment: Permanent Scars or Temporary Blemishes?" in Richard B. Freeman and David A. Wise, eds., The Youth Labor Market Problem. Chicago: The University of Chicago Press, 1982.

Lippman, S., and J. McCall. "The Economics of Job Search: A Survey, Part I," Economic Inquiry, Vol. 14 (June 1976), pp. 155—189 •

Heckman, James J. and George J. Borjas. "Does Unemployment Cause Future Unemployment? Definitions, Questions and Answers from a Continuous Time Model of Heterogeneity and State Dependence," Economica, Vol. 47, No. 187 (Aug. 1980), pp. 247-283-

Miller, Rupert G. Survival Analysis. New York: Wiley, 1981.

Robbins, H. "Optimal Stopping," American Mathematical Monthly, Vol. 77(1970), pp.333-343-

Salant, Steve. "Search Theory and Duration Data: A Theory of Sorts," Quarterly Journal of Economics, Vol. 91 , No. 1 (Feb. 1977), pp. 39-58.

Tuma, Nancy Brandon and Philip K. Robins. "A Dynamic Model of Employment Behavior: An Application to the Seattle and Denver Income Maintenance Experiments," Econometrica, Vol. 48, No. 4 (May 1980), pp. 1031-1052.

Weiner, Stuart. A Survival Analysis of Adult Black-White Unemployment Rate Differentials, Ph.D. dissertation, Northwestern University, 1982.

ACKNOWLEDGEMENTS

Although a paragraph such as this suggests that this paper is of a higher quality than it actually is, I owe and offer my gratitude to the following people. Any shortcomings of this paper are my own fault as all contrihutions by those listed below were purely positive. Dale Mortensen has served me as seminar co-leader, professor, advisor, reference source, bibliography, library, tutor, and department chairman throughout the year. Michael Dacey also served as seminar co-leader. Prior to and perhaps even more important than that, he attempted to teach my classmates and me the fundamentals of stochastic processes even though he was suffering from very poor health at the time. Erhan Cinlar, who wrote the text, continued our instruction in stochastic processes while Professor Dacey recuperated.

Although I have never met him, I would like to thank James J. Heckman of the University of Chicago. Much of my most important source material was written either by Professor Heckman with an associate or by professors in the basement of Andersen Hall or elsewhere on Northwestern's campus. Most of all, I thank everyone in the program for giving me a second chance after I messed up the first one.

Documents

STATE DEPENDENCE IN MARKOV LABOR FORCE MODELS by Steven … · 1 . Introduction The use of Markov models in economics and other social sciences is well established. Such models have