Upload
others
View
0
Download
0
Embed Size (px)
Citation preview
Pareto Improving Coordination Policies in QueueingSystems: Application to Flow Control in Emergency
Medical Services
Hung Tuan DoSchool of Business Administration, The University of Vermont, Burlington, VT, [email protected]
Masha ShunkoKrannert School of Management, Purdue University, West Lafayette, IN, [email protected]
One of the well known methods to improve performance in a queueing system is implementing coordination
policies that balance the load among servers. However, in decentralized queueing systems where each service
agent can decide whether to participate in the coordination mechanism or not, a sustainable policy has
to not only benefit the system, but also be beneficial to all agents. In particular, agents are willing to
participate in a coordination policy only if the performance of their individual queue is not hindered and if
their revenues are not decreased. As a motivating example, we use the emergency medicine setting, in which
emergency departments (EDs) act as independent agents and overcrowding in the EDs has direct impact on
the quality of service. In such setting, EDs are interested in seeing improvements in performance measures
that address the expected number of patients (or expected census, which is a widely studied and applied
metric in the emergency medicine literature) and the risk of having high census. We focus on reducing the
expected census, the variance of census, the probability of having high census, and the expected census in
the overcrowded state; and propose classes of coordination policies that provide improvement on all of these
measures for all agents. In addition, agents who receive revenue based on the processed load, are interested
in preserving the long-term average load. Hence, our proposed classes of coordination policies guarantee that
the expected arrival rate and hence, the expected revenue, is preserved for each agent in the system. We
include a discussion of the implementation issues and propose a policy that is easy to implement in practice.
Key words : Flow control, Queueing control, Ambulance coordination, Pareto improvement, Stochastic
orders
1. Introduction
Emergency medicine in the United States is in a critical condition: the number of emergency
departments (EDs) in the nation is decreasing while the number of ED visits is increasing, which
results in substantial patient crowding (Eitel and Samuelson 2011). One common approach to
reduce crowding in the EDs is to control the inflow of patients using coordination of ambulance
traffic, with an intent to balance the patient load between the EDs servicing a geographic area.
One of the well known coordination policies in the U.S. is ambulance diversion, which allows
individual EDs to signal to an emergency medical service (EMS) that the ED is crowded and
therefore, the ambulance traffic should be diverted to other EDs in the area. The EMS dispatcher,
in turn, passes these signals to the ambulance crews and advises the latter on better destination
1
2
EDs. Since EMSs in the U.S. are largely governed by state regulations, there exists a large variety
in EMS structures and policies, including different variations of diversion policies implemented
in different states (Gundlach 2010). Not surprisingly, there has also been a lot of debate in the
media and in the emergency medical literature concerning the overall usefulness of ambulance
diversion and different diversion policies. In fact, the state of Massachusetts banned ambulance
diversion as of January 1st, 2009. Using data from Massachusetts before and after the ban, Burke
et al. (2013) show that ambulance diversion did not reduce the median length of stay in the ED
and conclude that there is thus no value in the ambulance diversion. Deo and Gurvich (2011)
provide one explanation for the non-effectiveness of the simple ambulance diversion mechanism: in
the presence of competition, equilibrium behavior of the EDs that attempt to minimize expected
wait time subject to a service level constraint is for all emergency departments to go on diversion
simultaneously, which is not helpful for load balancing.
We conducted interviews with emergency medicine professionals in the City of Pittsburgh and
found a similar aversion to diversion: The City of Pittsburgh has two hospital groups in the area
that operate several EDs each. Currently EDs do not actively use ambulance diversion and do
not coordinate traffic, even between the EDs that belong to the same hospital group; a decade
ago, however, diversion had been a common practice and the city still has an IT system in place
to implement signaling (Guyette 2010). We collected data on ED visits to three neighboring EDs
that belong to the same hospital group for a period of one year. We analyzed the data and found
the evidence of load imbalance between the EDs; in particular, we observe census in each ED at
the beginning of each hour during the year and compare it to the capacity of the ED (including
beds and treatment spaces), we find that in 22% of such observations, only one out of three EDs
was overcrowded; in 17 % - two EDs were overcrowded while the third ED had idle capacity. This
data illustrates that there exist multiple times when one ED is overcrowded while another ED has
idle capacity. During such times, load balancing should be beneficial to the system. Nevertheless,
discussion with the administrative personnel at the involved EDs revealed the following: One of
the reasons why the EDs in the area do not actively engage in diversion is independent decision
making by the involved EDs and EDs’ unwillingness to lose potential patients (Guyette 2010).
This observation motivates an interesting research question, which is novel in queueing control: in
a decentralized queueing system with independent agents that can decide whether to participate in
the coordination system or not, what coordination policy will be sustainable and beneficial to all
agents and the system as whole?
One of the challenges in identifying a good coordination policy for a queueing system is the fact
that there is no single commonly accepted performance measurement. We use the EMS context to
motivate different performance measurements that are based on the stochastic properties of the
3
number of patients in the system. In EMS context, several studies have looked at ED coordination
using ambulance diversion from the perspective of the average wait time reduction (see e.g. Deo
and Gurvich (2011)), which is equivalent to the expected queue length reduction by Little’s Law.
In emergency medicine literature, crowding is commonly defined based on the total number of
patients in the ED (McCarthy 2011), commonly referred to as census, and we adopt expected
census as one of the performance metrics for our study. While this metric is undoubtedly important
from the patients’ and EDs’ perspectives, there are additional metrics that EDs need to consider:
• What is the probability that an ED is overcrowded? In the emergency medicine setting, many
metrics used to evaluate EDs’ performance are based on the ability of EDs to provide care within
a certain period of time, for example, the percentage of acute myocardial infarction patients with
door-to-balloon time less than 90 minutes as imposed by the American College of Cardiology,
and/or the percentage of time that the ED is on diversion (or overcrowded) (Hopp and Lovejoy
2012). We proxy such metrics using the probability of having high census in the ED.
• What is the variability of the number of patients in the system? Basic operations management
theory suggests that process performance may be improved if variability is reduced. Hence, one
goal of a coordination policy could be to reduce the variability of patient arrivals. Using a queueing
model of a supply chain, Do (2012) shows that variability in the order arrival process per se is
not the key, rather, performance of the system depends on correlation or synchronization of the
arrival process and the service process; and, control policies that reduce and smooth out the queue
improve the performance of the queueing system. Hence, we seek coordination policies that reduce
the variance of census in the system.
• What is the expected number of patients in the ED given that the ED is overcrowded? Lower
than usual census is not problematic for EDs, however, high census often leads to negative conse-
quences (Moskop et al. 2009, Sprivulis et al. 2006). This indicates that EDs are also interested in
improving the right-tail behavior of the number of patients in the system: reducing the frequency
and extent of extreme high observations of census. We capture this objective by measuring the
expected census in the ED conditional on being overcrowded.
All of the above mentioned objectives (expected census, probability of having high census, vari-
ance of census, and expected census conditional on being overcrowded) are important to queueing
system participants, and a single or composite measurement may not capture all of the objectives
in a fair way for all contexts. Hence, we consider multiple performance measures that address
the questions raised above and propose a class of coordination policies that guarantees Pareto
improvement on all objectives (or thus, on any polynomial combination of these objectives with
non-negative weights).
4
Moreover, in a decentralized queueing system where the service agents receive revenue based on
the processed load and/or who compete based on market share, the agents’ are unlikely to accept a
coordination system that may decrease their long-term average load. Based on our interviews with
emergency medicine professionals in the City of Pittsburgh, this observation is consistent with their
view of the problem and the EDs are concerned about ensuring that the average arrival stream
of patients does not decrease as a result of a coordination mechanism (Guyette 2010). Hence, in
order to provide Pareto improvement, a good coordination policy should also preserve the average
arrival rate to each participating agent.
Our goal is to propose classes of Pareto improving coordination mechanisms that operate as
follows. Central controller (EMS) designs and announces the coordination policy. The agents (EDs)
have a choice whether to participate in the coordination policy or not; and we assume that if the
policy is Pareto-improving, the agents choose to participate. Participation in the system involves
the following: agents (EDs) reveal truthful information to the controller (EMS) and the con-
troller (EMS) uses this information to make coordinating decisions (in our case, decisions to divert
patients). Practically speaking, EDs may provide access to their census tracking information sys-
tem to the EMS or EMS may employ some monitoring mechanism to ensure truthful information
sharing (e.g. random inspections using independent observers similar to the ones used by the Joint
Commission to evaluate EDs and/or ambulance crews). Notice also that our goal is not to find
the optimal coordination policy (which will differ based on the particular measure combinations
that can be different for different EDs and EMS systems), but rather to propose a class of policies
that provides a structure that guarantees that all players are better off and, hence, the players
can find the optimal policy for their particular system and objectives within our defined class.
A pertinent issue is whether our proposed policy is sustainable; we will show in Section 2.2 that
neither individual EDs nor EMS have an incentive to withdraw from the coordination mechanism
and hence, all EDs participating in the coordination policy is an equilibrium.
Do and Iyer (2013) use flow control policy in a supply chain setting with two retailers facing
independent arrival streams and placing orders to a single manufacturer. In their setting, the
retailers can use price promotions to control their arrival streams and the authors show that given
information on the manufacturer’s queue size, under a simple promotion policy with threshold
structure, the queue at the manufacturer is stochastically smaller with smaller variance that leads
to cost savings for both retailers and the manufacturer. In our paper, we model and analyze a
queueing system (e.g., a network of EDs) with multiple single-server queues that face a shared total
arrival stream with exogenous rate, which means that a change in flow to one queue impacts the
flow to the other queues, and that is controlled by a central coordinator (EMS in our motivating
healthcare example). We propose a class of coordination policies with a threshold structure imposed
5
on conditional expected arrival rates to each server and demonstrate that any policy in the class
reduces and smoothes each queue, and thus guarantees Pareto improvement on multi-dimensional
performance measures for each player in the queueing system. We demonstrate how the commonly
utilized ambulance diversion policy can be modified to satisfy our proposed threshold structure.
Then, we show how a policy in the proposed class can be enhanced to provide an even higher
improvement for each player based on the multi-dimensional performance criteria.
Deo and Gurvich (2011) analyzed a policy with full diversion with a focus on optimizing a single
objective function (expected wait time), they looked at a decentralized setting where diversion
threshold are set by the EDs and the centralized setting where EMS assigns diversion thresholds
to each ED in the network; this centralized full diversion policy is not Pareto improving as one of
the EDs may be worse off in terms of the expected arrival rate and/or in terms of the expected
wait time, and hence, is not sustainable. Moreover, as is clear from Deo and Gurvich analysis, the
decentralized full diversion policy analyzed in their paper has negative externalities: it can trigger
gaming behavior of the EDs, which leads to a poor equilibrium solution. We propose a policy class,
which is Pareto improving on multi-dimensional performance measure (and hence it works for any
individual objective function of EDs and EMS with general and reasonable properties presented in
Section 7.3) and does not incentivize gaming.
Gurvich and Perry (2012) consider a service network (e.g. a call center) operated under an
overflow mechanism where calls are routed to a dedicated service station with a finite buffer. When
the buffer is full, calls are forwarded to an overflow station. The authors provide an approximation
for the overflow processes via limit theorem and prove asymptotic independence between dedicated
stations and overflow station. Although, overflow is relevant to diversion flow, there are fundamental
differences between this model and ours: a) there is no dedicated overflow station that only processes
overflow demand in our model and b) diversions occur if the destination ED is crowded and there
is available capacity elsewhere in the network.
To summarize, we contribute to the queueing control and EMS coordination literature by finding
classes of coordination policies that preserve the long-term average arrival rate and guarantee that
the census in each ED is stochastically smaller, has smaller variance, and smaller expectation
conditional on being in overcrowding state. Using EMS setting as motivating example, we develop
a subclass of policies within our proposed class, which is based on the idea of an existing ambulance
diversion policy. As opposed to the existing ambulance diversion policy, which does not preserve
the average arrival rate and can hurt performance of one of the network participants (an ED), our
subclass guarantees that all participants are better off.
The rest of the paper is organized as following. In Section 2 we introduce our model of the ED
network and formulate the performance measurements. In Section 3 we formally introduce the
6
class of coordination policies and provide an example of a practical policy within the proposed
class. In Section 4 we prove that the proposed class of policies is Pareto improving; we provide
numerical experiments to illustrate our findings and demonstrate the sensitivity of our results
in Section 5. In Section 6, we propose a policy that can be easily implemented in practice and
numerically evaluate the performance of this policy. In Section 7 we discuss different applications
and extensions for our policy class, also provide a technique for improving coordination policies,
and discuss implementation nuances. We conclude in Section 8.
2. Model
We consider a network of N non-symmetric EDs in a region. The region serviced by the EDs
faces a total arrival rate λ. Not all patients can be rerouted from their intended destinations:
e.g. most walk-in patients, patients insisting to go to a specific ED cannot be routed against
their will by the ambulance crews, patients whose condition does not favor rerouting, etc. Hence,
we split the arrival rate into two streams: controllable arrivals λc and uncontrollable arrivals λu,
such that λc + λu = λ. Similarly to Deo and Gurvich (2011), we assume that the arrivals follow
Poisson process and present the model and its analysis with this assumption for ease of exposition.
In emergency medical setting, however, arrivals may be more variable and/or more bursty than
Poisson, for example, due to accidents that result in multiple casualties. Our result is robust to any
memoryless arrival process and we demonstrate in Section 7.5 how our analysis can be extended
to accommodate Markov-Modulated Poisson Process (MMPP) for arrivals.
Without coordination, the arrival rate to each ED k is λok, which consists of two streams: uncon-
trollable stream with the rate λuk , and controllable stream with the rate λck such that λok = λuk +λck.
Let the total rate to the network be∑k
λok = λ which consists of the total uncontrollable stream rate
λu =∑k
λuk and controllable stream rate λc =∑k
λck. A controllable stream may include ambulance
patients, who are willing to change destination ED and/or walk-in patients who are willing to self
re-route upon getting (or observing) information about the ED congestion status. An uncontrol-
lable stream may include patients whose condition (or strong will) requires a certain destination
ED and/or walk-in patients who are not able or not willing to of self re-route. We assume that
λok is an equilibrium solution for the network - i.e. EDs are not able to manipulate λok anymore
without a policy change.
Let P be a Markov and stationary coordination policy that splits the controllable arrival stream
λc among different EDs based on the census at all EDs. Then, under any coordination policy P, the
arrival rate becomes a function of the census. Let ~LP = (LP1 ,LP2 , ...,L
PN) be the random vector of
the number of patients in each of the N EDs under the coordination policy P and ~l= (l1, l2, ..., lN)
be the realization of the number of patients at the N EDs. Then, Λk(~LP) represents the random
7
variable for the arrival rate to the k’th ED under coordination policy P1. Service rate at ED k
equals µk and µ=∑k
µk. We assume that µk >λok ∀ k (which implies that µ> λ), this assumption
guarantees that the queueing system under no coordination is stable.
2.1. Comparing performance of coordination policies.
As mentioned in the introduction, a good coordination policy should perform well with respect to
multiple measures, and should provide incentives to the EDs to participate in the system. First,
we introduce a set of four performance measures on which we will compare coordination policies.
Then, we introduce a constraint, without which the EDs will not be willing to participate in the
coordination system.
1. One of the widely used metrics for measuring the EDs’ and hospitals’ performance is the
average length of stay in the ED (Hopp and Lovejoy 2012). This metric represents the expected
time in system, which is equivalent to the expected number of patients in the ED (using Little’s
Law). Hence, one of the performance metrics for a coordination policy is the expected number of
patients in the ED (or expected census), E[LPk ], and to say that policy P2 performs better than
P1, we require:
E[LP1k ]≥E[LP2k ]. (1)
2. As argued earlier, another important performance comparison measure is probability that the
ED is overcrowded or that the number of patients in ED k is larger than a critical threshold Sk
(e.g. Sk may represent ED’s capacity): P [LPk ≥ Sk]. To say that policy P2 performs better than
policy P1 with respect to the probability of exceeding the critical threshold, we require:
P [LP1k ≥ Sk]≥ P [LP2k ≥ Sk]. (2)
3. Next, we consider the variance of the number of patients in the ED (defining variance of
random variable X as V [X] = E[(X − E[X])2]). Smaller variability of the number of patients in
the ED is attractive for multiple purposes: for example, EDs can reduce their operational costs
because of smoother resource scheduling, and EMS can make better decisions since the crowding
information has less noise.In order to say that policy P2 is preferred to policy P1 with respect to
variance, we require:
V [LP1k ]≥ V [LP2k ] ∀ k. (3)
1 Notice that both the arrival rate Λ and the number of patients in the ED L depend on the policy P; for cleanerexposition, we suppress one superscript and use Λk(~LP) instead of ΛPk (~LP).
8
4. Finally, we capture the patient’s delay risk by quantifying the expected census in the ED
given that the patient finds this ED overcrowded. Let α represent a service level, such that the
ED k is considered to be overcrowded if the number of patients in the ED exceeds Value-at-Risk
V aRα[LPk ], where V aRα[X] is the left-continuous inverse of FX (CDF of random variable X):
V aRα[X] = F−1X (α) =min{x : FX(x)≥ α}. Then, for a given value of α we use the following risk
measure that captures conditional expected census in overcrowded state:
CEoverα [LPk ] =E[LPk |LPk ≥ V aRα[LPk ]].
This measure is of particular importance from ambulances’ perspective because it addresses the
expected number of patients in the ED during the problematic (overcrowded) times - the times
when diversion decisions need to be made; and also from the quality of service perspective because
the risk of adverse care outcomes is higher during overcrowded times. Notice that the conditional
expectation in overcrowded state measure is equivalent to the definition of the Lower (or Tail)
Conditional-Value-at-Risk measure commonly used in financial risk analysis: CV aR−α [X] = [X|X ≥
V aRα[X]] (Rockafellar and Uryasev 2002). This measure has been used as a performance measure
of wait time in healthcare simulation studies, see e.g. (Dehlendorff et al. 2010). In order to say
that policy P2 is preferred to policy P1 with respect to conditional expected census in overcrowded
state, we require:
CEoverα [LP1k ]≥CEover
α [LP2k ] ∀ k. (4)
Inequality 4 is sufficient to provide weak improvement to the EDs and we will use this inequality
in the definition of Pareto improvement. Later, however, we will also show a stronger result that
under our proposed class of policies, the difference between the conditional expected census in
overcrowded state (CEoverα [LP1k ]−CEover
α [LP2k ]) is greater than the difference between the expected
census (E[LP1k ]−E[LP2k ]).
In addition to defining performance metrics, we also impose a condition that guarantees that the
players involved (the EDs) have an incentive to participate in the proposed coordination mecha-
nism. One key factor determining EDs’ market share and revenues is the number of patients served.
Hence, in order to provide an incentive for the EDs to participate in any coordination policy P, the
policy needs to guarantee that the expected arrival rate of patients E[Λk(~LP)] to ED k is at least as
large as the time-average arrival rate to ED k without coordination: E[Λk(~LP)]≥ λok. Notice that
we assume that all patients are similar and bring in the same amount of revenue, our work can be
easily extended to the case with multiple patient classes and we discuss this extension in Section
7.1. While some EDs may have preservation of the long-term arrival rate not as their primary
concern, this condition can also be viewed as preservation of long-term fairness in the system that
9
is important from the system’s viewpoint to sustain participation in the policy in the long-term.
We refer to this constraint as our participation condition (in expected arrival rate). Notice that
since the total arrival rate to the network of all EDs is fixed, the only way to guarantee that the
participation constraint is satisfied for all EDs is to impose the following necessary condition:
Condition 1. Participation Condition. E[Λk(~LP)] = λok for all k.
Participation Condition 1 preserves the expected revenue as long as the revenue is an absolutely
bounded function of expected arrival rate (which is also bounded), which is the case in reality.
However, the difference between census at a specific ED under the proposed policy and census
under no coordination can go to infinity as time goes to infinity: Let NPk (t) be the number of
patients who have entered ED k by time t under policy P. The arrival rate to ED k is then limt→∞
NPk (t)
t
under policy P and limt→∞
Nok (t)
twithout coordination. Participation Condition 1 is then equivalent
to limt→∞
NPk (t)
t− lim
t→∞
Nok (t)
t= lim
t→∞
NPk (t)−Nok (t)
t= 0. It is then possible that NP
k (t) − N ok (t) = ±o(t),
where o(t) goes to infinity slower than t and hence, limt→∞
NPk (t) − NO
k (t) = limt→∞±o(t) = ±∞. If
this is a concern in practice, practitioner can impose the following restriction to avoid this issue:
|NPk (t)−N ok (t)| ≤M <∞ ∀ t, where M could be a large positive number.
All of the objectives introduced above and captured by inequalities 1, 2, 3, and 4 are important
along with the Participation Condition 1. Instead of creating a single composite performance
metric that encompasses these criteria, we focus on finding Pareto improving policies, where Pareto
improvement is defined as follows:
Definition 1. Pareto Dominance. A policy P2 is Pareto improving over policy P1, P2 ≥P P1, if
it satisfies Participation Condition 1 and inequalities 1, 2, 3, and 4, where at least one of these
inequalities is strict.
Later, in Section 7.3, we summarize the conditions under which our results extend to different
composite objective functions.
2.2. Decision making and participation
In our model, the diversion policy is announced by the central controller, EMS, who also makes
diversion decisions. EDs have a choice of whether to participate in the coordination system or not.
Participation in the system involves sharing truthful information with the EMS; namely, the EDs
report their diversion threshold to the EMS (diversion threshold is set once and cannot be changed
once announced) and continuously reveal census information. We further assume that the EDs
participate in the system if the coordination policy is Pareto improving: participation condition
is met and performance of the ED is at least as good on all performance measures and is strictly
better on one of the performance measures introduced above. It is important to note that it is
10
incentive compatible for the EDs to participate in the coordination mechanism and that no ED
has an incentive to withdraw from the coordination system: Assume that out of N EDs, 1 ED
decides to withdraw from the system. Since the expected arrival rate stays the same within the
network of participating EDs (by Participation Condition 1), withdrawing ED will not gain any
advantage in terms of the expected arrival rate and moreover, will be strictly worse off in at least
one of the performance measures. Hence, as long as preservation of the expected arrival rates and
the defined performance measures are important for the EDs, there is no incentive to withdraw
from the system, which implies that our proposed policy is an equilibrium.
Participation in the system in our model also implies that the EDs reveal truthful information
about their congestion to the EMS; for example, through giving the EMS controller access to
their census tracking information system (this can be implemented in practice when, for example,
the central control is performed by the hospital group that owns the individual EDs in the area
and hence, can have access to the internal information system). If the EDs do not provide direct
information access, we assume that they truthfully reveal census status. It is theoretically possible
for the EDs to benefit from misrepresenting their census information in the short-term; however,
we assume that EMS can check and/or monitor the truthful behavior of the EDs and hence,
untruthful behavior will be punished and the ED will be removed from the coordination system,
which is not beneficial to the ED in the long-term. We leave this short-term issue outside of the
scope of our paper and assume truthful revelation of the census information, which is in the best
long-term interest of the ED. Moreover, from the social perspective, the ED that makes the census
information (or other information that can help improve the performance of the system and the
well-being of patients) public may be viewed as socially responsible, which creates an additional
incentive for the EDs to truthfully share information.
2.3. Benchmark policy
If the Participation Condition 1 is not satisfied or the policy does not provide a Pareto-improving
solution, we assume that the EDs are not willing to participate in the diversion mechanism and
share information (lk) and hence, coordination is not possible. This policy is representative of the
situation in the City of Pittsburgh in 2009 (Guyette 2010). We use such a case as a benchmark
and denote it as policy Po. Under policy Po, the arrival rates to each ED, λok are a result of
an equilibrium that ambulances and patients achieve over time without information lk. In the
numerical section, we also compare one of our proposed policies to the current policy of ambulance
diversion, denoted with PFD for Full Diversion, and demonstrate that Full Diversion does not
provide a Pareto improving solution to all players. Full Diversion policy PFD is identical to the
policy analyzed by Deo and Gurvich (2011).
11
Now we define the class of all Pareto-improving coordination policies as:
P , {Pi :P i ≥P Po}. (5)
3. Coordination policies with threshold structure.
In this section we introduce a class of coordination policies for the queueing system controller.
Again, we use EMS setting as a motivating example. A commonly used ambulance diversion policy
has a threshold structure: when an ED reaches a certain state, typically expressed as a threshold
based on the number of patients in the ED, it announces diversion; and the EMS attempts to
route all traffic away from the ED on diversion. Such policy, however, does not satisfy Participation
Condition 1 because the resulting arrival rate to such ED will be altered and some of the EDs in
the network will see a decrease in revenue and in market share. Moreover, this policy may hinder
performance according to one or more performance measures for some of the EDs in the network.
Hence, such policy may be not sustainable in practice and we propose an enhanced coordination
mechanism, which still has threshold structure, but in addition, preserves the expected arrival rate
to each ED and guarantees improvement according to all performance measures.
Next, we define a condition on the expected arrival rate that has the following threshold structure:
For a given threshold mk (selected by the EMS), the condition ensures that the expected arrival
rate to ED k is greater than the time-average (equilibrium) arrival rate until this threshold and
lower than the time-average arrival rate above the threshold.
Condition 2. Threshold Condition. For all Pi ∈PT , there exists a finite threshold mk > 0 such
that: {E[Λk(~L
Pi)|Lk = l]≥ λok, ∀ l ≤ mk and ∃ l≤ mk s.t. E[Λk(~LPi)|Lk = l]>λok
E[Λk(~LPi)|Lk = l]≤ λok, otherwise.
Policy class PT represents coordinating mechanisms that satisfy Participation Condition 1 and
Threshold Condition 2. The class PT contains multiple policies; notice, however, that policy Po 6∈
PT because there is no l such that E[Λk(~LPo)|Lk = l]>λok.
Next, we describe a subclass of polices from class PT : policy subclass PCD, where diversion is
C ontrolled. The policies in this subclass are of special interest to us since they closely resemble
full diversion policy that is been used in practice and that has been analyzed in Deo and Gurvich
(2011) with a single objective to minimize expected waiting time; however, in contrast to the
full diversion policy, at least one of the diversion rates in subclass PCD is bound from above
such that the Participation Condition 1 is satisfied. We will later analyze policies within the
subclass using numerical experiments, which allow us to provide additional insights on setting
policy parameters and lets us quantify and assess potential performance improvements that can
result from implementing a policy in class PT .
12
3.1. Policies with Controlled Diversion (Subclass PCD)
Consider a case with two EDs with finite diversion thresholds S1 and S2 that indicate some capacity
measurement. Assume that the diversion thresholds S1 and S2 are fixed by the EDs and are
truthfully revealed to the decision maker (EMS)2. The arrival rates to EDs 1 and 2 under the
current (equilibrium) scenario without coordination are λo1 and λo2. We propose a sample subclass
of control policies as follows. Let l1 and l2 be the censuses (observable by EMS) of ED 1 and ED
2, respectively. The diversion rates d12 ∈ [0, λc1] and d21 ∈ [0, λc2] are used as outlined in Table 1.
Since diversion rates can take multiple values, there are multiple policies in this class (we denote
a sample diversion policy from this class with P(d12,d21)D ), each policy is Markov and stationary,
P(0,0)D is equivalent to Po. We let PCD represent all policies that follow routing rules specified in
Table 1 and that have (d12, d21) controlled such that Participation Condition 1 is satisfied.
This class of policies resembles the ambulance diversion policy that is widely used in practice and
is analyzed in Deo and Gurvich (2011); in their paper EDs announce diversion when they reach the
threshold Sk and the EMS tries to divert all controllable traffic from these EDs, hence, we refer to
this policy as full diversion: PFD ≡P(λc1,λ
c2)
D . The difference in our policy is that the diversion rate
is controlled by the EMS: instead of diverting all traffic from ED 1 to ED 2, only d12 is diverted;
with this control, the policy ensures that each ED receives the same number of patients on average
as in the non-coordinated equilibrium state and hence, EDs maintain their revenue and market
share. We note that in some EMS systems, there are attempts to control the diversion rates: e.g. in
several counties in California, there are strict regulations on how long the EDs are allowed to stay
on diversion (California Healthcare Foundation 2009); such policy effectively adjusts the diversion
rate.
ED 2
ED 1
l2 <S2 l2 ≥ S2
l1 <S1Rate to ED1: λo1 Rate to ED1: λo1 + d21
Rate to ED2: λo2 Rate to ED2: λo2− d21
l1 ≥ S1Rate to ED1: λo1− d12 Rate to ED1: λo1Rate to ED2: λo2 + d12 Rate to ED2: λo2
Table 1 Sample diversion policy P(d12,d21)D .
The arrival rates to EDs depend on the diversion rates and the vector of the number of patients in
the ED: Λ1(d12, d21, ~LP(d12,d21)D ) and Λ2(d12, d21, ~L
P(d12,d21)D ). Since the rates are deterministic, we will
denote the arrival rate with λk(d12, d21, ~LP(d12,d21)D ) = Λk(d12, d21, ~L
P(d12,d21)D ). For a fixed diversion
2 Once EDs choose the diversion thresholds and announce them to the EMS, S1 and S2 are fixed and cannot bechanged. Our results hold for any pair of thresholds. Furthermore, our analysis can accommodate the case whereEMS also optimizes over S1 and S2: all results on Pareto improving policies remain the same.
13
rate d12, the rate d21 has to be chosen such that Participation Condition 1 holds for each ED.
Hence, we need to solve the following equation to find d21:
E[λk(d12, d21, ~LP(d12,d21)D )] = Pr(L1 <S1 And L2 <S2)λo1 +Pr(L1 ≥ S1 And L2 ≥ S2)λo1 +
(λo1 + d21)Pr(L1 <S1 And L2 ≥ S2) +
(λo1− d12)Pr(L1 ≥ S1 And L2 <S2) = λo1. (6)
It is clear that the system described by policy P(d12,d21)D is ergodic and hence, the stationary
distribution exists.
Lemma 1. 1. There exist d12 ∈ (0, λc1] and d21 ∈ (0, λc2], such that P(d12,d21)D ∈PCD;
2. PCD ⊂PT .
In Lemma 1, we show that there exists policy P(d12,d21)D that satisfies Participation Condition 1
and that all such policies satisfy Threshold Condition 2 (All proofs are provided in Section 9). We
will then use policies from class PCD to illustrate our results and to obtain additional insights
in Section 5. Notice that the parameters S1 and S2 in this sample policy are fixed by the EDs
because these thresholds depend on the internal processes and policies of the EDs, EMS knows the
thresholds (S1 and S2) and controls the diversion rates (d12 and d21). Next, we analyze stochastic
properties of the number of patients in the system under any policy in class PT (including subclass
of policies PCD) to show that the class PT is Pareto improving.
4. Analysis of policy class PT
In order to show that the proposed class of coordination policies PT is Pareto improving, we will
first derive stochastic properties of the number of patients in the system under any coordination
policy Pi in class PT . First, we obtain stochastic ordering of the censuses:
Theorem 1. Under any coordination policy from class PT , the number of patients in each ED is
stochastically smaller than the number of patients in the ED without coordination:
LPik ≤st LPok ∀ k and hence, ~LPi ≤st ~LPo ∀ Pi ∈PT . (7)
Stochastically smaller census has two desirable properties, which follow from the definition of
stochastic ordering of random variables (Shaked and Shanthikumar 2007).
Property 1. Under any coordination policy from class PT , the expected number of patients at an
ED is lower than under non-coordinated policy Po:
E[LPik ]<E[LPok ] ∀ k and ∀ Pi ∈PT .
14
This property implies that the average number of patients at an ED that participates in coordi-
nation is smaller, and consequently, by Little’s Law, the average length of stay is lower. Guttmann
et al. (2011) show in an empirical study of ED visits that each hour of average waiting time increases
the risk of patient death within seven days following the ED visit. Assuming that the service time
remains the same regardless of the coordination policy, Property 1 implies that participating in
the coordination policy can decrease this mortality risk via decreasing the expected wait time.
Property 2. Under any coordination policy from class PT , the probability that the number of
patients in ED k is larger than a critical threshold Sk is lower than under non-coordinated policy
Po:
P [LPik ≥ Sk]<P [LPok ≥ Sk] ∀ k and ∀ Pi ∈PT .
This property guarantees that the probability of being overcrowded for each ED is lower if the ED
participates in the coordination policy Pi ∈PT . Overcrowding has been linked to higher mortality
rates (Sprivulis et al. 2006) and other adverse patient outcomes (McCarthy 2011); hence, reducing
the probability of being in an overcrowded state may lead to improved health outcomes, which is
beneficial for all players in the healthcare system.
Before presenting the results on the risk measures of the number of patients in the system
(variance and expected census conditional on being overcrowded) under our coordination policy, we
present a technique that allows us to prove the stochastic properties of the number of patients in the
ED. The technique, which was introduced in Do (2012), transforms a discrete random variable into
a continuous random variable that has several desirable properties. The transformation operator
T is defined as follows:
Definition 2. (Do 2012) Given a discrete (non-negative integer) random variable Y with mass
function pY (z) where z ∈Z+ (set of non-negative integers), T (Y ) is a non-negative continuous ran-
dom variable with the right-continuous density function defined as: pT (Y )(t) =∑z∈Z+
pY (z)Iz≤t<z+1,
where Iz≤t<z+1 is an indicator function that equals to 1 when z ≤ t < z+ 1, and 0 otherwise.
To gain intuition about the transformation operator T (·), consider a simple example: Y is a
discrete random variable with the following probability mass function:
pY (z) =
12
for z = 2,12
for z = 4,
0 o/w.
Let FY (z) =∑k≤z
pY (k) and FT (Y )(t) =t∫
0
pT (Y )(v)dv be the cumulative distribution functions of
Y and T (Y ). As illustrated in Figure 1, this technique simply transforms a point function (Figure
1(a)) into a step function (Figure 1(b)), which, in turn, implies that the cumulative probability of
15
1" 2" 3" 4" 5"
½""
p Y(z)
1"
z"
(a) Probability mass function of Y .
1" 2" 3" 4" 5"
½""
p τ(Y)(t)
1"
t"(b) Probability density function of T (Y ).
Figure 1 Illustrating the transformation T (Y ) on the probability function.
½""
F Y(z)
1"
z"1" 2" 3" 4" 5"
(a) Cumulative distribution function of Y .
½""F τ
(Y)(t)
1"
t"1" 2" 3" 4" 5"
(b) Cumulative distribution function of T (Y ).
Figure 2 Illustrating the transformation operator T (Y ) on the distribution function.
the transformed random variable T (Y ) (Figure 2(a)) is equal to the cumulative probability of Y
(Figure 2(b)) at all integer points.
The transformed random variable T (Y ) has the following properties:
Lemma 2.
T (Y +C) = T (Y ) +C; (8)
E[T (Y )] = E[Y ] +1
2; (9)
V [T (Y )] = V [Y ] +1
12; (10)
CEoverα [T (Y )] = E[T (Y )|T (Y )≥ V aRα[Y ]] =E[Y |Y ≥ V aRα[Y ]] +
1
2=CEover
α [Y ] +1
2(11)
∀ α= FY [w] for w ∈Z+, where Z+ is the set of non-negative integers.
Lemma 2 demonstrates that the expectation, variance, and the conditional expected census in
overcrowded state of the transformed continuous random variable are equal to the corresponding
measures of the original discrete random variable shifted by a constant (equalities 8, 9, and 10 were
shown in Do (2012)). Denote the difference between the expected census at ED k under policy
16
Po and census under policy Pi ∈PT as ∆k,i = E[LPok ]− E[LPik ]. From Property 1, we know that
E[LPik ]≤E[LPok ] and hence, ∆k,i ≥ 0 ∀ k and Pi. Let ~∆i represent the vector of ∆k,i ∀ k. Next, we
show that the transformed random variable T (Q) is smaller in convex order under the coordination
policy Pi:
Theorem 2. Under any coordination policy from class PT , the transformed number of patients
in the system is smaller in convex order than under non-coordinated policy Po:
T (LPik ) + ∆k,i <cx T (LPok ) ∀ k and hence, T (~LPi) + ~∆i ≤cx T (~LPo) ∀ Pi ∈PT .
Using Lemmas 2 and 3 and Theorem 2, we can now show that the number of patients in the ED
under any coordination policy from class PT has the following desirable property:
Property 3. Under any coordination policy from class PT , the number of patients in the ED is
less variable than under the non-coordinated control policy Po:
V [LPik ]<V [LPok ] ∀ k and ∀ Pi ∈PT .
This property implies that the number of patients in the ED under the coordination policy is less
“risky” and hence, any risk-averse player has an interest in switching to this policy provided that
the expected number of patients in the ED is the same or smaller, which follows from Property
1. Moreover, lower variability has multiple benefits for the healthcare system: EDs are better off
because they can plan their internal processes better, improve scheduling in the ED and have
less need for expensive surge and flexible capacity; in addition, ambulance crews can make better
routing decisions because the information signal has less noise.
The queueing system participants, however, are also interested in the reduction of the right tail of
the number of patients in the ED, hence, we now analyze the CEoverα , which measures the expected
number of patients in the ED conditional on being overcrowded, or in the states in which re-routing
decision may need to be made. First, we show the result for the transformed random variable
T (LPik ), which allows us to prove the further result for LPik . Following from Theorem 2 and using
the definition of convex order (Shaked and Shanthikumar 2007), we can immediately conclude that
the CEoverα of the transformed random variable T (Q) is smaller under any coordination policy from
class PT :
Lemma 3. Under any coordination policy from class PT , the expected transformed number of
patients in the ED in an overcrowded state is smaller than under non-coordinated control policy Poby more than the difference between the expected censuses at ED k under policies Po and Pi (∆k,i):
CEoverα [T (LPik )]<CEover
α [T (LPik ) + ∆k,i]−∆k,i ≤CEoverα [T (LPok )]−∆k,i,
∀ k,∀ Pi ∈PT , and ∀ α∈ [0,1).
17
Now, using Lemma 2, Theorem 2, and Lemma 3, we show the following property of the ED
census:
Property 4. Under any coordination policy from class PT , the conditional expected census in the
ED in overcrowded state is smaller than under non-coordinated control policy Po by more than the
difference between the expected census at ED k under policy Po and census under policy Pi (∆k,i):
CEoverα [LPik ]<CEover
α [LPok ]−∆k,i ∀ k,∀ Pi ∈PT , and ∀ α= FLPik
[w] for w ∈Z+.
This property demonstrates that for a service level α, which corresponds to an integer census, the
conditional expected census in the ED in the overcrowded state is smaller under any coordination
policy from class PT . Notice that CEoverα [LPik ] ≤ CEover
α [LPok ] relationship follows directly from
Theorem 1: Theorem 1 implies that LPik ≤icx LPok (Muller and Stoyan 2002) and CEover
α is monotone
in increasing convex order (which follows from the results on CV aRα shown in (Pflug 2000)).
However, the result in Property 4 is stronger and shows that the improvement in the conditional
expected census is greater than the difference in the expected census at ED k, ∆k,i.
Finally, using Properties 1, 2, 3, and 4 we conclude that the coordination policy PT is Pareto-
improving.
Theorem 3. For all Pi ∈PT ,Pi ≥P Po, hence, PT ⊆P.
Being a Pareto-improving policy implies that all service agents in the queueing system have an
incentive to participate in the coordination policy as the policy performs better according to all
performance measures defined in Section 2 and preserves the long-term average arrival rate. In
practice, EMS controllers and EDs can use our result as a decision tool for identifying whether a
particular control policy will be sustainable and will provide benefits to the healthcare system.
5. Numerical analysis
In this section, we present numerical experiments using sample policies from subclass PCD that
achieves the following goals: a) We demonstrate numerically that the performance of policies within
subclass PCD is monotonically increasing in the diversion rate d12 (and consequently, in d21) and
hence, we can provide guidance on how to select diversion rates for policies within this subclass; b)
We calculate the percentage improvement in performance measures for sample cases to assess the
magnitude of potential improvement from using proposed policy with Controlled Diversion; and
c) We compare the improvement that the EDs can attain from adopting a policy with Controlled
Diversion as opposed to improvement that can be attained from Full Diversion.
To evaluate our performance measurements numerically, we first have to find the stationary
distribution for the described process, which involves solving the Markov chain that captures
18
transitions in policies from subclass PCD. For a network with two EDs, the Markov chain represents
a 2-dimensional birth-death process. Let state (i, j) represent i (j) patients in ED 1 (2). An example
in Figure 3 depicts the transition diagram for this policy, with diversion thresholds S1 and S2 equal
to 2 at each ED. To find the stationary distribution, we use the approach summarized in Deo and
Gurvich (2011): we truncate the state space at large numbers Mk, such that if we increase the
state space to Mk + 1, the stationary probabilities change by no more than 10−4. The stationary
distribution obtained by solving the set of balance equations for the truncated Markov chain using
Matlab is representative of the actual stationary distribution.
0,3$
0,2$
0,1$
λ 2
1,3$
1,2$
0,0$
λ 2
λ1
1,1$
2,3$
2,2$
3,3$
3,2$
λ1
2,1$
λ1
2,0$
3,1$
3,0$1,0$
μ1 μ1
μ 2
μ1 μ1
μ1 μ1
μ1 μ1
λ1-d12
λ1
λ1
μ1
μ1
μ1
μ1
λ 2
λ 2-
d 21
λ1-d12
λ 2+
d 12
λ 2+
d 12
λ 2
λ 2+
d 12
λ 2-
d 21
λ1+d21 λ1+d21
λ 2+
d 12
λ 2
λ1+d21 λ1+d21
λ1
μ 2
μ 2
μ 2
μ 2
μ 2
μ 2
μ 2
μ 2
μ 2
μ 2
μ 2
λ 2
μ 2
λ 2-
d 21
μ 2
…%
…%
…%
…%
λ 2
μ 2
μ 2
λ1-d12
λ1
λ1
μ1
μ1
μ1
μ1
λ1-d12
…%
…%
…%
…%
λ 2-
d 21
Figure 3 The underlying Markov chain for a sample policy from subclass PCD with S1 = S2 = 2.
We use two numerical settings to calculate diversion rates and percentage improvement in per-
formance measures in Sections 5.1 and 5.2 : Setting 1 is motivated by the utilization data from the
City of Pittsburgh where average utilization is 67% and we set µ1 = µ2 = 1, λo1 = λo2 = 0.67. One
of the limitations of our work is that each ED is modeled as a single-server system, which implies
that for utilization of 67% the average census in the ED is about 2 people. Hence, in this numerical
setting we set the diversion thresholds relatively low: S1 = 3 and S2 = 2 so that we achieve states
with census above the crowding thresholds with non-negligible probability. In order to explore a
scenario where the thresholds can be set high, we introduce Setting 2, in which utilization is 90%3
3 90% utilization was also the highest value used for the cases with symmetric utilization in the numerical study of(Deo and Gurvich 2011)
19
(µ1 = µ2 = 1, λo1 = λo2 = 0.9), which implies the average census of 9 and we then set high thresholds:
S1 = 20 and S2 = 15. In our dataset, 24% of patients arrive by ambulance and we assume that
the controllable portion of arriving patients is bound by λci = λoi ∗ 24%. This implies λci = 0.16 for
i∈ {1,2} in Setting 1 and λci = 0.216 for i∈ {1,2} in Setting 2. Note that the diversion rate cannot
exceed the controllable portion of arrivals: dij ≤ λci .
5.1. Diversion rates that satisfy Participation Condition 1
We first need to identify pairs of diversion rates (d12, d21) that guarantee that policy P(d12,d21)D
satisfies Participation Condition 1 and hence, belongs to class PCD. In Setting 1 (2), we let d12 take
values from {0.04,0.08,0.12,0.16} ({0.06,0.11,0.16,0.21}) and then we find d21 such that Equation
6 is satisfied within 10−3 tolerance. We plot the resulting diversion rate from ED 2 to ED 1 as a
function of the diversion rate d12 in Figure 4 for Settings 1 (left pane) and 2 (right pane). The
squares represent the case with S1 = S2 where the diversion rates at both EDs are equal, which is
intuitive because the EDs are symmetric and they reach their diversion thresholds with the same
average frequency. The circles represent the asymmetric cases: when diversion threshold at ED 2 is
lower than at ED 1, ED 2 reaches the overcrowded state more frequently, and hence, to maintain
fairness in the arrival rate, the diversion rate from ED 2 to ED 1 should be set lower than the
diversion rate from ED 1 to ED 2. Clearly, d21 is increasing in d12 because as ED 2 receives more
diverted arrivals from ED 1, they reach their diversion threshold fast and need to divert more
frequently as well.
0.04 0.08 0.12 0.16Diversion Rate from ED 1 to ED 2
0.02
0.04
0.06
0.08
0.10
0.12
0.14
0.16
Diversion rate from ED 2 to ED 1
0.0178 0.0346
0.0513
0.0680
0.0400
0.0800
0.1200
0.1600
Setting 1: Utilization = 67%, S1=3, S2=2
0.06 0.11 0.16 0.21Diversion Rate from ED 1 to ED 2
0.03
0.06
0.09
0.12
0.15
0.18
0.21
Diversion rate from ED 2 to ED 1
0.0280 0.04900.0650
0.0950
0.0600
0.1100
0.1600
0.2100
Setting 2: Utilization = 90%, S1=20, S2=15
F1
Full Diversion No Diversion Our policy
0.65
0.70
Lambda:
2
5
E
0.4
0.6
Prob
10
30
Var
10
15
CVaR
Sheet 1F1 CVaR E Lambda: Prob Var
Full Diversion
No Diversion
Our policy 23.29
31.71
10.15
0.69
0.70
0.59
0.67
0.67
0.63
4.50
5.15
2.93
16.33
18.15
10.76
Sheet 5
Threshold settings:Asymmetric thresholds: S1 > S2 Symmetric thresholds: S1 = S2
Figure 4 Diversion rates that satisfy Participation Condition 1 under the policies in subclass PCD for Settings
1 and 2.
20
5.2. Comparison of policies within Controlled Diversion subclass PCD.
The numerical study in this subsection achieves two goals: a) it quantifies the magnitude of improve-
ment in performance measures that the system can achieve from adopting a policy with Controlled
Diversion, and b) it demonstrates that the improvement is monotonically increasing in diversion
rate, which provides an insight on how to set the diversion rates.
For each performance measure, we compute the percentage improvement relative to the No
Diversion policy (i.e. Po, which is equivalent to P(0,0)D ) as follows: for example, for Variance measure,
we calculateV [LPo
k]−V [L
P(d12,d21)D
k]
V [LPok
]for pairs of diversion rates (d12, d21) that were identified in Section
5.1. Similarly, for all other performance measures. We then show the sensitivity of the percentage
improvement relative to the diversion rate d12 in Figure 5 (top (bottom) pane illustrates Setting 1
(2) and left (right) pane illustrates ED 1 (2)). We observe that Policy P(d12,d21)D is improving over the
benchmark case Po for all tested pairs of (d12, d21) as is shown in our analysis in Section 4. Moreover,
the numerical results show that the improvement is monotone increasing in d12 and the highest is
achieved when d12 is at its highest feasible value: d12 = λc1. This observation obtained numerically
provides an important insight for policy designers: within the subclass PCD it is best to set one of
the diversion rates as high as possible (in this example, d12 = λc1) for any objective function that
satisfies the conditions outlined in Section 7.3. Notice that d21 will not be at its highest value of
d21 = λc2, but will be determined by Participation Condition 1. This is an important distinction of
our policy from the commonly used ambulance diversion policy (PFD =P(λc1,λc2)
D ) analyzed in Deo
and Gurvich (2011) with a single objective to minimize expected waiting time, which is not Pareto
improving and leads to the equilibrium solution of all-on-diversion or none-on-diversion and may
not be incentive compatible for the EDs.
5.3. Controlled Diversion versus Full Diversion
Next, we compare performance measures for each ED under our proposed Controlled Diversion
policy to No Diversion policy (Po) and to Full Diversion policy (PFD). For the numerical compar-
ison, we pick a case with asymmetric EDs from the numerical study of Deo and Gurvich (2011),
we match it on utilizations and on the controllable portion of arrivals and use it as our Setting 3.
Namely, the utilizations at EDs 1 and 2 are 0.846 and 0.954 correspondingly. To match utilization,
we set service rates at µ1 = µ2 = 1, and the arrival rates at λo1 = 0.846 and λ02 = 0.954 (the arrival
rates are different from the ones used in Deo and Gurvich (2011) because we model EDs a single-
server queues and hence, have to scale down their arrival rates used in the multi-server model by
the number of servers to match utilization). We match their assumption of 25% of patients arriving
by ambulance and hence, set λc1 = 25% ∗ 0.846 = 0.2115 and λc2 = 25% ∗ 0.954 = 0.2385. Finally, we
21
ED 1 ED 2
0.04 0.06 0.08 0.10 0.12 0.14 0.16Diversion Rate from ED 1 to ED 2
0.04 0.06 0.08 0.10 0.12 0.14 0.16Diversion Rate from ED 1 to ED 2
0%
10%
20%
30%
Per
cent
age
impr
ovem
ent i
npe
rform
ance
mea
sure
Setting 1
a.) Expected Censusb.) Probability of Overcrowding
c.) Variance of Censusd.) Expected Census in Overcrowded State
ED 1 ED 2
0.06 0.08 0.10 0.12 0.14 0.16 0.18 0.20Diversion Rate from ED 1 to ED 2
0.06 0.08 0.10 0.12 0.14 0.16 0.18 0.20Diversion Rate from ED 1 to ED 2
10%
20%
30%
40%
50%
Per
cent
age
impr
ovem
ent i
npe
rform
ance
mea
sure
Setting 2
Figure 5 Percentage improvement of performance measures for EDs 1 and 2 under Settings 1 and 2 implementing
policy from subclass PCD as a function of diversion rate d12.
match the diversion thresholds and set them at S1 = 14 and S2 = 11. For comparability, we keep
the diversion thresholds the same for both policies. Such comparison is practical when the EDs
truthfully share information with the EMS and set their thresholds, e.g. at capacity; the EMS then
compares the performance of Full Diversion and Controlled Diversion. Using our observation from
Section 5.2, we set the diversion rate d12 equal to its highest possible value: d12 = λc1 in Controlled
Diversion. First, we note that the expected arrival rate under Full Diversion is 0.87 to ED 1 and
0.92 to ED 2, which indicates that Full Diversion may have negative implications for the EDs: ED
2 may see reduced revenue and/or market share, ED 1 may see an undesired increase in patients
that exceeds planned capacity. Hence, Full Diversion Policy may be not incentive compatible for
the EDs to keep participating in the diversion system.
Next, we quantitatively assess performance of the EDs and show the result in Figure 6 where
we plot the performance measures for ED 1 (left pane) and ED 2 (right pane) under different
22
diversion policies: No Diversion policy Po in light-gray on the left, Full Diversion PFD in black
on the right, and Controlled Diversion P(0.2115,0.0135)D in dark-gray in the middle. We observe that
several performance measures, namely, expected census (pane a.) and probability of overcrowding
(pane b.) are higher for ED 1 than without diversion, which demonstrates that policy PFD is not
Pareto-improving and ED 1 may have an incentive to not participate in the diversion system. Notice
that Controlled Diversion mechanism, on the other hand, provides improvement on all performance
measures and preserves the expected arrival rate for both EDs. Moreover, for ED 1, Controlled
Diversion provides higher improvement in all performance measures.
6. Implementation
The sample policy presented in Section 3.1 may be hard to implement due to its probabilistic
nature: the policy prescribes to divert a rate of patients; however, it does not provide guidance
to the decision maker about what course of action to take with a particular patient. In practice,
there exist different diversion systems that try to control the rate: e.g. specifying a time interval,
during which to divert all patients and a time interval during which there can be no diversion,
implementing diversion for certain types of patients (e.g. divert all patients with neurological
problems) etc. Parameters of such policies are hard to calculate optimally and there is no empirical
evidence of success of such policies. Here, we propose an alternative sample policy, Policy IOU,
that is based on the idea of our threshold policy and is easy to implement in practice: this policy
deviates slightly from the definition of our proposed class PT , however, prescribes a specific course
of action in each state.
Consider a setting with 2 EDs with diversion thresholds S1 and S2. As opposed to the sample
policy described earlier in Section 3.1, we introduce an additional parameter: “IOU” counter K
that keeps track of the diverted patients in the following manner4. If a patient is diverted from ED
2 to ED 1, we increase the counter by one K =K + 1, which implies that ED 1 now “owes” one
more patient arrival to ED 2. Similarly, when a patient is diverted from ED 1 to ED 2, we decrease
the counter by one: K =K − 1, a negative K value implies that ED 2 “owes” one more patient to
ED 1. The ambulance crews can then use the census, diversion threshold, and counter information
as follows (the rules are summarized in Table 2): when only one of the EDs is overcrowded, all
controllable patients are diverted to the other ED and the counter is updated accordingly; when
both EDs are overcrowded, no diversion is made; when neither ED is overcrowded diversions are
made for all controllable patients according to the status of the counter: if K = 0, no diversion
is made, if K > 0, all controllable patients are diverted from ED 1 to ED 2, and if K < 0, all
4 In the case of three or more EDs, we will need to use multiple counters Kij .
23
ED 1 ED 2
NoDiversion
ControlledDiversion
FullDiversion
NoDiversion
ControlledDiversion
FullDiversion
a.) ExpectedCensus
b.) ProbabilityofOvercrowding
c.) Varianceof Census
d.) ExpectedCensus inOvercrowdedState
0.0
10.0
20.0
0.0
0.2
0.4
0.6
0.0
200.0
400.0
0.0
20.0
40.0
60.0
5.49 5.16 6.44
20.4518.66
7.56
0.080.130.10
0.580.59
0.26
28.19 34.3935.67
331.08
413.60
41.07
16.2818.49 17.76
66.9860.18
20.69
PolicyNo Diversion
Controlled Diversion
Full Diversion
Value as an attribute for each Policy broken down by ED vs. Measure. Color shows detailsabout Policy. The data is filtered on Utilization, which keeps Setting 3. The view is filtered onMeasure, which keeps d.) Expected Census in Overcrowded State, a.) Expected Census, b.)Probability of Overcrowding and c.) Variance of Census.
Figure 6 Performance measures for EDs 1 and 2 under Full Diversion policy (PFD), No Diversion policy (Po),
and Controlled Diversion policy with rates d12 = 0.16 and d21 = 0.03 (a policy from subclass PCD)
.
controllable patients are diverted from ED 2 to ED 1. Theoretically, as explained in Section 2.1,
number of “owed” patients can go to infinity: denoting number of “owed” patients at time t as
K(t), limt→∞|K(t)|=∞. If this is a concern in practice, practitioner can introduce an upper bound
on K(t): |K(t)| ≤M <∞, where M could be a large positive number. This policy can be refined
to accommodate multiple patient classes by introducing additional counters Kclass.
To confirm that this policy behaves similarly to the policy class analyzed above, we apply this
policy to numerical Setting 1 and plot the expected conditional arrival rate to EDs 1 (left pane)
24
ED 1 ED 2
-1 0 1 2 3 4 5 6 7 8 9 10Census
-1 0 1 2 3 4 5 6 7 8 9 10Census
0.55
0.60
0.65
0.70
0.75
Conditional Arrival rate
ED / Policy
ED 1 ED 2
Controlled diversion No Diversion Full Diversion Controlled diversion No Diversion Full Diversion
a.) Expectedarrival rate:
b.) ExpectedCensus:
c.) Probability ofOvercrowding:
d.) Variance ofCensus:
e.) ExpectedCensus inOvercrowdedState:
0.0
0.5
0
6
0.0
0.5
0
40
0
20
0.67 0.67 0.71 0.67 0.67 0.63
2.092.031.91
5.15
2.934.50
0.29 0.330.30
0.69 0.590.70
6.155.09 5.2710.15
23.2931.71
7.78 8.03 7.73
18.1516.3310.76
Comparison of three policies
Policy Controlled Diversion No Diversion Policy IOU
Figure 7 Expected arrival rate conditional on the ED census under Policy IOU (triangles) and Controlled Diver-
sion Policy P(d12=0.16,d21=0.068)D (squares) when λ1 = λ2 = 0.67, µ1 = µ2 = 1, S1 = 3, and S2 = 2.
ED 2Not crowded Overcrowded
(l2 <S2) (l2 ≥ S2)
ED 1Not crowded
(l1 <S1 )If K = 0, no diversion
If K > 0, divert from ED 1 to ED 2If K < 0, divert from ED 2 to ED 1
Divert from ED 2 to ED 1Update K=K+1
Rate to ED 1 is λ01 +λc2
Rate to ED 2 is λu2
Overcrowded(l1 ≥ S1 )
Divert from ED1 to ED2Update K =K − 1Rate to ED 1 is λu1
Rate to ED 2 is λ02 +λc1
Rate to ED1 is λo1Rate to ED2 is λo2
Table 2 Diversion decision rules under Policy IOU.
and 2 (right pane) in Figure 7 as a function of census in the ED and observe that it follows
a similar threshold pattern as the arrival rate in policy with Controlled Diversion5. Finally, we
compute all performance measures using Policy IOU and compare them to the benchmark policy
and the policy with Controlled Diversion for Setting 1 in Figure 8. Notice that performance under
Policy IOU deviates slightly from the performance under Controlled Diversion, but is comparable
in magnitude.
7. Policy applications and improvements
In this section, we discuss applications of our policy class PT to different realistic settings, discuss
improvements that can be made to an existing coordination policy to attain Pareto improvement,
and finally, introduce a subclass of policies PM within the class PT that Pareto-dominate all
policies in PT \PM .
5 Note that we do not have an analytical result that shows that the expected conditional arrival rate has thresholdstructure, rather we show it numerically in this section.
25
ED 1 ED 2
NoDiversion
ControlledDiversion
PolicyIOU
NoDiversion
ControlledDiversion
PolicyIOU
a.) ExpectedCensus
b.) Probability ofOvercrowding
c.) Variance ofCensus
d.) ExpectedCensus inOvercrowded State
0
1
2
0.0
0.2
0.4
0
2
4
6
0
2
4
6
8
1.8162.030
1.794 1.8362.030
1.664
0.277 0.2830.301
0.4250.449 0.438
4.269
6.152
4.331 4.6783.610
6.152
6.5177.030
5.5046.673
5.393
7.030
Figure 8 Performance measures under No Diversion, Controlled Diversion (P(d12=0.16,d21=0.068)D ), and Policy IOU
for Setting 1 (λ1 = λ2 = 0.67, µ1 = µ2 = 1, S1 = 3, and S2 = 2).
7.1. Multiple patient classes.
In the definition of our model, we assumed that there is one arrival stream of patients Λ. However,
this stream may consist of multiple classes of patients. For example, patients with different acuity
levels. To accommodate this practical setting, our definition of Pareto improvement can be extended
by creating duplicates of inequalities 1, 2, 3, and 4 for each patient class. For example, instead of
26
inequality 2, we will have P [LP1k,c ≥ Sk,c]≥ P [LP2k,c ≥ Sk,c] for each patient class c. Our result holds
after this modification and any policy in class PT will provide Pareto improvement according to
the modified definition.
7.2. Enhancing coordination policies.
Next, we show an auxiliary lemma that demonstrates how any Pareto improving coordination
policy implemented in practice can be enhanced to guarantee better performance.
Lemma 4. Let P1, P2 be two coordination policies satisfying Condition 1 and with the conditional
expected arrival rates sequence:{E[Λk( ~Q
P1)|Lk = l] = al
}∞l=0
and{E[Λk( ~Q
P2)|Lk = l] = bl
}∞l=0
,
where al = bl ∀ l ∈ {{0, .., n− 1}∪ {n+ 2, ..,∞}} and bn >an, then bn+1 <an+1 and P1 ≤P P2.
Lemma 4 shows that by re-allocating some conditional expected arrival rate towards a less
congested ED, a coordination policy P2 will Pareto-dominate policy P1. Using this re-allocation
principle from Lemma 4, next we define a class of policies PM that we will show to be a subset of
PT and to be Pareto-dominating over all policies in PT \PM .
Definition 3. Let PM be a set of coordination policies, where Participation Condition 1 holds
and the arrival rate to ED k satisfies the following condition:
Condition 3. Monotonicity Condition. Conditional expected arrival rate is monotone in the num-
ber of patients at ED k: For all k and P ∈PM , E[Λk(~LP)] = λok and
E[Λk(~LP)|Lk =m]=E[Λk(~L
P)|Lk =m+ 1] ∀ m= 0,1,2, ... (12)
and there exists m such that:
E[Λk(~LP)|Lk = m]>E[Λk(~L
P)|Lk = m+ 1] (13)
This condition implies that whenever the census at ED k increases, the conditional expected
arrival rate to ED k will decrease. This monotonicity condition is intuitive, however, such a policy
may be harder to implement in practice as compared to a threshold policy in class PT without
monotonicity. For example, in a case with symmetric EDs, join the shortest queue policy will satisfy
Monotonicity Condition 3.
Theorem 4. 1. PM ⊂PT ⊂P
2. For any policy P1 ∈PT \PM , there exists P2 ∈PM , such that P1 ≤P P2.
3. For any policy P1 ∈P \PT , there exists P2 ∈PT , such that P1 ≤P P2.
27
This theorem presents a strong nested classification of coordination policies: any Pareto improv-
ing policy with the monotone structure dominates policies that have threshold structure, but are
not monotone, which in turn, dominate all Pareto-improving policies that do not have threshold
structure. These policies along with the nested classification can be used as a decision tool by
EMS and EDs in constructing and improving coordination policies given their possible practical
constraints.
7.3. Composite performance measures.
We note that Pareto-improvement is a robust result that implies that any policy in class P is
beneficial for the EDs according to all measures defined in Section 2.1 treated equally and/or
according to any composite measure, which is a convex combination of the defined objectives
(expected census in the ED, probability of the census in the ED not exceeding a pre-specified
threshold, variance, and expected census conditional on being overcrowded) or even more generally,
any polynomial of the objectives with non-negative coefficients (weights). Furthermore, Pareto-
improvement also applies to any increasing function in each objective. As an example, consider
increasing cost functions that map our performance measures to operational cost of the ED k:
C1(E[LPk ]), C2(P [LPk ≥ Sk]), C3(V [LPk ]), and C4(CEoverα [LPk ]). We can create a composite cost mea-
sure by assigning different weights to the cost functions: C(LPk ) = γ1C1(E[LPk ]) + γ2C2(P [LPk ≥
Sk]) + γ3C3(V [LPk ]) + γ4C4(CEoverα [LPk ]). Consider two coordination policies P1 and P2, P1 ≤P P2
will imply C(LP1k ) ≤ C(LP2k ). Each ED can have a different set of weights and/or different cost
functions Ci(·). For a given composite measure, each ED can find a specific policy within class PM
that will be optimal for the particular ED.
7.4. What performance improvements do patients see?
Notice that we performed our analysis from the perspective of time-average distribution. If we
take the patient perspective, the distribution observed by patients differs from the time-average
because the arrival stream to each ED is an outcome of a coordination policy and hence, it is
no longer Poisson and PASTA (Poisson arrivals see time-average) does not apply. Consider the
flow of virtual patients who arrive to ED k according to the Poisson process with rate λok, but
only observe the system, never enter the system as opposed to the actual patients. Since virtual
patients arrive according to a Poisson process, they observe the time-average distribution πPik (n). In
healthcare practice, for example, The Joint Commission sends their inspectors to evaluate hospitals
and EDs based on the set of performance measures that includes observing congestion in the ED.
Assuming that the inspectors arrive according to the Poisson process, the performance observed
and reported by the inspectors, which is used to evaluate the ED performance, can be seen as
28
performance observed by the virtual customers. Hence, all measures calculated based on the time-
average distribution are representative of the ED perspective.
To evaluate the performance improvement observed by patients, first, we compare the time-
average distribution observed by the virtual patients, πPik (n), to the distribution observed by actual
patients, πPik (n), where n represents an aggregate state: probability that the number of patients
in ED k is n aggregating over the census possibilities at the other EDs. Then, we establish a
relationship between the two distributions. Let λPik (n) be the arrival rate while there are n patients
at ED k. First, we state the following relationship:
Lemma 5. πPik (n) =λPik
(n)
λokπPik (n) ∀ Pi, l, and k. 6
This intuitive relationship between the distributions allows us to show in the next Theorem
that the patients will see a distribution of the number of patients in the system that is better
than the stationary distribution in the analysis when a policy Pi ∈ PT is implemented because
the distribution observed by arriving patients is more balanced: patients are less likely to arrive
(because of lower arrival rates) during crowded times and more likely to arrive (because of larger
arrival rates) during non-crowded times. Before stating the next result, we define Pareto dominance
from patients’ perspective, where we use LPik to denote the number of patients in the system
observed by patients:
Definition 4. Pareto Dominance from patients’ perspective. A policy P2 is Pareto improving from
patients’ perspective over policy P1, P2 ≥P P1 if it satisfies inequalities 1, 2, 3, and 4, where all
the performance measures are defined using the distribution observed by patients (E[LPik ], V [LPik ],
P [LPik >Sk], and CEoverα [LPik ]) and at least one of these inequalities is strict.
Theorem 5. 1. The number of patients observed by patients is stochastically smaller than the
time-average number of patients in the ED: LPik ≤st LPik ∀ Pi ∈PT and ∀ k;
2. T (LPik ) + ∆Pik ≤cx T (LPik ) ∀ Pi ∈PT and ∀ k, where ∆Pik =E[LPik ]−E[LPik ];
3. For any policy P1 ∈PT \PM , there exists P2 ∈PM , such that P1 ≤P P2.
4. For any policy P1 ∈P \PT , there exists P2 ∈PT , such that P1 ≤P P2.
As a result, all the aforementioned measures as experienced by the actual patients will be lower
than the same measures calculated using the time-average distribution. Hence, the results in Theo-
rems 3 and 4 hold from the patients’ perspective as well as from the EDs’ perspective. Consequently,
the performance improvements perceived by patients are even higher than the improvements from
the ED’s perspective.
6 This relationship can be obtained using Palm Theory (see e.g. Shanthikumar and Zazanis (1999)), however, wepresent a simple intuitive proof using a counting argument in the Appendix, which is specific to our modeling setup.
29
7.5. Extension to Markov Modulated Poisson Processes
In this section, we demonstrate that our analytical model and approach can be extended to the
case where the total arrival process is modulated by some exogenous Markov process, i.e., Markov
Modulated Poisson Process (MMPP). For simplicity, we assume that there are two states of the
world: Let Z indicate the random variable that evolves between the two states zL and zH with γLH
(γHL) indicating transition probability from zL (zH) to zH (zL).
When there is no flow rate control between the EDs, we denote λk(Z|Z = zL) with λLk and
λk(Z|Z = zH) with λHk . The total arrival rates in each state are then λL =∑N
k=1 λLk and λH =∑N
k=1 λHk , where λLk (λHk ) consists of λLuk (λHuk ) and λLck (λHck ) for uncontrollable and controllable
streams of patients, similarly to our main model description. Let (lk,Z) be the aggregate state of
the system where the number of patients in the ED k is lk and the state of nature is Z taken across
all other patient numbers at the other EDs. As compared to the Markov diagram with Poisson
arrival rate analyzed in the main body of the paper (see Figure 9(a)), the diagram of the system
with MMPP will change as plotted in Figure 9(b).
l+1!l!
λk λkλk
µk µk µk
(a) Poisson arrival process.
l+1,"ZH"l,"ZH"
l+1,"ZL"l,"ZL"
λkH λk
HλkH
λkL λk
LλkL
µk
µk
µk
µk
µk
µk
γ LH γ LHγ HLγ HL
(b) MMPP arrival process.
Figure 9 Illustrating the change in Markov diagram for an ED with the MMPP arrival process.
Now, it is easy to see that if we adjust the participation condition to take expectation over all
states in Z: E[λk(~L(Pi),Z)] = λok, all the remaining analysis will remain the same. Hence, we can
define policy classes PT and PM with this adjusted participation condition and attain the same
results on Pareto improvement. In implementation, however, the central controller will now have
30
more control parameters and can decide the values of arrival rate to each ED given the state is zL
or zH . For example, in our sample policy PS, the controller will have to specify diversion rates for
each state of the world: dL12, dH12, dL21, and dH21.
As a special case of the MMPP described above, we can extend our result to a process where
the state of the world is represented by the time of day: the arrival rate changes throughout
the day, which is commonly observed in healthcare and ED applications. In such case, the state
transitions are controlled by the clock, rather than by transition probabilities γi and if we adjust
the participation condition accordingly, we will again be able to show that policy classes PT and
PM with the modified condition result in Pareto improvement for all participants.
8. Conclusion
ED overcrowding in the U.S. is a growing problem that requires robust solutions. Many EMS
across the U.S. are attempting to find a solution; for example, EMS policies in Kansas have a
sophisticated diversion system that allows EDs to announce diversion for specific types of patients
(for example, closed to trauma patients). In some counties in California, EMS imposes time limits
on diversion status; several EMS systems, including that of the City of Pittsburgh in 2004, use
multiple level diversion signals based on the extent of crowding in the EDs. However, there is no
consensus on what policy is best and no systematic way to evaluate and compare such policies.
We use this issue to motivate a general problem: how to coordinate flow in a queueing system with
independent agents that can decide whether to participate in the coordination system or not, that
have multiple performance measures, and that receive revenue based on the processed load.
We find classes of coordination policies that are Pareto improving for all agents in the system:
under the coordination policies within our defined classes, all agents receive the same expected
load, they see stochastically smaller number of patients with smaller variance and lower expected
number of patients in overcrowded situations. This analytical result proposes a convenient decision
tool that can help queueing system coordinators (such as EMS in healthcare practice) compare
and select coordination policies that guarantee performance improvement according to multiple
measures. We demonstrate that our solution is robust: our results hold under multiple patient
classes, under different composite measures, from the patients’ perspective, and under Markov
Modulated Poisson Process. Moreover, our result on policy improvements (Lemma 4) provides a
tool to refine any existing coordination policy, the performance of which will Pareto-dominate the
existing policy.
Numerous extensions of our work are possible by relaxing assumptions of the model, considering
multiple-server system, considering non-stationary arrival process, etc. Here, we also suggest a
direction that may lead to immediate practical implications for queueing system coordination. The
31
focus of our work has been on identifying a class of policies that provide Pareto improvement. Once
the participants select their performance measures, each participant (including service agents and
the system coordinator) can find an optimal policy within the defined classes. For example, in our
sample policy Ps, the service agents (EDs) select the diversion thresholds, while the coordinator
(EMS) selects the diversion rates. This problem may be formulated as a repeated game to find the
optimal policy for the EMS for the selected set of performance measures.
9. Mathematical Proofs of Results
Proof of Lemma 1 1. First, we show existence of d12 and d21:
The expected arrival rate to ED 1, E[λ1(d12, d21, ~LP(d12,d21)D )], is continuous in both d12 and d21.
Evaluating E[λ1(d12, d21, ~LP(d12,d21)D )] at three extreme points:
(a) When there is no coordination: E[λ1(0,0, ~LPo)] = λo1;
(b) ED 2 diverts all to ED 1 when it is crowded, but ED 1 never diverts to ED 2:
E[λ1(0, λc2, ~LP
(0,λc2)
D )]>λo1.
(c) ED 1 diverts all to ED 2 when it is crowded, but ED 2 never diverts to ED 1:
E[λ1(λc1,0, ~LP
(λc1,0)
D )]<λo1.
Note that E[λ1(λc1, λc2, ~L
P(λc1,λ
c2)
D )] could be larger, equal or smaller than λo1.
Since E[λ1(d12, d21, ~LP(d12,d21)D )] is continuous in both d12 and d21, E[λ1(d12, d21, ~L
P(d12,d21)D )] is a
surface w.r.t d12 ∈ [0, λc1] and d21 ∈ [0, λc2]. Consider a fragment of the plane: λ1(d12, d21, ~LP(d12,d21)D ) =
λo1 for d12 ∈ [0, λc1] and d21 ∈ [0, λc2], from the three points above, the surface has to intersect with
the part of the plane and the intersection must be a non-trivial continuous curve on the plane with
one point of (0,0).
Let this curve be C and define: d12 = sup{d12 : (d12, d21)∈C} and d21 = sup{d21 : (d12, d21)∈C}.
Clearly, d12 > 0 and d21 > 0 and there is no point on C other than (0,0) such that d12 = 0 or d21 = 0.
By the continuity of the curve, for each d12 ∈ [0, d12], there exists some d21 ∈ [0, d21] such that
E[λ1(d12, d21, ~LP(d12,d21)D )] = λo1.
Furthermore, if E[λ1(λc1, λc2, ~L
P(λc1,λ
c2)
D )] > λo1, since E[λ1(λc1,0, ~LP
(λc1,0)
D )] < λo1, there exists d21 < λc2
such that E[λ1(λc1, d21, ~LP
(λc1,d21)
D )] = λo1.
Similarly, if E[λ1(λc1, λc2, ~L
P(λc1,λ
c2)
D )] < λo1 then E[λ2(λc1, λc2, ~L
P(λc1,λ
c2)
D )] > λo2, there exists d12 < λc1
such that E[λ2(d12, λc2, ~L
P(d12,λ
c2)
D )] = λo2 (and hence, E[λ1(d12, λc2, ~L
P(d12,λ
c2)
D )] = λo1).
2. Let π(m, l) represent the stationary probability of being in state (m, l). By definition,
E[λk(~LP(d12,d21)D )] = λok ∀ P
(d12,d21)D ∈PCD. For ED 1, the conditional expected arrival rate when the
number of patients in the system is less than S1 is:
E[λ1(~LP(d12,d21)D )|L1 =m;m<S1] =
1
Pr{L1 =m}
∞∑l=0
π(m, l)λ1(m, l)
32
=1
Pr{L1 =m}
(S2−1∑l=0
π(m, l)λo1 +∞∑l=S2
π(m, l)(λo1 + d21)
)
≥ 1
Pr{L1 =m}
(S2−1∑l=0
π(m, l)λo1 +∞∑l=S2
π(m, l)(λo1)
)
≥ λo1Pr{L1 =m}
(S2−1∑l=0
π(m, l) +∞∑l=S2
π(m, l)
)= λo1
The equality occurs if d21 = 0, i.e., there is no diversion from ED 2 to ED 1. Similarly, we can
establish that E[λ1(~LP(d12,d21)D )|L1 =m;m≥ S1]≤ λo1. The proof for ED 2 is similar.
Proof of Theorem 1 We present the following proof for a pure policy, hence we use E[λk(~LPi)]
instead of E[Λk(~LPi)]. The proof can be easily modified for a non-pure policy.
Let ~l−k denote the vector of censuses with k-th component removed. Denote the arrival rate
to ED k with λk[~l|lk = m] = λk(m,~l−k), given the current census of this ED is m and the corre-
sponding stationary probability is π(m,~l−k). We use πk(m) =∑~l−k
π(m,~l−k) to denote the stationary
probability of having m patients in ED k aggregated across all censuses at other EDs.
Consider the N − 1 dimension hyperplane that separates the aggregate states (lk = m) and
(lk = m + 1)). The rate of up-crossing (transitioning from states with (lk 6 m) to states with
(lk >m+ 1)) is only from some state in the hyperplane (lk =m), which is:∑~l−k
λk(m,~l−k)π(m,~l−k)
The down-crossing (transitioning from states with (lk >m+ 1) to states with (lk 6m)) of this
hyperplane is only from the hyperplane (lk = m + 1) and the down-crossing rate is:∑~l−k
µk(m +
1,~l−k)π(m+ 1,~l−k) = µk∑~l−k
π(m+ 1,~l−k) = µkπk(m+ 1).
When the system is stationary, the up-crossing rate must be equal to the down-crossing rate, so
we have: ∑~l−k
λk(m,~l−k)π(m,~l−k) = µkπk(m+ 1)
Expanding πk(m), we have:∑~l−k
λk(m,~l−k)π(m,~l−k) = πk(m)1∑
~l−k
π(m,~l−k)
∑~l−k
λk(m,~l−k)π(m,~l−k) =
πk(m)∑~l−k
λk(m,~l−k)P (~L|Lk =m) = πk(m)E[Λk(~L)|lk =m]
Thus,
πk(m+ 1) = πk(m)E[λk(~L)|Lk =m]
µk
33
Now, consider the probability of having no patient in ED k, for an ergodic system under a policy
P, we have using Little’s Law:
πPk (0) = 1− E[λk(~LP)]
µk
Since we have: E[λk(~LPi)] =E[λk(~L
Po)] = λ0k, we have:
πPik (0) = πPok (0) = πk(0) = 1− λ0kµk
.
Let δPk (m) = πPk (m+ 1)− λokµkπPk (m) = πPk (m)
(E[λk(~LP )|Lk=m]−λok
µk
). Thus, πPk (m+ 1) =
λokµkπPk (m) +
δPk (m).
From Condition 2, for all Pi ∈PT :
E[λk(~LPi)|Lk = l]
{≥ λok, ∀ l ≤ m≤ λok, otherwise.
So, δPok (m) = 0 ∀ m∈Z+, and
δPik (m)
{≥ 0, ∀ m ≤ m< 0, otherwise.
We can show using contradiction that:
∃ l∗ s.t. πPik (m)
{≥ πPok (m), ∀ m ≤ l∗
≤ πPok (m), otherwise.
Because πPik (0) = πPok (0) = πk(0) and πk(m+ 1) = πk(m)E[Λk(~Q)|Lk=m]
µkfor any stationary control
policy, we have: πPik (1) =λokµkπPik (0) + δPik (0)>
λokµkπPik (0) =
λokµkπPok (0) + δPok (0) = πPok (1). Recursively,
∀ m ≤ m, πPik (m)>πPok (m).
Now we will prove that there must be a unique finite number l∗ such that πPik (l∗)> πPok (l∗) and
πPik (l∗+1)6 πPok (l∗+1) where one of the inequalities must be strict. First, there must exist a finite
l such that πPik (l + 1) 6 πPok (l + 1). Suppose not, then∑∞
m=0 πPik (m) >
∑∞m=0 π
Pok (m) = 1 which
is a contradiction. Next, we define: l∗ = min{l : πPik (l + 1) 6 πPok (l + 1) } and by this definition,
πPik (l∗)> πPok (l∗).
Apparently, we can see that l∗ > m, so δPik (m)< 0 ∀ m> l∗. Hence, πPik (m)< πPok (m)∀m>l∗ + 1
and that means the number l∗ is unique.
So, the interpolated line of the PDF of LPik crosses that of LPok from above at only one point. Now,
we prove that LPik ≤st LPok ∀ k using one of its definitions:
∑l
m=0 πPik (m)>
∑l
m=0 πPok (m) ∀ l . We
know from the above that∑l
m=0 πPik (m)>
∑l
m=0 πPok (m) ∀ l6 l∗. Suppose there exists a finite
m∗ ,min{l :∑l
m=0 πPik (m)<
∑l
m=0 πPok (m)}, then we must have m∗ > l∗+ 1 and hence, πPik (m)6
πPok (m)∀m>m∗. Consequently, the following must hold:
1 =∑∞
m=0 πPik (m) =
∑m∗
m=0 πPik (m) +
∑∞m=m∗+1 π
Pik (m) <
∑m∗
m=0 πPok (m) +
∑∞m=m∗+1 π
Pok (m) = 1,
which is a contradiction. �
Proof of Lemma 2 Show the shift in mean:
E[T (Y )] =
∞∫0
tpT (Y )(t)dt=
∞∫0
∞∑z=0
pY (z)Iz≤t<z+1tdt
34
Using the monotone convergence theorem, we can interchange the summation and integration:
E[T (Y )] =
∞∫0
∞∑z=0
pY (z)Iz≤t<z+1tdt=∞∑z=0
z+1∫z
tpY (z)dt=∞∑z=0
pY (z)
z+1∫z
tdt=∞∑z=0
pY (z)
(z+
1
2
)=
=∞∑z=0
pY (z)z+1
2=E[Y ] +
1
2
Similarly, we show the shift in the second moment:
E[(T (Y ))2] =∞∑z=0
z+1∫z
t2pY (z)dt=∞∑z=0
pY (z)
z+1∫z
t2dt=∞∑z=0
pY (z)
1∫0
(z+u)2du=
=∞∑z=0
pY (z)
(z2 + z+
1
3
)=E[Y 2] +E[Y ] +
1
3
Now, we can represent variance as:
V [T (Y )] =E[(T (Y ))2]−E2[T (Y )] =E[Y 2] +E[Y ] +1
3− (E[Y ] +
1
2)2 =
=E[Y 2] +E[Y ] +1
3−E2[Y ]−E[Y ]− 1
4=E[Y 2]−E2[Y ] +
1
12= V [Y ] +
1
12
Let α= FY [K] for some K ∈Z+ and note that the CDFs of T (Y ) and Y are equal at K. Then,
we can re-write CEoverα [T (Y )] as:
CEoverα [T (Y )] =E[T (Y )|T (Y )≥K] =
∞∫0
tpT (Y )(T (Y ) = t|T (Y )≥K)dt=
=
∞∫0
∞∑z=0
pY (z|Y ≥K)Iz≤t<z+1tdt=∞∑z=K
z+1∫z
tpY (z)
P (Y ≥ z)dt=
1
P (Y ≥ z)
∞∑z=K
pY (z)
z+1∫z
(t)dt=
=1
P (Y ≥ z)
∞∑z=K
pY (z)
(z+
1
2
)=
1
P (Y ≥ z)
∞∑z=K
(pY (z)z+
pY (z)
2
)=
=∞∑z=K
(pY (z)z
P (Y ≥ z)+
pY (z)
2P (Y ≥ z)
)=E[Y |Y ≥K] +
1
2=CEover
α [Y ] +1
2�
Proof of Theorem 2 Let FX(t) be the cumulative probability function of random variable X.
From Property 1, we have E[LPik ]≤ E[LPok ], combining it with property (2) from Lemma 2, we
get: ∆k,i =E[LPok ]−E[LPik ] =E[LPok ] + 1/2−E[LPik ]− 1/2 =E[T (LPok )]−E[T (LPik )].
Let X i = T (LPik ) + ∆k,i and Xo = T (LPok ). Thus, from the above: E[X i] = E[T (LPik )] + ∆k,i =
E[T (LPok )] =E[Xo]. This means that X i and Xo have the same mean.
Since FXi(t) = 0 < FXo(t) for 0 ≤ t ≤ ∆k,i and E[X i] = E[Xo], the curve of FXi(t) must cross
FXo(t) at some point t1 >∆k,i, otherwise FXi(t)≤ FXo(t) ∀ t≥ 0, i.e. X i >st Xo and this implies
E[X i]>E[Xo], which is a contradiction.
35
In the proof of Theorem 1, we have shown that there exists a finite integer l∗, such that πPik (m)>
πPok (m) ∀ m≤ l∗ and πPik (m)≤ πPok (m) ∀ m≥ l∗ + 1. This means there must also exist a point t
such that the slope of FXi(t) is larger than that of FXo(t) for t≤ t and is smaller otherwise. It can
be seen that t1 < t.
We argue that the curve of FXi(t) crosses that of FXo(t) at only one point, i.e., the point t1
is unique, and it is from below, by contradiction. Suppose FXi(t) crosses FXo(t) at a point t > t
from above then for t≥ t, the slope of FXi(t) must be smaller than the slope of FXo(t), i.e. t is the
last point FXi(t) crosses FXo(t). But when FXo(t)−FXi(t) is increasing in t for t≥ t there exists
t2 > t and ε > 0 such that FXo(t)−FXi(t)≥ ε ∀ t≥ t2. Therefore, 0< ε≤ limt→∞
(FXo(t)−FXi(t)) =
limt→∞
FXo(t)− limt→∞
FXi(t) = 1− 1 = 0, which is a contradiction.
According to Muller and Stoyan (2002), if ∃ t∗ s.t. FX(t)≤ FY (t) ∀ t≤ t∗ and FX(t)≥ FY (t) ∀ t≥t∗, then Y ≥icxX. Hence, we have established that X i is less than Xo in increasing convex order:
X i ≤icx Xo. Combining with the fact E[X i] = E[Xo], we have: X i ≤cx Xo, i.e., T (LPik ) + ∆k,i ≤cxT (LPok ). The last part of the theorem in vector forms follows immediately. �
Proof of Property 3 From Theorem 2, for all k and i s.t. Pi ∈PT , we have T (LPik ) + ∆k,i ≤cxT (LPok ). By definition of convex order, it implies: V [T (LPik ) + ∆k,i] ≤ V [T (LPok ))]. Hence, using
Lemma 2, we have:
V [LPik ] = V [T (LPik )]− 1
12= V [T (LPik ) + ∆k,i]−
1
12≤ V [T (LPok )]− 1
12= V [LPok ] �
Proof of Lemma 3 From Theorem 2, for all k and i s.t. Pi ∈PT , we have T (LPik ) + ∆k,i ≤cxT (LPok ). Since CV aR− is a convex measure (Uryasev 2000), the same applies to CEover
α , this
implies CEoverα [T [LPik ] + ∆k,i]≤CEover
α [T [LPok ]]. Also, CV aR− is translation-equivariant (Uryasev
2000), thus: CEoverα [T [LPik ] + ∆k,i] =CEover
α [T [LPok ]] + ∆k,i . �
Proof of Property 4 From Theorem 2, for all k and Pi ∈PT , we have T (LPik )+∆k,i ≤cx T (LPok ).
Using Lemmas 2 and 3, ∀ α= FLPik
[K],K ∈Z+, we have:
CEoverα [LPik ] =CEover
α [T [LPik ]]− 1/2 =
CEoverα [T [LPik ] + ∆k,i]−∆k,i− 1/2≤CEover
α [T [LPok ]]−∆k,i− 1/2 (14)
Let α+ , min{γ : γ ≥ α and γ = FLPok
[D],D ∈ Z+}. Since LPok is a discrete random variable,
CEoverα [Lok] =CEover
α+ [LPok ]. Let K1 = V aRα[T [LPok ]] and K2 = V aRα+ [T [LPok ]]. By definition of α+,
K2 ∈Z+ and K2− 1<K ≤K2.
CEoverα [T (LPok )] =E[T (LPok )|T (LPok )≥K1] =
1
P{T (LPok )≥K1}
∞∫K1
tdFT (LPok
)(t)
=1
P{T (LPok )≥K1}
∞∫K1
tpT (LPok
)(t)dt=
1
P{T (LPok )≥K1}
K2∫K1
tpT (LPok
)(t)dt+
∞∫K2
tpT (LPok
)(t)dt
36
Similarly,
CEoverα+ [T (LPok )] =
1
P{T (LPok )≥K2}
∞∫K2
tpT (LPok
)(t)dt
Note that pT (LPok
)(t) =
∞∑z=0
pLPok
(z)Iz≤t<z+1, so pT (LPok
)(t) = p
LPok
(K2 − 1) ∀ t ∈ [K2 − 1,K2]. Let
A, pLPok
(K2− 1). Then,K2∫K1
tpT (LPok
)(t)dt=A
K2∫K1
tdt=AK2
2−K21
2.
We introduce the following notation to simplify the exposition of the rest of the proof: Let
P{K1 ≤T (LPok )≤K2}=A1 and P{T (LPok )≥K2}=B and∞∫K2
tpT (LPok
)(t)dt=C. Now, P{T (LPok )≥
K1}= P{K1 ≤T (LPok )≤K2}+P{T (LPok )≥K2}=A1 +B.
We can rewrite: CEoverα [T (LPok )] = 1
A1+B
(AK2
2−K21
2+C
)and CEover
α+ [T (LPok )] = CB
. We will
show that CEoverα [T (LPok )] ≤ CEover
α+ [T (LPok )]. Multiplying both sides by (A1 + B)B, we get:(AK2
2−K21
2+C
)B ≤C(A1 +B)⇔
(AK2
2−K21
2
)B ≤CA1⇔ 1
2(K2
2 −K21 )B ≤ A1
AC.
It is obvious that A1A
= K2−K1K2−(K2−1)
=K2−K1, thus, the above inequality is equivalent to: 12(K2 +
K1)B ≤C.
Now, we have C =∞∫K2
tpT (LPok
)(t)dt >
∞∫K2
K2pT (LPok
)(t)dt = K2
∞∫K2
pT (LPok
)(t)dt = K2P{T (LPok ) ≥
K2}=K2B > 12(K2 +K1)B.
So, we have CEoverα [T (LPok )] ≤ CEover
α+ [T (LPok )] and note that the equality holds if and only if
K1 = V aRα[T (LPok )] =K2 ∈Z+.
From Equation 14, we obtain:
CEoverα [LPik ]≤CEover
α [T (LPok )]−∆k,i−1
2≤CEover
α+ [T (LPok )]−∆k,i−1
2=
CEoverα+ [LPok ]−∆k,i =CEover
α [LPok ]−∆k,i �
Proof of Lemma 4 Since both P1 and P2 satisfy Participation Condition 1, we have: πP1k (0) =
πP2k (0) =λokµk
. From the balance equations as shown in the proof of Theorem 1, we have:
πP1k (l) = πP1k (0) 1µk
Πlj=1aj−1 and πP2k (l) = πP2k (0) 1
µkΠlj=1bj−1
Thus, πP1k (l) = πP2k (l) ∀ l ∈ {0, .., n}. Now, consider the balance equations, we have: πP2k (n+ 1) =
bnµkπP2k (n)> an
µkπP2k (n) = an
µkπP1k (n) = πP1k (n+ 1).
We will show that bn+1 < an+1 by contradiction. Suppose bn+1 ≥ an+1 then πP2k (n + 2) =bn+1
µkπP2k (n+ 1)≥ an+1
µkπP2k (n+ 1)>
an+1
µkπP1k (n+ 1) = πP1k (n+ 2).
This implies πP2k (l)> πP1k (l) ∀ l≥ n+ 3 and hence,∞∑l=0
πP2k (l)>∞∑l=0
πP1k (l) = 1, which contradicts
the fact that∞∑l=0
πP2k (l) = 1. Thus, bn+1 < an+1 and πP2k (n + 2) < πP1k (n + 2). Since al = bl ∀ l ∈
0, .., n− 1∪n+ 2, ..,∞, using the balance equation as in the proof of Theorem 1, it implies that
πP2k (l)< πP1k (l) ∀ l ≥ n+ 2. This means that the cdf of LP2k crosses the cdf of LP1k from above at
37
only 1 point. Following the same method as applied in the proofs of Theorems 1 and 2, we conclude
that P1 ≤P P2. �
Proof of Theorem 4 (i) From the monotonicity of the conditional expected arrival rate of a
policy Pi in PM (equation 12) we have: E[Λk(~LPi)|Lk = 0] > E[Λk(~L
Pi)] = λok. Otherwise, if
E[Λk(LPi)|Lk = 0]≤ λok, then E[Λk(L
Pi)] = E(Lk)E[Λk(LPi)|Lk]<E[Λk(L
Pi)|Lk = 0]≤ λok, which is
a contradiction. Also, there must exist a finite m∗ such that:{E[Λk(~L
Pi)|Lk = l]>λok, ∀ l ≤ mE[Λk(~L
Pi)|Lk = l]≤ λok, otherwise.
Otherwise, if there is no such finite m∗, then E[Λk(LPi)|Lk = l]≥ λok ∀ l(with at least some strict
inequality), and hence, E[Λk(LPi)] =ELk [E[Λk(L
Pi)|Lk]]>λok, which is a contradiction.
Thus, Pi ∈ PT and hence, we have established that PM ⊂ PT . Combined with the result of
Theorem 3, we have PM ⊂PT ⊂P .
(ii) and (iii) follow directly from Lemma 4. �
Proof of Lemma 5 We take the perspective of one ED (hence, we suppress the subscript k in
the following proof), where l represents the aggregate state: number of patients in ED k taken
across all other censuses at other EDs. Let the number of virtual patients up to time t be NPiv (t)
and the number of actual patients be NPi(t). Observe, that both numbers have to satisfy the
following equality: limt→∞
NPiv (t)
t= lim
t→∞NPi (t)
t= λo. πPi(l) is then the percentage of time the system
spends in state l; during this time, the arrival rate is λPi(l). Let NPil (t) be the number of patients
arriving when the queue size is l counted up to time t. So, we have:∞∑l=1
NPil (t) =NPi(t).
Up to time t, the fraction of patients observing l number of patients in the system is:NPil
(t)
NPi (t)
and hence, πPi(l) representing the percentage of patients who observe l number of patients in the
system is: πPi(l) = limt→∞
NPil
(t)
NPi (t).
πPi(l) = limt→∞
NPil
(t)
NPi (t)= lim
t→∞
NPil
(t)
t
NPi (t)t
=limt→∞
NPil
(t)
t
limt→∞
NPi (t)t
. Note that limt→∞
NPi (t)t
= λo and hence, πPi(l) =
1λo
limt→∞
NPil
(t)
t.
Let tl be the total time the system (under control policy Pi) spends in aggregate state l. From
the time-average distribution πPi(l), we get: limt→∞
tlt
= πPi(l). So,
πPi(l) =1
λolimt→∞
NPil (t)
t=
1
λolimt→∞
NPil (t)
tl
tlt
=1
λolimt→∞
NPil (t)
tllimt→∞
tlt
Since limt→∞
NPil
(t)
tl= λPi(l) and lim
t→∞tlt
= πPi(l), πPi(l) = λPi (l)λo
πPi(l). �
38
References
Burke, L. G., N. Joyce, W. E. Baker, P. D. Biddinger, K. S. Dyer, F. D. Friedman, J. Imperato, A. King,
T. M. Maciejko, M. D. Pearlmutter, A. Sayah, R. D. Zane, S. K. Epstein. 2013. The effect of an
ambulance diversion ban on emergency department length of stay and ambulance turnaround time.
Annals of Emergency Medicine 61(3) 303–311.
California Healthcare Foundation. 2009. Reducing ambulance diversion in California: Strategies and best
practices. Tech. rep.
Dehlendorff, C., M. Kulahci, S. Merser, K. K. Andersen. 2010. Conditional value at risk as a measure for
waiting time in simulations of hospital units. Quality Technology and Quantitative Management 7
321–336.
Deo, S., I. Gurvich. 2011. Centralized vs. decentralized ambulance diversion: A network perspective. Man-
agement Science 57(7) 1300–1319.
Do, H. 2012. Information control and usage for pareto improvement of supply chains. Ph.D. thesis, Purdue
University.
Do, H., A. Iyer. 2013. Supply state driven promotion strategies in supply chains. Working paper.
Eitel, D., D. Samuelson. 2011. OR in the ER. OR/MS Today 32–35.
Gundlach, J. 2010. The problem of ambulance diversion and some potential solutions. New York University
Journal of Legislation and Public Policy 13.
Gurvich, I., O. Perry. 2012. Overflow networks: Approximations and implications to call center outsourcing.
Operations Research 60(4) 996–1009.
Guttmann, A., M.J. Schull, M.J. Vermeulen, T.A. Stukel. 2011. Association between waiting times and short
term mortality and hospital admission after departure from emergency department: population based
cohort study from Ontario, Canada. BMJ 342.
Guyette, F.X. 2010. Personal communication.
Hopp, W.J., W.S. Lovejoy. 2012. Hospital Operations: Principles of High Efficiency Health Care. FT Press
Operations Management, Pearson Education.
McCarthy, M. L. 2011. Overcrowding in emergency departments and adverse outcomes. BMJ 342.
Moskop, J. C, D. P Sklar, J. M Geiderman, R. M Schears, K. J Bookman. 2009. Emergency department
crowding, Part 1–Concept, causes, and moral consequences. Annals of Emergency Medicine 53(5)
605–611.
Muller, A., D. Stoyan. 2002. Comparison Methods for Stochastic Models and Risks. Wiley Series in Proba-
bility and Statistics, Wiley.
Pflug, Georg Ch. 2000. Some remarks on the value-at-risk and the conditional value-at-risk. Probabilistic
constrained optimization. Springer, 272–281.
39
Rockafellar, R. T., S. Uryasev. 2002. Conditional value-at-risk for general loss distributions. Journal of
Banking & Finance 26(7) 1443 – 1471.
Shaked, M., J.G. Shanthikumar. 2007. Stochastic Orders. Springer Series in Statistics, Springer.
Shanthikumar, G., M. Zazanis. 1999. Inequalities between event and time averages. Journal of Probability
in the Engineering and Informational Sciences 13(3) 293–308.
Sprivulis, P. C, J. Da Silva, I. Jacobs, G. Jelinek, A. Frazer. 2006. The association between hospital over-
crowding and mortality among patients admitted via Western Australian emergency departments.
Medical Journal of Australia 184 208–212.
Uryasev, S. 2000. Probabilistic Constrained Optimization: Methodology and Applications. Nonconvex Opti-
mization and Its Applications, Springer.