31
Staffing and Routing in Large-Scale Service Systems with Heterogeneous- Servers Mor Armony Based on joint papers with Avi Mandelbaum and Amy Ward

Staffing and Routing in Large-Scale Service Systems with Heterogeneous-Servers

  • Upload
    tori

  • View
    23

  • Download
    2

Embed Size (px)

DESCRIPTION

Staffing and Routing in Large-Scale Service Systems with Heterogeneous-Servers. Mor Armony. Based on joint papers with Avi Mandelbaum and Amy Ward. TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: A A A A A A A A A A A A A. Motivation: Call Centers. - PowerPoint PPT Presentation

Citation preview

Page 1: Staffing and Routing in Large-Scale Service Systems with Heterogeneous-Servers

Staffing and Routing in Large-Scale Service Systems with Heterogeneous-

Servers

Mor Armony

Based on joint papers with Avi Mandelbaum and

Amy Ward

¸ < ¹

Page 2: Staffing and Routing in Large-Scale Service Systems with Heterogeneous-Servers

Motivation: Call Centers

Page 3: Staffing and Routing in Large-Scale Service Systems with Heterogeneous-Servers

The Inverted-V Model

NK

K

K 21

• Calls arrive at rate (Poisson process).

• K server pools.

• Service times in pool k are exponential with rate k

N1

1

¹ 2 > ¹ 1

¹ > ¹

Experienced employees on averageprocess requests faster than new hires.Gans, Mandelbaum and Shen (2007)

Page 4: Staffing and Routing in Large-Scale Service Systems with Heterogeneous-Servers

The Problem

Routing: When an incoming call arrives to an empty queue, which agent pool should take the call?

Staffing: How many servers should be working in each pool?

¹ 2 > ¹ 1

¹ > ¹

x = y

NK

K

K 21

N1

1

Page 5: Staffing and Routing in Large-Scale Service Systems with Heterogeneous-Servers

Background: Human Effects in Large-Scale Service

Systems

M/M/N

M/M/N+M+M/M/N+

M/M/N+M

M/M/N++

Halfin & Whitt ’81

Borst et al ’04

Garnett et al ’02

Mandelbaum & Zeltyn ’08

Page 6: Staffing and Routing in Large-Scale Service Systems with Heterogeneous-Servers

Talk Outline

• M/M/N+ (Armony ‘05)• M/M/N++M (Armony & Mandelbaum

’08)• M/M/N++☺ (Armony & Ward ’08)

Page 7: Staffing and Routing in Large-Scale Service Systems with Heterogeneous-Servers

The Problem: M/M/N+

¹ 2 > ¹ 1

¹ > ¹

x = y

NK

K

K 21

N1

1

Minimize C1(N1) + ::: + CK (NK )Subject to P (wait > 0) · ®;

Assumption: FCFS

For some routing policy

Page 8: Staffing and Routing in Large-Scale Service Systems with Heterogeneous-Servers

The Routing Problem

¹ 2 > ¹ 1

¹ > ¹

x = y

Minimize C1(N1) + ::: + CK (NK )Subject to P (wait > 0) · ®;

For some routing policy

• For N1=N2=1 optimal routing is of a threshold form (the slow server problem)

• For general N, structure of optimal routing is an open problem (de Vericourt & Zhou)

• The optimal preemptive policy is FSFP (Proof: Sample-path argument)

Page 9: Staffing and Routing in Large-Scale Service Systems with Heterogeneous-Servers

The Asymptotic RegimeHalfin-Whitt (QED)

¹ 2 > ¹ 1

¹ > ¹

x = y

oN

K

k kk

1

, As

X̂ ¸ = X ¸ ¡ N ¸p

N ¸hX̂ ¸

i += scaled queue length

hX̂ ¸

i ¡= scaled # of idle servers

NK

K

K 21

N1

1

Page 10: Staffing and Routing in Large-Scale Service Systems with Heterogeneous-Servers

Asymptotically Optimal Routing

Proposition: The non-preemptive routing policy FSF is asymptotically optimal

Proof: State-space collapse: in the limit faster servers are always busy.

The preemptive and non-preemptive policies are asymptotically the same

Note: Thresholds are not-needed: The Halfin-Whitt regime is different from the conventional

heavy- traffic regime (Teh & Ward ’02).

Page 11: Staffing and Routing in Large-Scale Service Systems with Heterogeneous-Servers

Asymptotically Feasible Region

Page 12: Staffing and Routing in Large-Scale Service Systems with Heterogeneous-Servers

Asymptotic Feasibility

• Proposition: Under FSF

if and only if

where

provided that

lim¸ ! 1

P (wait > 0) = ® 0· ®· 1;

¹ 1N1 + ¹ 2N2 + ::: + ¹ K NK ¼¸ + ±p

¸; 0 · ±· 1

®=·1+

(±=p

¹ 1)©(±=p

¹ 1)Á(±=

p¹ 1)

¸¡ 1

;

liminf¸ ! 1 N1=N > 0.

Page 13: Staffing and Routing in Large-Scale Service Systems with Heterogeneous-Servers

Asymptotically Optimal Staffing

• All solutions of the form

have approximately the same cost

• Let C=inf {C(N) | ¹1N1+…+¹KNk=¸}

• Definition (Asymptotic Optimality)1. N* Asymptotically Feasible and2. (C(N*)-C)/(C(N)- C) · 1 (in the limit)

¹ 1N1 + ¹ 2N2 + ::: + ¹ K NK = ¸ + ±p

¸; 0< ±< 1

Page 14: Staffing and Routing in Large-Scale Service Systems with Heterogeneous-Servers

Asymptotically Optimal Staffing

Page 15: Staffing and Routing in Large-Scale Service Systems with Heterogeneous-Servers

Staffing Example:Homogeneous Cost Function

• Problem:

• Solve:

• To obtain:

• Note:

Minimize C1Np1 + C2N

p2 + ::: + CK N p

K ; p> 1Subject to P (wait > 0) · ®; for some routing policy

Minimize C1N p1 + C2N

p2 + ::: + CK N p

K ; p> 1Subject to ¹ 1N1 + ¹ 2N2 + ::: + ¹ K NK ¸ ¸ + ±

Nk

Nj=

µ¹ k=Ck

¹ j =Cj

¶1=(p¡ 1)

(1)

¹ 1N1 + ¹ 2N2 + ::: + ¹ K NK = ¸ + ±p

¸ (2)

N1=N > 0

Page 16: Staffing and Routing in Large-Scale Service Systems with Heterogeneous-Servers

Summary: M/M/N+• Routing: FSF• Staffing: Square-root safety capacity (QED

regime as an outcome)• Under FCFS non-idling is asymptotically

optimal• For non-idling policies: min P(W>0) min EW• Outperforming M/M/N• Faster servers are never idle• All idleness is experienced by the slowest

servers

Page 17: Staffing and Routing in Large-Scale Service Systems with Heterogeneous-Servers

Adding Fairness

Page 18: Staffing and Routing in Large-Scale Service Systems with Heterogeneous-Servers
Page 19: Staffing and Routing in Large-Scale Service Systems with Heterogeneous-Servers

Fairness in Call Center

¹ 2 > ¹ 1

¹ > ¹

x = y

Call centers care aboutEmployee burnout and turnover.

Some call centers address fairness byrouting to the server that has idled the longest (LISF).

How does LISF perform?

Do any other fair policies perform better?

NK

K

K 21

N1

1

Page 20: Staffing and Routing in Large-Scale Service Systems with Heterogeneous-Servers

The Fairness Problem

¹ 2 > ¹ 1

¹ > ¹

x = y

Minimize C1(N1)+…+CK(NK)

Subject to:

E(Waiting time)· W

E[# of idle servers of pool k]=

fk

E[Total # of idle servers]

* f1 + f2 + … + fK = 1

Assumption: Non-idling

NK

K

K 21

N1

1

Page 21: Staffing and Routing in Large-Scale Service Systems with Heterogeneous-Servers

The Fairness Problem: Routing

Minimize E[Waiting Time]Subject to:

E[# of idle servers of pool k]= fk

E[Total # of idle servers]

Analysis: Sample-path arguments are not straightforward even if preemption is allowed.

¹ 2 > ¹ 1

¹ > ¹

x = y

Page 22: Staffing and Routing in Large-Scale Service Systems with Heterogeneous-Servers

MDP Approach: Routing(Assumption: non-idling)

Q=1 Q=2 Q=31,1

1,00,0

0,1

= 1+ 2 N1 = N2 = 1

2

1

1

2Pslow

Pfast

Infinite state space

Page 23: Staffing and Routing in Large-Scale Service Systems with Heterogeneous-Servers

Numeric Example

Page 24: Staffing and Routing in Large-Scale Service Systems with Heterogeneous-Servers

MDP as an LP

• Complexity: Polynomial in N, Exponential in K• Solution: Switching curve (Difficult to

characterize explicitly).• How does solution perform vs. LISF?• Staffing search: Too long!!!• Instead, we propose an asymptotic approach.

Page 25: Staffing and Routing in Large-Scale Service Systems with Heterogeneous-Servers

Threshold Routing Control

)0 and(with

levelsat control dA threshol

0

11

K

K

LN L

N L...L

x = y2

NL1 L3 L2

FSF

FSFw/opool 3 FSF w/o pool 2

0

FSFw/opool 4

Page 26: Staffing and Routing in Large-Scale Service Systems with Heterogeneous-Servers

Outline of Asymptotic Analysis

• Formulation of a Diffusion Control Problem (DCP)

• Solution of DCP: Multi-Threshold Control

• Note: Resulting Diffusion has Discontinuous Drift

• Policy Translation: Multi-Threshold Policy

• Policy Adjustment: -Threshold Policy

• Establishing Asymptotic Optimality

Page 27: Staffing and Routing in Large-Scale Service Systems with Heterogeneous-Servers

²-Threshold Policy

X

Death rate

slope ¹2

slope ¹1

L N

Page 28: Staffing and Routing in Large-Scale Service Systems with Heterogeneous-Servers

Asymptotic Performance (Simulation)

1 = 1, 2 = 2, = 1, = 1.5, 2 = 2 = 3, N1=300, N2=200, ¸=674

A Simulation Comparison of the Threshold and LWISF Policy

0.0000002.000000

4.0000006.000000

8.00000010.000000

12.000000

0 0.5 1

Slow Server Idleness Proportion

E[N

um

be

r o

f W

ait

ing

C

us

tom

ers

]

Threshold

LWISF

Page 29: Staffing and Routing in Large-Scale Service Systems with Heterogeneous-Servers

Literature Review

• MDP approach to constrained optimization– Gans and Zhou (2003), Bhulai and Koole (2003)

• The Limit Regime– Halfin and Whitt (1981)

• The Inverted V (and more general) Models– Tezcan (2006), Atar (2007), Atar & Shwartz

(2008), Atar, Shaki & Shwartz (2009), Tseytlin (2008)

- Gurvich and Whitt (2007)

• Customer / Flow Fairness literature– Harchol-Balter and Wierman (2003, 2007)– Jahn et al (2005) & Schulz and Stier-Moses (2006)

• Fairness literature in HRM

Page 30: Staffing and Routing in Large-Scale Service Systems with Heterogeneous-Servers

Summary

• Server Heterogeneity: Effect on Staffing and Routing

• Incorporation of customer abandonment

• Incorporation of server fairness

• Simple routing schemes (priorities and threshold) • Simple staffing schemes (square-root safety

staffing)

Page 31: Staffing and Routing in Large-Scale Service Systems with Heterogeneous-Servers

Further Research

• Multi-skill environment (ongoing with Kocaga)

• LWISF policy (ongoing with Gurvich)• Non-idling assumption• Incorporate abandonment

(M/M/N++M+☺)• Other fairness criteria• Server compensation schemes

Acknowledgement: Rami Atar, Ashish Goel, Itay Gurvich, Tolga Tezcan & Assaf Zeevi