Optimal Pricing and Replenishment in a Single-Product

Optimal Pricing and Replenishment in a

Single-Product Inventory System

Hong ChenCheung Kong Graduate School of Business, China

Owen Q. Wu∗

Sauder School of Business, University of British Columbia, Canada

David D. Yao†

IEOR Dept., Columbia University, New York, USA

March 1, 2004

Abstract

We study an inventory system that supplies price-sensitive demand modeled by Brownianmotion, focusing on the optimal pricing and inventory replenishment decisions, under bothlong-run average and discounted objectives. Analytical solutions are obtained in all cases,and related to or contrasted against previously known results. In addition, we bring out theinterplay between the pricing and the replenishment decisions, and the way they react todemand uncertainty. We show that the joint optimization of both decisions may result insignificant profit improvement over the traditional way of making the decisions separatelyor sequentially. We also show that multiple price changes will only result in a limited profitimprovement over a single price.

Keywords: Joint pricing-replenishment decision, price sensitive demand, Brownian model.

∗Supported in part by University Graduate Fellowship from the University of British Columbia.†Supported in part by NSF grant DMI-0085124. Part of this author’s research was undertaken while at the

Dept of Systems Engineering and Engineering Management, Chinese University of Hong Kong, and supportedby HK/RGC Grant CUHK4173/03E.

1

1 Introduction

We study a single-product continuous-review inventory model with price-sensitive demand.

The cumulative demand process is modeled by a Brownian motion with a drift rate that is a

function of the price. Replenishment is instantaneous, and demands are satisfied immediately

upon arrival. Consequently, the replenishment follows a simple order-up-to policy, with the

order-up-to level denoted by S.

We allow the pricing decisions to be dynamically adjusted over time. Specifically, we divide

S into N equal segments, with N a given integer. For each segment, there is a price; and all N

prices are optimally determined, jointly with the replenishment level S, so as to maximize the

expected long-run average or discounted profit.

There has been a substantial and growing literature on the joint pricing and inventory con-

trol. We refer the reader to two recent survey papers, Yano and Gilbert (2002), and Elmaghraby

and Keskinocak (2003). Our review below will focus on those works that relate closely to our

study.

Whitin (1955), Porteus (1985a), Rajan et al. (1992), among others, study demands that

are deterministic functions of prices. Whitin (1955) connects pricing and inventory control

in the EOQ (economic order quantity) framework, and Porteus (1985a) provides an explicit

solution for the linear demand instance. Rajan et al. (1992) investigates continuous pricing for

perishable products for which demands may diminish as products age.

Other works study stochastic demand models. Li (1988) considers a make-to-order produc-

tion system with price-sensitive demand. Both production and demand are modeled by Poisson

processes with controllable intensities. The control of demand intensity is through pricing. A

barrier policy is shown to be optimal: when the inventory level reaches an upper barrier, the

production stops; when the inventory level drops to zero, the demand stops (or the demand is

lost). It is also shown that the optimal price is a non-increasing function of the inventory level.

Federgruen and Heching (1999) examines a model in which the firm periodically reviews

inventory and decides both the replenishment quantity and the price to charge over the period.

The replenishment cost is linear, without a fixed setup cost. The prices are changed only at

the beginning of each period (as opposed to the continuous pricing scheme in Rajan et al.

1992). It is shown that a base-stock list-price policy is optimal for both average and discounted

objectives. Earlier related works include Zabel (1972) and Thowsen (1975).

Thomas (1974), Polatoglu and Sahin (2000), Chen and Simchi-Levi (2003a,b), Feng and

2

Chen (2003), and Chen et al. (2003) extend the above model to include a replenishment setup

cost. In the full-backlog setting, it is first conjectured in Thomas (1974), and then proved

in Chen and Simchi-Levi (2003a), that the (s, S, p) policy is optimal for additive demand (a

deterministic demand function plus a random noise) in a finite horizon. Chen and Simchi-

Levi (2003b) further proves the optimalify of the stationary (s, S, p) policy for general demand

in an infinite horizon. Feng and Chen (2003) proves the optimality of (s, S, p) policy under

more general demand functions, but restricting the prices to a finite set. Assuming lost sales,

Polatoglu and Sahin (2000) obtains rather involved optimal policies under a general demand

model and provides restrictive conditions under which the (s, S, p) policy is optimal. Chen et

al. (2003) proves that (s, S, p) policy is optimal under additive demand and lost sales.

Continuous-review models are studied in Feng and Chen (2002), and Chen and Simchi-Levi

(2003c). Feng and Chen (2002) models the demand as a price-sensitive Poisson process. Pricing

and replenishment decisions are made upon finishing serving each demand, but the prices are

restricted to a given finite set. An (s, S, p) policy is proved optimal, with the optimal prices

depending on the inventory level in a rather structured manner. Chen and Simchi-Levi (2003c)

generalizes this model by considering compound renewal demand process with both the inter-

arrival times and the size of the demand depending on the price. It is shown that the (s, S, p)

policy is still optimal.

Our work differs from the above papers in two aspects. First, we model the demand process

by Brownian motion, with a drift term being a function of the price. The Brownian model, with

its continuous path, is appropriate for modeling fast-moving items. It is a natural model when

demand forecast involves Gaussian noises. Furthermore, it allows us to bring out explicitly

the impact of demand variability in the optimal pricing and replenishment decisions, whereas

results along this line are quite limited in the existing literature. (For other works that model

production-inventory systems using Brownian motion, we refer to Puterman (1975) and Har-

rison (1985).) Second, we examine the number of price changes allowed as inventory depletes,

and demonstrate that using a small number of prices, optimally determined, is usually good

enough.

Some of the key findings and new insights from our study include:

• Demand variability incurs an additional inventory holding cost.

• As demand variability increases, the optimal price decreases and the optimal replenish-

ment level increases.

3

• The traditional way of separating the pricing and replenishment decisions could result in

significant profit loss, as compared with the joint decision.

• Multiple price changes will only result in a limited profit improvement over a single price

(when both are optimally determined). The relative improvement, however, becomes

more significant in applications where the profit margin is low.

The rest of the paper is organized as follows. In Section 2, we present a formal description of

our model, in terms of the demand and inventory processes, the cost functions, and the pricing

and replenishment decisions. In Section 3, we study the optimal pricing and replenishment

decisions under the long-run average objective. We start from making these decisions separately,

so as to highlight the comparisons against prior studies and known results, and then present

the joint optimization model and demonstrate the profit improvement. In Section 4, we present

analogous results under the discounted objective, emphasizing the contrasts against the average

objective case. We conclude the paper pointing out possible extensions in Section 5.

2 Model Description and Preliminary Results

We consider a continuous-review inventory model with a price-sensitive demand. The objective

is to determine the inventory replenishment and pricing decisions that strike a balance between

the sales revenue and the cost for holding and replenishing inventory over time, so as to maximize

the expected long-run average or discounted profit. The specifics of the demand model, the

cost parameters and the control policy are elaborated in the following three subsections.

2.1 The Demand Model

The subject of our study is a single-product inventory system supplying a price-sensitive demand

stream. The cumulative demand up to time t is denoted as D(t), and modeled by a diffusion

process:

D(t) =∫ t

0λ(pu)du+ σB(t), t ≥ 0, (1)

where pt is the price charged at time t; λ(pt) is the demand rate at time t, which is a decreas-

ing function of pt; B(t) denotes the standard Brownian motion; and σ is a positive constant

measuring the variability of the demand (or the error of demand forecast).

This Brownian demand model can be related to other discrete (i.e., integer-valued) demand

processes through strong approximation (Csorgo and Horvath 1993). Consider, for instance, a

4

Poisson demand process with an instantaneous rate λ(pt), and write the cumulative demand

as A( ∫ t

0λ(pu)du

), where A(·) denotes the Poisson process with a unit rate. Then the strong

approximation implies that a version of A on a suitable probability space satisfies

sup0≤t≤T

∣∣∣A( ∫ t

0λ(pu)du

)−∫ t

0λ(pu)du−B

( ∫ t

0λ(pu)du

)∣∣∣ = O(log(T )),

where B is a standard Brownian motion defined on the same probability space. If the process

A follows a more general probability law (i.e., not necessarily Poisson), then the order of ap-

proximation will be O(T 1/r) for a constant r ∈ (2, 4). Using Brownian model to approximate

discrete (point) process has been a standard approach in many other applications, stochastic

networks in particular; refer to, e.g., Harrison (1988, 2003), and Chen and Yao (2001).

Let P and L denote, respectively, the domain and the range of the demand rate function

λ(·). Both are assumed to be intervals of <+ (the set of nonnegative real numbers); in addition,

0 6∈ L.

Assumption 1 (on demand rate) The demand rate λ(p) and its inverse p(λ) are both positive-

valued, strictly decreasing, and twice continuously differentiable in the interior of P and L,respectively. The revenue rate r(λ) = p(λ)λ is strictly concave in λ.

Many commonly-used demand functions satisfy the above assumption, including the follow-

ing examples, where the parameters α, β and δ are all positive:

• The linear demand function λ = α−βp, p ∈ [0, α/β]: r(λ) = αβλ−

1βλ

2 is strictly concave;

• The exponential demand function λ = αe−βp, p ≥ 0: r(λ) = − 1βλ log(λ/α) is strictly

concave;

• The power demand function λ = βp−δ, p ≥ 0: r(λ) = λ−1δ+1β

1δ is strictly concave if

δ ≥ 1.

2.2 Cost Parameters

Let h be the cost for holding one unit of inventory for one unit of time. Let c(S) be the cost to

replenish S units of inventory.

Assumption 2 (on replenishment cost) The replenishment cost function c(S) is twice con-

tinuously differentiable and increasing in S for S ∈ (0,∞). The average cost a(S) = c(S)/S is

strictly convex in S, and a(S)→∞, as S → 0.

5

Consider a special case: c(S) = K + cSδ, S > 0, where K, c, δ > 0. When δ = 1, this

is the most commonly used linear function with a setup cost K. The average cost function,

a(S) = KS + cSδ−1 is convex if δ ∈ (0, 1] ∪ [2,∞]. Furthermore, as we shall demonstrate below,

when S satisfies a′(S) ≤ 0, which is the primary case of interest, a(S) is convex for all δ > 0.

To see this, consider δ ∈ (1, 2), and note that

a′(S) =1S2

(−K + c(δ − 1)Sδ);

and when a′(S) ≤ 0, we have

a′′(S) = −2− δS3

(− 2

2− δK + c(δ − 1)Sδ

)≥ −a

′(S)(2− δ)S

≥ 0.

2.3 Pricing and Replenishment Policies

Assume replenishment is instantaneous, i.e., with zero leadtime. We further assume that all

orders (demand) will be supplied immediately upon arrival; i.e., no back-order is allowed, or

there is an infinite back-order cost penalty. (Our results extend readily to the back-order case;

refer to Section 5.)

The replenishment follows a continuous-review, order-up-to policy. Specifically, whenever

the inventory level drops to zero, it is brought up to S instantaneously via a replenishment,

where S is a decision variable. We shall refer to the time between two consecutive replenishments

as a cycle.

We adopt the following dynamic pricing strategy. Let N ≥ 1 be a given integer, and let

S = S0 > S1 > · · · > SN−1 > SN = 0. Immediately after a replenishment at the beginning

of a cycle, price p1 is charged until the inventory drops to S1; price p2 is then charged until

the inventory drops to S2; ...; and finally when the inventory level drops to SN−1, price pN is

charged until the inventory drops to SN = 0, when another cycle begins. The same pricing

strategy applies to all cycles. For simplicity, we set Sn = S(N − n)/N . That is, we divide the

full inventory of S units into N equal segments, and price each segment with a different price

as the inventory is depleted by demand.

In summary, the decision variables are: (S,p), where S ∈ <+, and p = (p1, . . . , pN ) ∈ PN .

Within a cycle, we shall refer to the time when the price pn is applied as period n.

2.4 The Inventory Process

Without loss of generality, suppose at time zero the inventory is filled up to S. We focus on

the first cycle which ends at the time when inventory reaches zero. Let T0 = 0, and let Tn be

6

the first time when inventory drops to Sn:

Tn := inf{t ≥ 0 : D(t) = nS/N}, n = 1, 2, . . . , N.

The length of period n is therefore τn := Tn − Tn−1. Since Tn’s are stopping times, by the

strong Markov property of Brownian motion, τn is just the time during which S/N units of

demand has occurred under the price pn. That is,

τndist.= inf{t ≥ 0 : λ(pn)t+ σB(t) = S/N}. (2)

Let X(t) denote the inventory-level at t. We have,

X(t) = S −D(t), t ∈ [0, TN ).

Since our replenishment-pricing policy is stationary, X(t) is a regenerative process with the

replenishment epochs being its regenerative points.

We conclude this section with a lemma, which gives the first two moments and the generating

function of the stopping time τn. (The proof is in the appendix.)

Lemma 2.1 For the stopping time τn in (2), we have

E(τn) =S

Nλn, (3)

E(τ2n) =

σ2

λ2n

E(τn) + E2(τn) =σ2S

Nλ3n

+S2

N2λ2n

, (4)

E(e−γτn) = exp[−√

λ2n+2σ2γ−λn

σ2SN ], (5)

where λn = λ(pn) > 0, and γ > 0 is a parameter.

3 Long-Run Average Objective

To optimize the long-run average profit, thanks to the regenerative structure of the inventory

process, it suffices for us to focus on the first cycle. Recall that period n refers to the period in

which the price pn applies, and the inventory drops from Sn−1 = (N−n+1)SN to Sn = (N−n)S

N . We

first derive the total inventory over this period. Applying integration by parts, and recognizing

dX(t) = −dD(t), X(Tn−1) = Sn−1, X(Tn) = Sn,

7

we have∫ Tn

Tn−1

X(t)dt = TnSn − Tn−1Sn−1 −∫ Tn

Tn−1

tdX(t)

= TnSn − Tn−1Sn−1 −∫ Tn

Tn−1

Tn−1dX(t) +∫ Tn

Tn−1

(t− Tn−1)dD(t)

= τnSn +∫ Tn

Tn−1

(t− Tn−1)[λ(pn)dt+ σdB(t)

].

A simple change of variable yields∫ Tn

Tn−1

(t− Tn−1)λ(pn)dt =∫ τn

0uλ(pn)du =

12λ(pn)τ2

n;

whereas

E

[ ∫ Tn

Tn−1

(t− Tn−1)dB(t)]

= E

[ ∫ Tn

Tn−1

tdB(t)]− E

[Tn−1

]E[B(Tn)−B(Tn−1)

]= 0,

follows from the martingale property of B(t) and the optional stopping theorem.

Let vn(S, pn) denote the expected profit (sales revenue minus inventory holding cost) during

period n. Then, making use of the above derivation, along with Lemma 2.1, we have

vn(S, pn) =pnS

N− E

[ ∫ Tn

Tn−1

hX(t)dt]

=pnS

N− hE[τn]Sn −

12hλ(pn)E[τ2

n]

=pnS

N−hS2(N − n+ 1

2)N2λ(pn)

− hσ2S

2Nλ(pn)2, (6)

Note in the above expression, the first term is the sales revenue from period n, the second term

is the inventory holding cost attributed to the deterministic part of the demand (i.e., the drift

part of the Brownian motion), and the last term is the additional holding cost incurred by

demand uncertainty.

For ease of analysis, below we shall often use {µn = λ(pn)−1, n = 1, . . . , N} as decision

variables and denote µ = (µ1, . . . , µN ) ∈ MN and M = {λ(p)−1 : p ∈ P}. Then, the long-run

average objective can be written as follows:

V (S,µ) =∑N

n=1 vn(S, pn)− c(S)SN

∑Nn=1 µn

=

∑Nn=1

[p( 1

µn)− hS

N (N − n+ 12)µn − hσ2

2 µ2n − a(S)

]∑N

n=1 µn

. (7)

The additional holding cost due to demand uncertainty is represented by hσ2∑N

n=1 µ2n

2∑N

n=1 µn.

8

For the special case when N = 1, the price and the demand rate are both constants in a

cycle (and hence constant throughout the horizon). The objective function (7) takes a simpler

form. For comparison with some classical work, we use λ as the decision variable and denote

the long-run average profit under single-price policy as V (S, λ). Then,

V (S, λ) =v1(S, p(λ))− c(S)

S/λ= r(λ)− hS

2− λa(S)− hσ2

2λ. (8)

The classical EOQ model only consists of the second and third terms in (8), which are the total

cost if the demand is deterministic with a constant rate. Whitin (1955) and Porteus (1985a)

considered the price-sensitive EOQ model, which involves the first three terms in (8). Our

model gives rise to an additional holding cost due to demand variability.

3.1 Optimal Replenishment with Fixed Prices

In this case, the set of prices p, or equivalently, µ is given, and the firm’s problem is

maxS>0

V (S,µ).

Under Assumption 1, V (S,µ) is strictly concave in S. Note that V (S,µ)→ −∞ as S →∞ or

S → 0 (the latter is due to Assumption 2 that a(S)→∞ as S → 0). Thus, the unique optimal

replenishment level is determined by the first-order condition:

S∗ = a′−1(− h

N2

N∑n=1

(N − n+12)µn

), (9)

where a′−1 is well-defined since a(·) is strictly convex and a′(·) is strictly increasing under As-

sumption 2. In practice, the average cost as a function of quantity is usually first decreasing

(due to economy of scale) and then increasing (due to capacity or other technological restric-

tions). However, at the optimal replenishment level, we have a′(S∗) < 0 for any fixed prices.

This observation helps to reduce the search space when the replenishment level is optimized

jointly with pricing decisions (see Section 3.3).

The way that the demand variability and the holding cost impact the optimal replenishment

level can be readily derived from (9).

Proposition 3.1 With prices fixed,

(i) the optimal replenishment level S∗ is independent of σ, and decreasing in h and in pn (for

any n);

(ii) the optimal profit is decreasing in σ and h.

9

Proof. Part (i) is obvious from (9). (Note, in particular, that the additional holding cost due

to demand variability is independent of the replenishment level.) For (ii), consider σ1 ≤ σ2,

and denote the corresponding maximizers as S∗1 and S∗2 and the objective values as V (S∗1 ,µ, σ1)

and V (S∗2 ,µ, σ1), respectively. We have V (S∗1 ,µ, σ1) ≥ V (S∗2 ,µ, σ1) ≥ V (S∗2 ,µ, σ2), where the

first inequality is due to the maximality of S∗1 and the second one follows immediately from the

objective function in (7). The case of decreasing in h is completely analogous.

Example 3.1 Consider the linear replenishment cost: c(S) = K + cS, where K, c > 0. The

first order condition in (9) leads to the familiar EOQ formula:

S∗ =

√2Kλa

h, where λa =

( N∑n=1

2N − 2n+ 1N2

µn

)−1. (10)

In the standard EOQ model, the demand rate is taken as the average demand per time unit,

which in our setting becomes

λ =( 1N

N∑n=1

µn

)−1. (11)

Suppose that the fixed prices satisfy p1 ≤ p2 ≤ · · · ≤ pN (or µ1 ≤ µ2 ≤ · · · ≤ µN ), i.e.,

a lower price is charged when the inventory level is higher (we will see this price pattern in

Proposition 3.3). Higher weights are given to smaller µn’s in (10), while µn’s are equally

weighted in (11). Hence, λ ≤ λa. This implies that the standard EOQ with a time-average

demand rate may lead to a replenishment level lower than the optimally desirable.

Example 3.2 Let c(S) = 100 + 5S, λ(p) = 50 − p, h = 1 and N = 1. In Figure 1(a), the

EOQ quantity is shown as a function of the single fixed price, S∗ = 10√

100− 2p. Figure 1(b)

illustrates the optimal profit corresponding to various levels of demand variability. The peak of

each V ∗ curve is reached when both the price and the replenishment level are jointly optimized.

When the price deviates far away from the optimum, the profit falls dramatically.

Figure 1: Optimal replenishment with a fixed price

10

3.2 Optimal Pricing with a Fixed Replenishment Level

3.2.1 The Single-Price Problem

Here N = 1. We want to optimize the single price, or equivalently, the demand rate, as follows:

maxλ∈L

V (S, λ) = r(λ)− hS

2− λa(S)− hσ2

2λ.

Under Assumptions 1 and 2, V (S, λ) is strictly concave in λ. Assuming an interior solution

(which is rather innocuous, since the last two terms in the objective forbid λ to be extreme),

the unique optimal λ follows from the first-order condition:

r′(λ) +hσ2

2λ2= a(S). (12)

We have the following proposition.

Proposition 3.2 Suppose a single price is optimized while the replenishment level is held fixed.

(i) If the optimal price p∗ is in the interior, then the marginal revenue at p∗ is no larger than

the average replenishment cost, i.e., r′(λ(p∗)) ≤ a(S).

(ii) The optimal price p∗ is decreasing in σ and h.

(iii) The optimal price p∗ is increasing in a(·); if a(S) is decreasing in S, then p∗ is decreasing

in S.

(iv) The optimal profit is decreasing in σ and h.

Proof. The statement in (i) follows immediately from (12). For (ii) and (iii), note that∂2V∂λ∂σ = hσ

λ2 > 0, ∂2V∂λ∂h = σ2

2λ2 > 0, and ∂2V∂λ∂(a(S)) = −1 < 0. That is, the objective function

V is supermodular in (λ, σ), supermodular in (λ, h) and submodular in (λ, a(S)). Hence, (ii)

and (iii) follow from standard results for super/submodular functions (Topkis 1978). For (iv),

follow a similar argument to the one that proves Proposition 3.1 (ii).

Example 3.3 If λ(p) = βp−1, c(S) = K + cS, where β,K, c > 0, then r(λ) = β, and the

first-order condition (12) becomeshσ2

2λ2=K

S+ c,

leading to the optimal price

p∗ =β

λ∗=β

σ

√2(K

S + c)h

,

which is clearly decreasing in σ, h and S.

11

Example 3.4 If λ(p) = α − βp, where α, β > 0, then r(λ) = (α − λ)λ/β, and the first-order

condition in (12) becomes

α− 2λβ

+hσ2

2λ2= a(S),

which can be simplified to

f(λ) ≡ 4λ3 + 2[βa(S)− α]λ2 − hβσ2 = 0.

Note that f(0) < 0, f(+∞) = +∞, f ′(λ) = 4λ[3λ + βa(S) − α]; hence, f(λ) is either strictly

increasing or first decreasing and then increasing. In either case, the above equation has a

unique solution λ∗ in (0,∞). If λ∗ ≤ α, it is the optimal demand rate, and the optimal price

is p∗ = (α − λ∗)/β. If λ∗ > α, it can be shown that V is increasing in λ for λ ∈ [0, λ∗], and

therefore the optimal demand rate is α and the optimal price is zero.

If σ or h increases or a(·) decreases, then f(λ) decreases for any λ; and consequently, λ∗

increases. This illustrates the monotonicity in (ii) and (iii) of Proposition 3.2.

Example 3.5 Under the same setting as Example 3.2, Figure 2 plots the optimal price and

the corresponding profit against various levels of replenishment and demand variability. The

results depict the qualitative trends in Proposition 3.2. The peak of each V ∗ curve is reached

when the price and inventory are jointly optimized. In contrast to Figure 1(b), the profit here

appears less sensitive to the replenishment level, as long as the pricing decision is optimized.

Figure 2: Optimal price with a fixed replenishment level

3.2.2 The N-Price Problem

In this case, the decision variables are the set of prices p, or equivalently, µ. Specifically, the

problem is

maxµ∈MN

V (S,µ) =

∑Nn=1

[p( 1

µn)− hS

N (N − n+ 12)µn − hσ2

2 µ2n − a(S)

]∑N

n=1 µn

. (13)

The strict concavity of the revenue function r(λ) (Assumption 1) implies that the function

p( 1µ) is strictly concave in µ. This is because r′′(λ) = 2p′(λ) + λp′′(λ) and d2p( 1

µ)/dµ2 =1µ3 (2p′( 1

µ)+ 1µp

′′( 1µ)) have the same sign. Thus, the numerator of the objective is strictly concave

in µ. The ratio of a positive concave function over a positive linear function is known to be

12

pseudo-concave (Mangasarian 1970). Hence, V (S,µ) is pseudo-concave in µ. Furthermore, it

is strictly pseudo-concave in the sense that

∇µV (S,µ1)(µ2 − µ1) ≤ 0 ⇒ V (S,µ2) < V (S,µ1). (14)

Hence, the optimal µ must be unique. (This follows from a straightforward extension of Man-

gasarian (1970) from pseudo-concavity to strict pseudo-concavity.) Suppose the optimality is

achieved at an interior point. Then the optimal µ is uniquely determined by the first-order

condition:

∂vn(S, µn)∂µn

=∑N

k=1 vk(S, µk)− c(S)∑Nk=1 µk

, n = 1, . . . , N, (15)

where vn(S, µn) = vn(S, p( 1µn

)). Note that the right side of the above does not depend on n.

This implies that the optimal pricing must be such that the marginal profit is the same across

all periods (assuming an interior optimum).

The following proposition describes the basic optimal pricing pattern.

Proposition 3.3 For any replenishment quantity S, the optimal prices are increasing over the

periods, i.e., p∗1 ≤ p∗2 ≤ · · · ≤ p∗N .

Proof. Since pn = p( 1µn

) is increasing in µn, it suffices to show that the optimal µ∗n is increasing

in n. But this follows immediately from (13), since given any (µ1, . . . , µN ), rearranging these

variables (but not changing their values) in increasing order will maximize the term∑N

n=1 nµn

in the numerator, while all other terms remain unchanged.

Hence, within each cycle, the optimal prices are increasing over the periods. The following

lemma gives an inequality to bound the price increments. The inequality can be derived from

the equal marginal profit property, but it still holds without the interior optimum assumption,

and its proof is in the appendix.

Lemma 3.1 For fixed S, let µ∗ be the optimal N prices. Then for 1 ≤ m < n ≤ N ,

µ∗n − µ∗m ≤ S(n−m)N(σ2 + B

h

) ≤ S(n−m)Nσ2

,

where B = inf{−

d2p( 1µ

)

dµ2 : µ ∈ [µ∗1, µ∗N ]}.

Next we explore the impact of the parameters, σ, h and S on the optimal prices. We shall

write µ∗(σ), µ∗(h) and µ∗(S) to emphasize the dependence of the maximizer of (13) on these

parameters. The proof of the next lemma is also in the appendix.

13

Lemma 3.2 Assuming interior optimum,

(i) µ∗(σ) is differentiable and dµ∗n(σ)dσ < 0 if and only if

N∑k=1

µ∗k(σ)2 − 2µ∗n(σ)N∑

k=1

µ∗k(σ) < 0; (16)

(ii) µ∗(h) is differentiable and dµ∗n(h)dh < 0 if and only if

S

N

N∑k=1

(n− k)µ∗k + σ2N∑

k=1

(µ∗k2

2− µ∗nµ∗k

)< 0. (17)

(iii) µ∗(S) is differentiable and dµ∗n(S)dS < 0 if and only if

h

N2

N∑k=1

(n− k)µ∗k(S) < −a′(S); (18)

The results provided in the above lemma are local, in the sense that they only hold for

parameter values that satisfy the specific conditions in question. Nevertheless, for µ1 and µN

some of these results can be strengthened; see (i)-(iii) of the following proposition.

Proposition 3.4 Assuming interior optimum,

(i) The highest price p∗N (σ) is decreasing in σ.

(ii) The lowest price p∗1(h) is decreasing in h.

(iii) If a′(S) < 0, then p∗1(S) is decreasing in S;

(iv) The optimal profit is decreasing in σ and h.

Proof. (i) By Lemma 3.2, the inequality in (16) always holds for n = N , as µ∗N is the largest

component of µ following Proposition 3.3. Hence µ∗N is decreasing in σ.

(ii) Denote the left side of (17) by Ln. We will show that L1 + LN < 0 and L1 < LN , which

then implies L1 < 0.

L1 + LN =S

N

N∑k=1

(N + 1− 2k)µ∗k + σ2N∑

k=1

[µ∗k

2 − (µ∗1 + µ∗N )µ∗k]

=S

N

[(N − 1)(µ∗1 − µ∗N ) + (N − 3)(µ∗2 − µ∗N−1) + · · ·+ (µ∗bN

2c − µ

∗bN+3

2c)]

+ σ2N∑

k=1

[µ∗k(µ

∗k − µ∗N )− µ∗1µ∗k

]< 0,

14

where the last inequality follows from µ∗n increasing in n. Next, note that

L1 − LN =[S(1−N)

N+ σ2(µ∗N − µ∗1)

] N∑k=1

µ∗k ≤ 0,

where the last inequality follows from Lemma 3.1. Hence L1 < 0, and p∗1(h) is decreasing in h.

(iii) By Lemma 3.2, if a′(S) < 0 and n = 1, the inequality in (18) always holds. Hence µ∗1 is

decreasing in S.

(iv) The proof is completely analogous to the proof for Proposition 3.1 (ii).

The proof of part (i)-(iii) of the above proposition relies on the interior optimum assumption

in Lemma 3.2, but we find through numerical tests that these monotonicity results hold even

without the interior optimum assumption. The next example illustrates Proposition 3.3 and

3.4 and provides comparison plots for Propositions 3.4 and 3.2.

Example 3.6 (a) Let p(λ) = 50 − 10λ, c(S) = 500 + 5S, S = 100, h = 2, and let σ vary

from 0 to 20. For N = 4, the optimal prices are plotted in Figure 3 (a). The highest price is

decreasing in σ while the others are not. There also exist other instances in which all the prices

are decreasing in σ. The demand variability incurs additional holding cost hσ2∑

µ2n∑µ . When σ

increases, in order to balance out the increase of this cost, intuitively we need to decrease µ

and at the same time decrease the spread of µ (i.e., µN − µ1). The composite effect is mixed

except for µ∗N .

(b) Let p(λ) = 50 − 10λ, c(S) = 500 + 5S, S = 100, σ = 1, and let h vary from 0 to 3. The

optimal prices are shown in Figure 3 (b). The lowest price is decreasing in h while the others

are not. The spread of the prices increases significantly with the increase in h.

(c) Let p(λ) = 50 − λ, c(S) = 100 + S, h = 0.5, σ = 3, and let S vary from 0 to 400. The

optimal prices are shown in Figure 3 (c). Note that a′(S) < 0 for all S. The lowest price is

decreasing in S while the others are not. The spread of the prices increases in S.

Figure 3: Optimal N prices and optimal single price

All the above examples show p∗1 < · · · < p∗N . For comparison, the optimal price for the

single-price problem are also plotted as p∗ in Figure 3.

Dynamic pricing obviously produces more profit than using a single fixed price, since the

latter is a special case of the dynamic pricing strategy. To quantify the potential benefit of

dynamic pricing is the purpose of the following proposition.

15

Proposition 3.5 For fixed S, let µ∗ be the optimal N prices, and let V ∗N be the optimal average

profit defined in (13). Then,

V ∗N − V ∗1 ≤hS2

(1−N−2

)12µ

(σ2 + B

h

) ,where µ = 1

N

∑Nn=1 µ

∗n and B = inf

{−

d2p( 1µ

)

dµ2 : µ ∈ [µ∗1, µ∗N ]}.

Proof. We consider a feasible single-price policy that charges a price p(µ−1), and compare

this with the optimal N -price policy. We show the latter has a lower average revenue, a higher

additional holding cost due to demand variability, and the same average ordering cost. Hence,

for the optimal N -price policy to yield a higher profit, it must have a lower average holding

cost attributed to the deterministic part of the demand. By the same reasoning, the difference

between these two average holding-cost terms corresponding to the two policies constitute an

upperbound on the improvement from the single-price policy to the N -price policy.

Specifically, the average profit of charging a single price p(µ−1) is

V1 =p( 1

µ)− hS2 µ−

hσ2

2 µ2 − a(S)

µ.

We have

V ∗N − V ∗1≤ V ∗N − V1

=

∑Nn=1

[p( 1

µ∗n)− p( 1

µ)]− hS

N

∑Nn=1

[(N − n+ 1

2)µ∗n − N2 µ)

]− hσ2N

2

[1N

∑Nn=1 µ

∗2n − µ2

]∑N

n=1 µ∗n

The first term in the numerator,∑N

n=1

[p( 1

µ∗n)− p( 1

µ)]≤ 0, since p( 1

µ) is concave in µ; and

the last term in the numerator 1N

∑Nn=1 µ

∗2n − µ2 ≥ 0, which is essentially the Cauchy-Schwartz

inequality. Thus,

V ∗N − V ∗1 ≤−hS

N

∑Nn=1

[(N − n+ 1

2)µ∗n − N2 µ)

]∑Nn=1 µ

∗n

=hS

N

(∑Nn=1 nµ

∗n∑N

n=1 µ∗n

− N + 12

)

=hS

N

(2µ∗1 + 4µ∗2 + · · ·+ 2Nµ∗N − (N + 1)(µ∗1 + · · ·+ µ∗N )

2∑N

n=1 µ∗n

)

=hS

N

((N − 1)(µ∗N − µ∗1) + (N − 3)(µ∗N−1 − µ∗2) + . . .

2Nµ

)16

where in the last line, the series ends with µ∗N/2+1−µ∗N/2 ifN is even, and ends with 2

(µ∗(N+3)/2−

µ∗(N−1)/2

)if N is odd.

Applying Lemma 3.1 and the identity:

(N − 1)2 + (N − 3)2 + · · ·+ (N + 1− 2⌊N

2

⌋)2 =

(N − 1)N(N + 1)6

,

we have

V ∗N − V ∗1 ≤ hS2

µ(σ2 + B

h

) ((N − 1)2 + (N − 3)2 + · · ·+ (N + 1− 2bN2 c)2

2N3

)

=hS2

µ(σ2 + B

h

) (N − 1)N(N + 1)12N3

=hS2

(1−N−2

)12µ

(σ2 + B

h

) .The bound in the above proposition indicates that the N -price policy cannot improve much

over the single-price policy when the replenishment quantity and the holding cost are low and

the demand variability is high.

Also note that this bound does not rely on the interior optimum assumption. It will be used

in the next section to derive a bound on improvement for the joint pricing and replenishment

strategy.

3.3 Joint Pricing-Replenishment Optimization

3.3.1 One Price

In this case, the firm’s problem is

maxS>0, λ∈L

V (S, λ) = r(λ)− hS

2− λa(S)− hσ2

2λ. (19)

The first-order conditions are given by (9) and (12). The difficulty is, V (S, λ) may not be

concave in (S, λ), and it may even have multiple local maxima (see Example 3.7 below). How-

ever, under linear demand and linear replenishment cost, a solution satisfying the first order

conditions and yielding positive profit must be the global optimum, as demonstrated below.

Proposition 3.6 Suppose λ(p) = α− βp, c(S) = K + cS, where α, β,K, c > 0. If S∗ > 0 and

λ∗ ∈ (0, α] satisfy the first-order conditions in (9) and (12), and V (S∗, λ∗) > 0, then (S∗, λ∗)

solves problem (19).

17

Proof. From Example 3.1, the optimal replenishment level for each λ is S∗(λ) =√

2Kλ/h.

We replace S in the objective by S∗(λ), and then find the optimal λ. Let

W (λ) := V (S∗(λ), λ) = (α− λ)λ/β − cλ−√

2Khλ− hσ2

2λ.

We prove that λ∗ satisfying W ′(λ∗) = 0 and W (λ∗) > 0 is the global maximizer.

Let f(λ) := (α − λ)λ/β − cλ −√

2Khλ. It is easy to see that f(0) = 0, f ′(0) = −∞ and

f ′′(λ) = − 2β +

√hK8λ3 . Let λ be the inflection point, i.e., f ′′(λ) = 0. Then, f(λ) starts decreasing

from zero, is convex in [0, λ] and concave in [λ,∞).

Suppose f ′(λ) ≤ 0, then by convexity/concavity, f ′(λ) ≤ f ′(λ) ≤ 0 for all λ ≥ 0. Therefore,

W (λ) < f(λ) ≤ f(0) = 0 for all λ ≥ 0, so there exists no maximizer with positive profit in this

case. Hence, we must have f ′(λ) > 0.

Next, let λ1 be the local minimizer for f(λ) in the convex part of the function. Obviously,

λ1 < λ. First, λ∗ 6∈ [0, λ1] because in this region W (λ) < f(λ) ≤ 0. Second, λ∗ 6∈ [λ1, λ] because

in this region W ′(λ) = f ′(λ) + hσ2

2λ2 > 0. Hence, λ∗ ∈ [λ, α]. (λ < α in order for λ∗ to exist.)

Since W (λ) = f(λ) − hσ2

2λ is strictly concave in [λ, α], λ∗ is the unique maximizer for W (λ) in

[λ, α]. It is the global maximizer since all other local maximizers (if any) must be within [0, λ1],

the region where W (λ) < 0.

The following example illustrates multiple local maxima and the above proposition.

Example 3.7 Consider linear demand λ(p) = 50−p, linear replenishment cost c(S) = 500+2S,

σ = 0.2 and h = 1. The optimal S for any fixed λ is given by S(λ) = 10√

10λ. Substituting

this EOQ into the first-order condition (12) gives

50− 2λ+1

50λ2=

5√

10√λ

+ 2.

However, simply solving the above equation yields three stationary points: λ1 = 0.01619,

λ2 = 0.1010, and λ3 = 22.327. Using the second-order condition, it can be verified that λ1 and

λ3 are local maxima, while λ2 is a local minimum. Furthermore, V (S(λ1), λ1) = −4.482 and

V (S(λ3), λ3) = 423.8. Hence λ∗ = λ3 = 22.327 and S∗ = 149.42.

The monotonicity of the joint optimum is explored in the following proposition (compared

with Proposition 3.1 and 3.2 where a single decision is optimized).

Proposition 3.7 If there is a unique optimal price and replenishment level, then

(i) the optimal price is decreasing in σ;

18

(ii) the optimal replenishment level is increasing in σ and decreasing in h;

(iii) the optimal profit is decreasing in σ and h.

Proof. (i) The joint optimization problem can be solved sequentially. We first optimize S for

each fixed λ > 0 (same as Section 3.1). From Proposition 3.1, the optimal S is invariant to σ,

and we denote it by S∗(λ). Then, the optimal λ can be found by

maxλ∈L

V (λ, σ) := r(λ)− hS∗(λ)2

− λa(S∗(λ))− hσ2

2λ.

Now, V (λ, σ) is supermodular in (λ, σ) since

∂2V

∂λ∂σ=hσ

λ2≥ 0,

and therefore λ∗(σ) is increasing in σ, or equivalently, the optimal price is decreasing in σ.

(ii) That S∗ is increasing in σ follows immediately from (i) and Proposition 3.1 (i). To examine

the effect of h on S∗, we first optimize λ for each fixed S > 0 (same as Section 3.2), and denote

the maximizer by λ∗(S, h). Then, the optimal S is determined by

maxS>0

V (S, h) := r(λ∗(S, h))− hS

2− λ∗(S, h)a(S)− hσ2

2λ∗(S, h).

Let λ∗S , λ∗h and λ∗Sh denote the partial derivatives. We have

∂2V

∂S∂h= r′′λ∗Sλ

∗h + r′λ∗Sh −

12− aλ∗Sh − a′λ∗h +

σ2

2λ∗2λ∗S −

hσ2

λ∗3λ∗Sλ

∗h +

hσ2

2λ∗2λ∗Sh

= λ∗S

(r′′λ∗h +

σ2

2λ∗2− hσ2

λ∗3λ∗h

)− 1

2− a′λ∗h +

(r′ − a+

hσ2

2λ∗2)λ∗Sh

Since λ∗(S, h) is uniquely determined by (12), the last term in the above is zero, and

λ∗S =a′

r′′ − hσ2

λ∗3

, λ∗h =− σ2

2λ∗2

r′′ − hσ2

λ∗3

,

Then, the first term in the above is also zero, and we have

∂2V

∂S∂h= −1

2− a′λ∗h = −1

2+

a′ σ2

2λ∗2

r′′ − hσ2

λ∗3

Now we show that the above is less than zero if evaluated at S∗. This is because S∗ satisfies

(9), which implies a′(S∗) = − h2λ∗ , and

∂2V

∂S∂h= −1

2+

hσ2

4λ∗3

−r′′ + hσ2

λ∗3

< −12

+hσ2

4λ∗3

hσ2

λ∗3

< −14.

19

Hence, V (S, h) is submodular in (S, h) in the neighborhood of the optima, and therefore, S∗(h)

is decreasing in h.

(iii). Let σ1 ≤ σ2, and the corresponding maximizers be (S∗1 , λ∗1) and (S∗2 , λ

∗2). We have

V (S∗1 , λ∗1, σ1) ≥ V (S∗2 , λ

∗2, σ1) ≥ V (S∗2 , λ

∗2, σ2), where the first inequality is due to the maximality

of (S∗1 , λ∗1) and the second follows immediately from the objective function. The proof of

decreasing in h is completely analogous.

Remark: When the optimal solutions are not unique, the results in the above proposition

continue to hold (except the monotonicity of the optimal replenishment level in h). In lieu of

increasing and decreasing, the relevant properties are ascending and descending, respectively.

Refer to Topkis (1978).

Intuitively, the higher the unit holding cost h, the less inventory should be held; and the

less order quantity means the higher average cost and thus the higher price. But this intuition

is only partially correct — when the demand is deterministic (σ = 0). To see this, note that

when σ = 0, the optimal λ is given by maxλ

r(λ)−λa(S∗(h)), i.e., λ∗ depends on h only through

S∗. But we know S∗ is increasing in h, while r(λ)− λa(S∗) is supermodular in (S∗, λ). Hence,

the price is increasing in h. However, when σ > 0, a lower price may offset the additional

holding cost due to demand variability; the composite effect is mixed, as shown in the following

example.

Example 3.8 Let λ(p) = αe−p with α > 0. Let c(S) = S(K − logS) for 0 ≤ S ≤ eK−1, where

K > 0 is given. Note that r(λ) = λ log(α/λ) is strictly concave for λ ∈ (0, α], c(S) is strictly

increasing for S ∈ [0, eK−1] and a(S) = K − logS is strictly convex and approaches to infinity

as S → 0, so Assumptions 1 and 2 are satisfied.

In this case, the first-order conditions (9) and (12) become

logα− log λ− 1 = K − logS − hσ2

2λ2and − 1

S= − h

2λ,

which determine the optimal solution:

λ∗ =σ

2

√2h

K + 1 + log(h/2α)and S∗ =

2λ∗

h.

(The above is indeed a global optimal solution when λ∗ < α, and S∗ ≤ eK−1; the latter is

satisfied if we choose K large enough.) Now, the partial derivative,

∂(λ∗2)∂h

=σ2(K + log(h/2α))

2(K + 1 + log(h/2α))2

20

varies from negative to positive, as h increases from zero. Thus, λ∗ first decreases and then

increases in h, or equivalently, p∗ first increases and then decreases in h.

Finally, we compare the joint optimization here with the sequential optimization scheme

that is usually followed in practice: the marketing/sales department first makes the pricing

decision, and then the purchase department decides the replenishment quantity based on de-

mand projection as a consequence of the pricing decision. For instance, the marketing/sales

department solves the problem:

λ∗ = arg maxλ{r(λ)},

and sets p∗ = p(λ∗). Then, the purchase department takes λ∗ as given and solves the problem:

S∗ = arg minS

{λ∗a(S) + hS/2

}.

Clearly, the sequential decision procedure does not take demand variability into account. The

optimal price and inventory level found by the sequential scheme certainly satisfy the first-order

condition in (9). But the first-order condition in (12) holds only when the demand variability

happens to be σ = λ∗√

2a(S∗)/h.

Corollary 3.8 Comparing to the joint optimization,

(i) if σ < σ, then the sequential decision underprices and overstocks;

(ii) if σ > σ, then the sequential decision overprices and understocks.

The proof is a straightforward application of the monotonicity result in Proposition 3.7, and will

be omitted. In general, the sequential optimization is sub-optimal. If the demand variability is

far away from σ, the sequential decision can lead to significant profit loss, as illustrated in the

following examples.

Example 3.9 Let λ(p) = 20 − p, c(S) = 100 + 5S and h = 1. Suppose the marketing/sales

department sets price at p∗ = 10 in order to maximize the revenue rate. Then, the purchase

department minimizes the operating cost using the EOQ model: S∗ = 20√

5 ≈ 44.7. The

threshold σ = λ∗√

2a(S∗)/h ≈ 38.

Figure 4: Sequential decision vs. joint decision

In Figure 4 we compare the sequential decision with the joint decision. The parameter range

is chosen such that the best decision will be able to achieve a positive profit. Substantial profit

21

loss is observed: when σ = 0, the sequential decision both underprices and overstocks by 28%,

resulting 73% profit loss compared to the joint decision; when σ = 10, the sequential decision

underprices by 25% and overstocks by 22%, making almost no profit.

3.3.2 N Prices

The problem is

maxS>0, µ∈MN

V (S,µ) =

∑Nn=1

[p( 1

µn)− hS

N (N − n+ 12)µn − hσ2

2 µ2n − a(S)

]∑N

n=1 µn

. (20)

As in the one-price case, the objective function V (S,µ) may not be jointly concave. Under the

interior optimum assumption, the optimal solution must satisfy the first-order conditions in (9)

and (15).

The result in Proposition 3.3 (which holds for any replenishment level) continues to hold

here, i.e., p∗1 ≤ p∗2 ≤ · · · ≤ p∗N . However, the monotonicity in Proposition 3.4 and Proposition 3.7

need not hold here, as evident from the following example.

Example 3.10 Let p(λ) = 10 − 10−3λ + λ−1, c(S) = 50 + S2 and h = 0.2. (Note the term

λ−1 in p(λ) only adds a constant to the objective function to ensure that the average profit is

positive.) Consider N = 2. The optimization problem is:

maxµ1,µ2,S

p( 1µ1

) + p( 1µ2

)− hS4 (3µ1 + µ2)− hσ2

2 (µ21 + µ2

2)− 2a(S)

µ1 + µ2.

The optimal solutions are plotted in Figure 5. Two observations emerge from the figure. First,

Figure 5: Impact of demand variability on optimal solutions

there exist jumps in the optimal objective function, due to multiple local maxima. For example,

for σ = 0.243, there are at least two local maximizers: (S = 4.917, µ1 = 0.401, µ2 = 41.51) and

(S = 4.464, µ1 = 5.637, µ2 = 43.44). The first yields an objective value V = 0.2639, which is

slightly higher than the second one (V = 0.2638). For σ = 0.244, the two local maximizers are

slightly different: (S = 4.918, µ1 = 0.448, µ2 = 41.33) and (S = 4.425, µ1 = 6.248, µ2 = 43.41).

However, the objective value corresponds to the second one (V = 0.261908) is slightly better

than the first (V = 0.261903). Hence, the optimal objective value exhibits discontinuity when

σ varies from 0.243 to 0.244. (These numerical results are accurate to all the decimal places

used. Furthermore, these phenomena can also be verified analytically.) Our second observation

22

is that when the optimal solutions are continuous in σ, there exists a range in which both µ1

and µ2 are increasing in σ, while S is decreasing in σ. This is in sharp contrast with the results

in the single-price case.

Next, we develop a bound on the profit improvement as the number of prices (N) increases.

Proposition 3.9 Let (S∗N ,µ∗) be the optimal joint pricing-replenishment decision, and V ∗N be

the corresponding optimal profit in (20). Then,

V ∗N − V ∗1 ≤hS∗N

2(1−N−2

)12µ

(σ2 + BN

h

) ,where µ = 1

N

∑Nn=1 µ

∗n and BN = inf

{−

d2p( 1µ

)

dµ2 : µ ∈ [µ∗1, µ∗N ]}.

Proof. Let V ∗N (S) denote the optimal profit when S is given (i.e., pricing decision only). If S

happens to be fixed at S∗N , then the profit is V ∗N , i.e., V ∗N = V ∗N (S∗N ). Applying Proposition 3.5,

we have the desired bound immediately:

V ∗N − V ∗1 = V ∗N (S∗N )− V ∗1 (S∗1) ≤ V ∗N (S∗N )− V ∗1 (S∗N ) ≤hS∗N

2(1−N−2

)12µ

(σ2 + BN

h

) .

The shortfall of the above bound is that it involves the solution to the N -price problem.

Heuristically, we can use the solution to the joint single-price and replenishment problem,

denoted by (S∗1 , µ∗), in the upperbound. Specifically, replace S∗N by S∗1 , and let B1 = − d2p

dµ2 ( 1µ∗ ).

That is,

V ∗N − V ∗1heur≤ hS∗1

2

16µ∗(σ2 + B1

h

) , (21)

whereheur≤ means “heuristically less than”.

Example 3.11 Let c(S) = K + cS and p(λ) = a− bλ, where a, b, c,K are all positive param-

eters. The optimization problem in (20) becomes

maxS>0, µ∈[b/a,∞)N

V (S,µ) =

∑Nn=1

[a− b

µn− hS

N (N − n+ 12)µn − hσ2

2 µ2n − K

S − c]

∑Nn=1 µn

.

23

Applying a change of variables, µ = bµ and S = KS, we can rewrite the above problem as

follows:

maxS>0, µ∈[1/a,∞)N

V (S, µ) =

∑Nn=1

[a− c− 1

µn− KhbS

N (N − n+ 12)µn − hb2σ2

2 µ2n − 1

S

]b∑N

n=1 µn

.(22)

Clearly, the above expression indicates that there are four degrees of freedom in terms of

independent parameters: (N, a− c, Khb, hb2σ2). Specifically, the four degrees of freedom are

determined by N , either a or c, and two from (K,h, b, σ). In the numerical studies reported

here and below, we choose to vary (N, c, h, σ) while fixing (K, a, b).

Figure 6: The effect of N , the number of price changes.

Figure 6 shows the optimal replenishment levels (Figure 6(a)), prices (Figure 6(b)) and

profit values (Figure 6(c)) corresponding to different values of N , the number of price changes.

We see that as N increases the optimal prices are inter-leaved (e.g., the optimal single price

is sandwiched between the two-price solutions, which, in turn, are each sandwiched between a

neighboring pair of the three-price solutions). (This inter-leaving property seems to persist in

all examples we have studied.) Figure 6(c) shows that N has a decreasing marginal effect on

profit. Using two prices already achieves most of the potential profit improvement, and beyond

N = 8, the marginal improvement is essentially nil.

Example 3.12 Here we continue with the last example, but focus on comparing the optimal

profits under N = 1 and N = 8. We consider the parameter values c ∈ (0, 30], h ∈ (0, 50], σ ∈(0, 50], while fixing the others at K = 100, a = 50, b = 1.

The results reported in Figure 7 are for c = 1. Similar results are observed for c =

5, 10, 20, 30.

Figure 7: Multiple prices vs. a single price

Figure 7(a) shows the optimal profit corresponding to a single price. The profit is clearly

decreasing in h and σ (Proposition 3.7). Furthermore, the negative effect of σ is larger when h

is larger and vice versa, suggesting that the profit function is submodular in (h, σ). Intuitively,

since the effect of σ shows up in the profit function through the additional holding cost, the

higher the h (resp. σ) value, the more sensitive is the profit to σ (resp. h).

Figure 7(b) shows the absolute improvement in profit when using 8 prices (V ∗8 − V ∗1 ). The

improvement is increasing in h and decreasing in σ. Intuitively, as the inventory holding cost

24

increases, the right trade-off among revenue, holding cost and replenishment cost becomes more

important, thus more pricing options over time is more beneficial. However, pricing becomes

less effective as demand variability (σ) increases.

Note that when the profit under a single price V ∗1 approaches zero, the absolute improvement

(V ∗8 −V ∗1 ) does not diminish. In particular, while V ∗1 decreases in h, the improvement increases

in h. Indeed, the relative improvement (= V ∗8 −V ∗

1V ∗1

) is increasing in h, and approaching infinity

when V ∗1 is close to zero, as demonstrated in Figure 7(c) and (d).

Figure 8: The bound on profit improvement

To conclude this example, we show in Figure 8 the profit improvement in comparison with

the heuristic bound in (21). The latter can be seen in this example as giving a good estimate

of the maximum potential improvement.

3.3.3 The Algorithm: Fractional Programming

To solve the fractional optimization problem in (20), we can instead solve the following:

maxS>0, µ∈MN

Vη :=N∑

n=1

[p( 1µn

)−hS(N − n+ 1

2)N

µn −hσ2

2µ2

n − a(S)

]− η

N∑n=1

µn, (23)

where η is a parameter such that when the optimal objective value of the problem in (23) is

zero, the corresponding solution is the optimal solution to the original problem in (20), and the

corresponding η is the optimal objective value of the original problem. To see the equivalence,

write the optimal value in (20) as V ∗ = A∗/B∗. Then, when η = V ∗, the optimal value of (23)

is zero. On the other hand, suppose there exists an η which yields a zero objective value in

(23), specifically, A∗ − ηB∗ = 0, then for any feasible A,B, we have A− ηB ≤ 0, which implies

A/B ≤ η = A∗/B∗.

The algorithm described below takes advantage of the separability of the objective function

with respect to µ when S and η are given. Specifically, the sub-problems are:

maxµn∈M

gn(µn) := p( 1µn

)−hS(N − n+ 1

2)N

µn −hσ2

2µ2

n − a(S)− ηµn, n = 1, . . . , N. (24)

Under Assumption 1, the objective in (24) is concave in µn.

Algorithm for solving (20)

1. Initialize η = η0, S = S0, and µn = µ for n = 1, . . . , N.(For instance, η0 = 0; S0 = S∗ and µn = µ∗ for all n,with (S∗, µ∗) being the optimal solution to the single-price problem.)Set ε and δ at small positive values (according to required precision).

25

2. Solve the N single-dimensional concave function maximization problems in (24).

3. Update S following a specified stepsize or use equation (9).If the difference between the new and the old S values is smaller than ε,go to step 4; otherwise, return to step 2.

4. If |Vη| ≤ ε, stop. Otherwise,if Vη > ε, increase η to η + δ;if Vη < −ε, decrease η to η − δ.Go to step 2.

In step 2, there are ways to accelerate the search procedure. Firstly, in searching the optimal

µ∗n for n = 1, . . . , N , we can take the advantage of the monotonicity of µ∗n in n following

Proposition 3.3. Secondly, we have gn(µ∗n) ≤ gn+1(µ∗n) ≤ gn+1(µ∗n+1), for n = 1, . . . , N − 1.

Hence, once we find µ∗1, if g1(µ∗1) > ε/N , then we can be sure that Vη > ε, and to bypass the

rest of the algorithm to increase η directly and proceed to the next loop. Thirdly, we can make

use of the monotonicity of µ∗n(S) in S according to Proposition 3.4 (iii). This is particularly

useful when the stepsize for updating S in step 3 is small.

In step 3, if we are in the early stage of the algorithm, i.e., when Vη is still substantially

away from zero, then the updating of S needs to cover a wide range so as not to miss the true

optimum. This can be done by using large stepsize first and then reduce them gradually. When

we are at the late stage of fine-tuning Vη, we can even bypass step 3 and simply use the same

S value (when it was last updated).

In step 3, we can use (9) as an updating scheme. This is in fact the coordinate assent method:

we alternate between optimizing µ for fixed S and optimizing S for fixed µ. This procedure

is guaranteed to converge to a local maximum, but not necessary the global maximum (see

Luenberger (1984)). So it is useful once the algorithm enters the region containing the true

optimum without other local maxima.

It can be verified that the optimal objective value in (23) is strictly decreasing in η, so there

is a unique zero-crossing point at which Vη is zero. Thus, in step 4, we can update η following

a standard line search algorithm, such as bi-section or the golden ratio.

4 Discounted Objective

We now turn to examining the problems in the last section under a discounted objective. We

will focus on the contrasts rather than the similarities between these two classes of models. So

as to present the results with minimal distraction, all proofs are relegated to the appendix.

26

Recall that period n refers to the period in which the price pn applies and the inventory

drops from Sn−1 = (N−n+1)SN to Sn = (N−n)S

N . Let γ > 0 be the discount rate. Let vn,γ(λn)

denote the expected profit over period n discounted to the beginning of the period. We have

vn,γ(λn) = E

[∫ τn

0e−γt

[p(λn)dD(t+ Tn−1)− hX(t+ Tn−1)dt

]]= E

[∫ τn

0e−γt

(p(λn) +

h

γ

)dD(t+ Tn−1) +

he−γτnSn

γ− hSn−1

γ

]=

(p(λn) +

h

γ

)λn

γ

(1− E[e−γτn ]

)+hE[e−γτn ]Sn

γ− hSn−1

γ

= f(λn) +hS(1−Θ(λn))

Nγn, (25)

where,

f(λ) =(p(λ) +

h

γ

)λγ

(1−Θ(λ)

)+hΘ(λ)S

γ− hS(N + 1)

Nγ,

Θ(λ) = E[e−γτn(λ)] = e−b(λ)S/N ,

b(λ) =

√λ2 + 2σ2γ − λ

σ2=

2γ√λ2 + 2σ2γ + λ

.

Here Lemma 2.1 is applied to derive the expression for Θ(λ). Assuming that the optimal λn is

finite, then Θ(λn) < 1. Let Vγ(S,λ) denote the discounted profit starting from zero inventory

under the policy (S,λ). Then,

Vγ(S,λ) =vγ(S,λ)

1−Θγ(S,λ), (26)

where,

vγ(S,λ) = v1,γ(λ1) + Θ(λ1)v2,γ(λ2) + · · ·+ Θ(λ1) . . .Θ(λN−1)vN,γ(λN )− c(S),

Θγ(S,λ) = Θ(λ1) . . .Θ(λN ) < 1.

For a single price, (26) becomes

Vγ(S, λ) =r(λ)γ− S(h/γ + a(S))

1− e−b(λ)S+hλ

γ2. (27)

Note in the above expressions vn,γ and Θ depend on S as well, but we have suppressed S to

simplify notation. For the same reason, we shall omit S in vγ(S,λ) and Θγ(S,λ) when S is not

a decision variable.

27

Optimal Replenishment

The problem is

maxS>0

Vγ(S, λ).

The first-order condition is

[h/γ + a(S) + Sa′(S)](ebS − 1) = bS[h/γ + a(S)]. (28)

Proposition 4.1 Vγ(S, λ) is concave in S if a′(S) < 0 and c′′(S) ≥ 0.

The condition a′(S) < 0 holds if S satisfies (28). Hence, the first-order condition in (28) is

sufficient for optimality if c′′(S) ≥ 0. We have also identified examples of multiple stationary

points when c′′(S) < 0.

Proposition 4.2 With price fixed,

(i) the optimal replenishment level S∗γ is increasing in σ, decreasing in h and p;

(ii) the optimal profit is decreasing in σ and h.

Recall, under the average objective and fixed price, the demand variability has no effect on

the optimal replenishment level. In contrast, the demand variability will raise the replenishment

level under the discounted objective. The following example further illustrates this point. Other

monotonicity properties in Proposition 3.1 continue to hold here.

Example 4.1 (EOQ under small discount rate) Suppose c(S) = K + cS. When γ is

small, b is also small, and the first order condition (28) can be approximately written as

(h/γ + c)(bS + 12b

2S2) = bS(h/γ +K/S + c).

Then,

S∗γ =

√2Kγ

b(h+ cγ).

Approximating b by Taylor series: b ≈ γλ −

σ2γ2

2λ3 , we have

S∗γ ≈

√2K(λ+ σ2γ

2λ )h+ cγ

.

Porteus (1985b) pointed out that the EOQ solution under a discounted objective can be ap-

proximated by√

2Kλh+cγ . Our result generalizes this result by incorporating the effect of demand

variability.

28

The Single-Price Problem

The problem is

maxλ∈L

Vγ(S, λ)

It is immediate to show that Vγ(S, λ) is concave in λ, and hence the optimal single price is

given by the first-order condition,

r′(λ) +hS + γc(S)

(1− e−b(λ)S)2Se−b(λ)Sb′(λ) +

h

γ= 0. (29)

Proposition 4.3 With the replenishment level fixed at S, let the optimal price be p∗. Then

(i) r′(λ(p∗)) ≤ a(S);

(ii) p∗ is decreasing in h;

(iii) p∗ is decreasing in S for S satisfying a′(S) ≤ 0;

(iv) the optimal profit is decreasing in σ and h.

Most results in Proposition 3.2 continue to hold here. The only difference is that the optimal

price is decreasing in demand variability under the average objective, while here the effect of

demand variability is rather unclear. Numerically, we have observed that in most cases the

optimal price is still decreasing in σ under the discounted objective, but exceptions do exist.

The pricing-replenishment joint optimization lacks the monotonicity properties of the opti-

mal decisions in general.

The N-Price Problem

The problem is

maxλ∈LN

Vγ(S,λ) (30)

This problem has an equivalent dynamic programming formulation. Let Vn,γ denote the optimal

discounted profit-to-go starting from the beginning of period n. Following the principle of

optimality, we have the following recursions:

V1,γ = maxλ∈L

{−c(S) + v1,γ(λ) + Θ(λ)V2,γ} ,Vn,γ = max

λ∈L{vn,γ(λ) + Θ(λ)Vn+1,γ} , n = 2, . . . , N − 1,

VN,γ = maxλ∈L

{vN,γ(λ) + Θ(λ)V1,γ} .(31)

The following parallels Proposition 3.3.

29

Proposition 4.4 For any fixed replenishment level S, the optimal prices under the discounted

objective are increasing over the periods, i.e., p∗1 ≤ p∗2 ≤ ... ≤ p∗N .

A Fixed-Point Algorithm

Instead of searching in LN for the optimal λ, we provide an iterative algorithm with each step

solving a one-dimensional problem in L.

A Fixed-Point Algorithm for the dynamic program (31)

1. Set an arbitrary initial value V o1,γ, and set ε > 0 small.

2. Use V o1,γ in the last equation of (31), and solve for VN,γ , VN−1,γ , . . . , V1,γ recursively

following (31).

3. If |V o1,γ − V1,γ | < ε, stop. Otherwise, let V o

1,γ ← V1,γ, and go to Step 2.

To explore the convergence of the algorithm, we substitute the equations for V2,γ , . . . , VN,γ

in (31) recursively into the first equation and obtain the following:

V1,γ = maxλ∈LN

{vγ(λ) + Θγ(λ)V1,γ

}≡ max

λ∈LNψ(λ, V1,γ) ≡ Ψ(V1,γ). (32)

Proposition 4.5 Let ψ and Ψ be defined as above, then

(i) the mapping Ψ(V ) is increasing and Lipschitz continuous with modulus 1;

(ii) there exists a unique fixed point satisfying V ∗ = Ψ(V ∗) = ψ(λ∗, V ∗). Furthermore, λ∗

solves (30) with the maximum discounted profit being V ∗; and

(iii) The above algorithm for solving (31) converges to the optimal solution from any initial

value V o1,γ.

Joint Pricing-Replenishment

In general, when N prices are optimized jointly with the replenishment level S, we can use

the iterative algorithm developed above for each fixed S, and then do a line search to find the

optimal S.

Convergence to the Average Objective

The discounted profit converges to the average profit in the following sense.

Proposition 4.6 Let vn,γ and Vγ be defined as (25) and (26), and let vn and V follow (6) and

(7) under the average objective. Then,

limγ→0

vn,γ = vn and limγ→0

γVγ = V.

30

As shown in the appendix, the first-order conditions in (28) and (29) converge to their

counterparts in (9) and (12), respectively. This implies that the optimal pricing-replenishment

decisions under the discounted objective converge to those under the average objective, notwith-

standing the contrasts highlighted above.

Proposition 4.7 Let (S∗γ , p∗γ) and (S∗, p∗) denote, respectively, the optimal solution under the

discounted and the average objectives. Then, we have

limγ→0

S∗γ = S∗ and limγ→0

p∗γ = p∗.

5 Concluding Remarks

We have demonstrated the effectiveness of the Brownian motion model for price-sensitive de-

mand, in particular in making optimal pricing and replenishment decisions, quantifying the

profit improvement, and bringing out the impact of demand variability. The Brownian mo-

tion model also enables us to work out explicit analytical solutions, which facilitate making

connections to and comparisons against, wherever applicable, previously known results in both

deterministic and stochastic settings.

Our models can be extended to allow backlogging, in which case the replenishment takes

the form of a (−s, S) policy: a replenishment order is issued whenever the backlog has reached

s, to bring the inventory level back to S. (Hence, the replenishment quantity is S + s. Assume

zero leadtime as before.) The pricing policy is modified accordingly: equally divide S and s

into N and M (positive integers) segments, respectively, such that

S = S0 > S1 > · · · > SN = 0 > SN+1 > · · · > SN+M = −s,

and apply price pn until the the net inventory (inventory on hand net the backlogs) falls to Sn,

n = 1, . . . , N +M .

Most of the results we have derived for the no-backlog case will continue to hold, with

suitable modifications. For instance, the optimal pricing will satisfy the monotonicity:

p∗1 ≤ p∗2 ≤ · · · ≤ p∗N , p∗N+1 ≥ p∗N+2 ≥ · · · ≥ p∗N+M .

That is, the optimal prices increase as the on-hand inventory is depleted, and decrease as the

backlog increases. The bound on the profit improvement is (assuming M = N):

V ∗N − V ∗1 ≤(1−N−2

)12

[hS2

µh

(σ2 + B

h

) +bs2

µb

(σ2 + B

b

)] ,31

where µh = 1N

∑Nn=1 µ

∗n, µb = 1

N

∑2Nn=N+1 µ

∗n, and B denotes, as before, the lower bound for

−d2p( 1

µ)

dµ2 over the spread of µ∗.

6 Appendix

Proof of Lemma 2.1. (This lemma follows from well-known results regarding optional

stopping applied to Brownian motion; refer to, e.g., Karlin and Taylor (1975), and Ross (1996).

The proof here is included for completeness.) Omit the subscript n, and write T := τn, x :=

T/N . We have

λT + σB(T ) = x. (33)

This, combined with EB(T ) = 0, which follows from applying optional stopping to B(t), leads

to (3).

To derive the second moment of T , we have

σ2[B2(T )− T ] = (x− λT )2 − σ2T.

Applying optional stopping again, to the martingale B2(t)− t, we have

E[(x− λT )2] = σ2E(T ),

which can be expressed as

λ2Var(T ) = σ2E(T ).

This establishes (4).

More generally, consider E(e−γT ), where γ > 0 is a constant (discount rate). Write

−γT = aB(T )− 12a2T − bx, (34)

where a and b are parameters to be determined. Then, applying optional stopping to the

exponential martingale, exp[aB(t)− 12a

2t], we have

E(e−γT ) = e−bx · E[eaB(T )− 12a2T ] = e−bx.

The power b can be derived as follows. Substituting (33) into the right hand side of (34), we

have

−γT = aB(T )− 12a2T − bσB(T )− bλT.

32

Equating the coefficients of T and B(T ) on both sides yields:

a = bσ,12a2 + bλ = r,

leading to

σa2 + 2λa− 2γσ = 0.

Hence,

a =

√λ2 + 2σ2γ − λ

σ;

and

E(e−γT ) = e−bx = exp[−√λ2 + 2σ2γ − λ

σ2x].

Proof of Lemma 3.1. First, assuming interior optimum. The first-order conditions in (15)

hold, which imply that

dvm(µm)dµm

=dvn(µn)dµn

,

or equivalently,

dpdµ( 1

µ∗m)− dp

dµ( 1µ∗n

) + hSN (m− n) + hσ2(µ∗n − µ∗m) = 0. (35)

By Proposition 3.3, µ∗n ≥ µ∗m for n > m. Next, applying the mean-value theorem and using the

definition of B, we have

dpdµ( 1

µ∗m)− dp

dµ( 1µ∗n

) = − d2pdµ2 (1

ζ )(µ∗n − µ∗m) ≥ B(µ∗n − µ∗m), (36)

where ζ ∈ [µ∗m, µ∗n]. Combining (35) and (36), we have

hSN (m− n) + (hσ2 +B)(µ∗n − µ∗m) ≤ 0,

or

µ∗n − µ∗m ≤ S(n−m)N(σ2 + B

h

) .Next, we show the above result holds even when the optimum is on the boundary. If

µ∗m = µ∗n, then the desired inequality obviously holds. If µ∗m < µ∗n, consider the following

33

alternative pricing policy: (µ∗1, . . . , µ∗m + δ, . . . , µ∗n − δ, . . . , µ∗N ), where 0 < δ < µ∗n−µ∗m

2 . Since

this alternative cannot be better than the optimum, we have

p( 1µ∗m+δ ) + p( 1

µ∗n−δ )− hSN

[(N −m+ 1

2)(µ∗m + δ) + (N − n+ 12)(µ∗n − δ)

]−hσ2

2

[(µ∗m + δ)2 + (µ∗n − δ)2

]≤ p( 1

µ∗m) + p( 1

µ∗n)− hS

N

[(N −m+ 1

2)µ∗m + (N − n+ 12)µ∗n

]− hσ2

2

[µ∗2m + µ∗2n

].

Taylor’s expansion with straightforward algebra simplifies the inequality to

dpdµ( 1

µ∗m+ζ1δ )δ − dpdµ( 1

µ∗n−ζ2δ )δ + hSN δ(m− n) + hσ2δ(µ∗n − µ∗m − δ) ≤ 0,

where ζ1, ζ2 ∈ [0, 1]. Dividing both sides by δ, and then letting δ → 0, we have

dpdµ( 1

µ∗m)− dp

dµ( 1µ∗n

) + hSN (m− n) + hσ2(µ∗n − µ∗m) ≤ 0. (37)

Combining (36) and (37) yields the desired inequality.

Proof of Lemma 3.2. We only prove part (i); the proof of part (ii) and (iii) is completely

analogous. Consider an equivalent problem to the fractional problem (13):

maxµ

N∑n=1

[p( 1µn

)−hS(N − n+ 1

2)N

µn −hσ2

2µ2

n − a(S)

]− η(σ)

N∑n=1

µn,

where η(σ) is the optimal value of (13) when the demand variability is σ and all the other

parameters are fixed. To solve the above problem, we can maximize each µn separately:

maxµn

gn(µn, σ) = p( 1µn

)−hS(N − n+ 1

2)N

µn −hσ2

2µ2

n − η(σ)µn.

Since µ∗(σ) is in the interior, applying the envelope theorem, we have

dη

dσ= −

hσ∑N

k=1 µ∗k2∑N

k=1 µ∗k

.

To establish the desired monotonicity, it suffices to establish the submodularity of gn(µn, σ)

(refer to Topkis 1978):

∂2gn(µ∗n, σ)∂µn∂σ

= −2hσµ∗n −∂η

∂σ

= −2hσµ∗n +hσ∑N

k=1 µ∗k2∑N

k=1 µ∗k

.

=hσ∑Nk=1 µ

∗k

(N∑

k=1

µ∗k2 − 2µ∗n

N∑k=1

µ∗k

),

34

If∑µ∗k

2−2µ∗n∑µ∗k < 0, then the objective gn(µn, σ) is submodular in (µn, σ) in a neighborhood

of (µ∗n, σ). Thus, µ∗n is (locally) decreasing in σ. If the converse is true, µ∗n is (locally) increasing

in σ.

Proof of Proposition 4.1. Recall

Vγ(S, λ) =r(λ)γ− S(h/γ + a(S))

1− e−b(λ)S+hλ

γ2.

We first show that Vγ(S, λ) is concave in S if a′(S) ≤ 0 and c′′(S) ≥ 0. To this end, we derive

∂2Vγ

∂S2= −

(bSe−bS + 2e−bS + bS − 2)e−bSb(hγ + a(S))

(1− e−bS)3+

2bSe−bSa′(S)(1− e−bS)2

− c′′(S)1− e−bS

.

The first term on the right side is negative due to the fact that

h(x) := xe−x + 2e−x + x− 2 ≥ 0 for all x ≥ 0, (38)

since h(0) = h′(0) = 0 and h′′(x) = xe−x ≥ 0 for all x ≥ 0; the second term is negative since

a′(S) ≤ 0; the third term is also negative if c(S) is convex in S. Hence Vγ(S, λ) is concave in S

if a′(S) ≤ 0 and c′′(S) ≥ 0.

To show Vγ is concave in λ, it suffices to show that g(λ) := 11−e−b(λ)S is convex in λ. We

have,

g′′(λ) = g2e−bSS(b′

2S(2ge−bS + 1)− b′′

)=

g2e−bSS

λ2 + 2γσ2

(b2S(2ge−bS + 1)− 2γ√

λ2 + 2γσ2

),

where the last equality follows from b′ = − b√λ2+2γσ2

and b′′ = 2γ(λ2+2γσ2)3/2 . Since

b2S(2ge−bS + 1)− 2γ√λ2 + 2γσ2

> b2S(2ge−bS + 1)− 2b

= bg(bSe−bS + bS + 2e−bS − 2)

≥ 0,

where the last inequality is again due to (38), g(λ) is convex in λ, and hence, Vγ(S, λ) is concave

in λ.

Proof of Proposition 4.2.

35

(i). To show that S∗ is decreasing in p, it suffices to verify that Vγ(S, λ) is supermodular in

(S, λ); i.e., ∂2V∂S∂λ ≥ 0. Since the optimal replenishment level must satisfy a′(S) < 0 by (i), it

suffices to verify the supermodularity in the sub-lattice where a′(S) < 0.

We have

∂2Vγ(S, λ)∂S∂λ

= g2e−bSb′S[hγ

+ c′(S)−(hγ

+ a(S))(bS

1 + e−bS

1− e−bS− 1)].

Since b′(λ) < 0, and since a′(S) < 0 is equivalent to c′(S) < a(S), the desired ∂2Vγ

∂S∂λ ≥ 0 is

implied by

bS1 + e−bS

1− e−bS− 1 ≥ 1.

But this last inequality follows immediately from (38).

The proof of S∗ increasing in σ is completely analogous: we can follow the above to establish∂2Vγ

∂S∂σ ≥ 0 in the sub-lattice where a′(S) < 0; the only change is to replace b′(λ) by b′(σ). Note

in particular that b′(σ) < 0 too.

To show that S∗ is decreasing in h, note that Vγ is submodular in (S, h):

∂2Vγ

∂S∂h= −1− e−bS − bSe−bS

γ(1− e−bS)2≤ 0,

since 1− e−bS ≥ bSe−bS .

(ii). The proof is analogous to the proof of Proposition 3.1 (ii).


(i). From (29),

r′(λ) =hS + a(S)Sγ(1− e−bS)2

Se−bS(−b′)− h

γ

= a(S)e−bSS2γ

(1− e−bS)2b√

λ2 + 2σ2γ+

h√λ2 + 2σ2γ

bS2e−bS

(1− e−bS)2− h

γ

= a(S)e−bSS2b2

(1− e−bS)21

2− λγ b

+h

b√λ2 + 2σ2γ

(b2S2e−bS

(1− e−bS)2− b√λ2 + 2σ2γ

γ

)

≤ a(S)2− λ

γ b+

h

b√λ2 + 2σ2γ

(1− b

√λ2 + 2σ2γ

γ

)≤ a(S),

36

where the first inequality results from the following:

x2e−x

(1− e−x)2=

x2

ex + e−x − 2=

x2∑∞n=1 2x2n/(2n)!

≤ 1, (39)

and in the last inequality we used the fact that

γ√λ2 + 2σ2γ

≤ b =2γ√

λ2 + 2σ2γ + λ≤ γ

λ. (40)

(ii) To prove the monotonicity in h, we show that Vγ is supermodular in (λ, h):

∂2V

∂λ∂h=

S2e−bSb′

γ(1− e−bS)2+

1γ2

=1bγ2

[− e−bSS2b2

(1− e−bS)2γ√

λ2 + 2σ2γ+ b

]

≥ 1bγ2

[− γ√

λ2 + 2σ2γ+ b

]≥ 0,

where the first inequality follows from (39) and the last inequality follows from (40).

(iii) The monotonicity in S relies on the supermodularity of Vγ in (S, λ), which has been

established in the proof of Proposition 4.2 (ii).

(iv). The proof is analogous to the proof of Proposition 3.1 (iii).

Proof of Proposition 4.4. Let the optimal profit-to-go be V ∗n,γ , for n = 1, . . . , N , which

satisfy (32). For simplicity, denote V ∗N+1,γ := V1,γ . In view of (32) and (25), we can write

λ∗n ∈ arg maxλ∈L

{F (n, λ)}, for n = 1, . . . , N,

where

F (n, λ) = f(λ) +hS(1−Θ(λ))

Nγn+ Θ(λ)V ∗n+1,γ .

Thus, λ∗n is decreasing in n if we can show that F (n, λ) is submodular in (n, λ). To this end,

notice that

F (n, λ)− F (n− 1, λ) =hS(1−Θ(λ))

Nγ+ Θ(λ)(V ∗n+1,γ − V ∗n,γ)

=hS

Nγ+ Θ(λ)

(V ∗n+1,γ − V ∗n,γ −

hS

Nγ

), for n = 2, . . . , N.

37

Since Θ(λ) is decreasing in λ, the above difference is decreasing in λ if V ∗n+1,γ−V ∗n,γ ≥ hSNγ . This

last inequality can be verified as follows. For n = 2, . . . , N , if we start from the inventory levelS(N−n+1)

N , then the optimal profit-to-go is V ∗n,γ . Now consider the following strategy: always

keep SN units in stock while following the optimal policy as if the inventory starts at S(N−n)

N .

The profit associated with this strategy is V ∗n+1,γ − hSNγ , where hS

Nγ is the discounted cost of

holding the extra SN units. Since this strategy is sub-optimal, we have

V ∗n+1,γ − V ∗n,γ ≥ V ∗n+1,γ −(V ∗n+1,γ −

hS

Nγ

)=

hS

Nγ.

The submodularity of F (n, p) hence follows and λ∗n is decreasing in n; or equivalently, p∗n is

increasing in n.


(i). For V > V , let λ and λ be the maximizers (not necessarily in the interior) of ψ(·, V ) and

ψ(λ, V ), respectively (refer to (32)). Then,

Ψ(V )−Ψ(V ) = ψ(λ, V )− ψ(λ, V ) ≥ ψ(λ, V )− ψ(λ, V ) = Θγ(λ)(V − V ) ≥ 0,

which implies that Ψ(V ) is increasing in V . Furthermore,

Ψ(V )−Ψ(V ) = ψ(λ, V )− ψ(λ, V ) ≤ ψ(λ, V )− ψ(λ, V ) = Θγ(λ)(V − V ). (41)

Since Θγ(λ) ≤ 1, the above inequality implies that Ψ is Lipschitz continuous, with modulus no

larger than 1. That is, Ψ(V ) is non-expansive.

(ii). Let λ∗ be the optimal solution to (30) with optimal value V ∗. We eliminate the un-

interesting case where λ∗ = ∞. Thus, Θγ(λ∗) < 1 and V ∗ = vγ(λ∗)1−Θγ(λ∗) , or equivalently,

V ∗ = ψ(λ∗, V ∗). On the other hand, V ∗ ≥ vγ(λ)1−Θγ(λ) , or equivalently, V ∗ ≥ ψ(λ, V ∗) for all

λ ∈ LN . Thus, V ∗ = maxλ∈LN ψ(λ, V ∗) = Ψ(V ∗); and hence, V ∗ is a fixed point of Ψ. This

proves the existence.

Conversely, suppose V ∗ is a fixed point of Ψ(V ) with the corresponding maximizer λ∗. Note

that λ∗ =∞ implies that all inventory can be sold out instantaneously after the replenishment

(with the revenue covering exactly the replenishment cost). If we eliminate this uninteresting

case, then Θγ(λ∗) < 1, and V ∗ = ψ(λ∗, V ∗) or V ∗ = vγ(λ∗)1−Θγ(λ∗) . At the same time, V ∗ ≥ ψ(λ, V ∗)

or V ∗ ≥ vγ(λ)1−Θγ(λ) for all λ ∈ LN . Hence, λ∗ solves the problem in (30), achieving the optimal

value V ∗.

38

Now, suppose there exist two fixed points V ∗ > V ∗ corresponding to the maximizers λ∗ and

λ∗, respectively. Then, both V ∗ and V ∗ are optimal values for the problem in (30), an obvious

contradiction (against optimality). This establishes the uniqueness.

(iii). Let V ∗ be the unique fixed point. From (i), we have Ψ(V ) > V for V < V ∗ and Ψ(V ) < V

for V > V ∗. Hence, if we start from an initial value V0 < V ∗, the sequence of values generated

by the iteration Vn = Ψ(Vn−1) is increasing and bounded from above by V ∗. So the sequence

converges to a fixed point of the mapping Ψ, which must be V ∗. A similar argument holds for

an initial value V0 > V ∗.

Proof of Proposition 4.6. When γ → 0, via Taylor expansion, we can derive the following:

b(λ) =γ

λ− σ2γ2

2λ3+O(γ3), (42)

Θ(λ) = e−b(λ)S/N = 1− Sγ

Nλ+σ2Sγ2

2Nλ3+

S2γ2

2N2λ2+O(γ3). (43)

Applying (42) and (43), we have

vn,γ(λ) =(p(λ) +

h

γ

)( SN− σ2Sγ

2Nλ2− S2γ

2N2λ

)+hS

γ− hS2

Nλ− hS(N + 1)

Nγ+hS2n

N2λ+ o(1)

=p(λ)SN

−hS2(N − n+ 1

2)N2λ

− hσ2S

2Nλ2+ o(1)

= vn(λ) + o(1), as γ → 0.

Then,

γVγ(S,λ) =vγ(λ)

γ−1(1−Θγ(λ))=

∑n vn(λn)− c(S) + o(1)

γ−1(1−∏N

n=1(1−Sγ

Nλn))

=∑

n vn(λn)− c(S) + o(1)SN

∑n λ

−1n + o(1)

= V (S,µ) + o(1), as γ → 0.

Proof of Proposition 4.7. The first-order optimality equation for S∗γ in (28) can be written

as follows:

(h+ γc′(S))(Sγ

λ− σ2Sγ2

2λ3− S2γ2

2λ2) = (1− Sγ

λ)(γ

λ− σ2γ2

2λ3)[hS + γc(S)]

Ignoring higher-order terms, we can simplify the above to the equation in (9), which is the

first-order optimality condition for S∗.

39

Using (42) and (43), when γ → 0, the first-order condition in (29) can be approximated as

0 = r′(λ) +(h+ γa(S))S

S2γ2

λ2 (1− σ2γ2λ2 − Sγ

2λ )2S(1− Sγ

λ

)(− γ

λ2+

3σ2γ2

2λ4

)+h

γ

= r′(λ)−(hγ

+ a(S))(

1 +σ2γ

λ2+Sγ

λ

)(1− Sγ

λ

)(1− 3σ2γ

2λ2

)+h

γ

= r′(λ)− h

γ− a(S)− h

γ

(σ2γ

λ2+Sγ

λ− Sγ

λ− 3σ2γ

2λ2

)+h

γ

= r′(λ)− a(S) +hσ2

2λ2,

which is exactly the same as the first-order condition in (12) for the average objective.

References

[1] Chen, F.Y., S. Ray, and Y. Song. 2003. Optimal pricing and inventory control policy in

periodic-review systems with fixed ordering cost and lost sales. Working paper, Chinese

University of Hong Kong.

[2] Chen, H. and D.D. Yao. 2001. Fundamentals of Queueing Networks: Performance, Asymp-

totics and Optimization, Springer-Verlag, New York.

[3] Chen, X. and D. Simchi-Levi. 2003a. Coordinating inventory control and pricing strategies

with random demand and fixed ordering cost: the finite horizon case. Operations Research,

forthcoming.

[4] Chen, X. and D. Simchi-Levi. 2003b. Coordinating inventory control and pricing strategies

with random demand and fixed ordering cost: the infinite horizon case. Working paper,

M.I.T.

[5] Chen, X. and D. Simchi-Levi. 2003c. Coordinating inventory control and pricing strategies

with random demand and fixed ordering cost: the continuous review model. Working paper,

M.I.T.

[6] Csorgo, M. and L. Horvath. 1993. Weighted Approximations in Probability and Statistics,

Wiley, New York.

[7] Elmaghraby W. and P. Keskinocak. 2003. Dynamic pricing in the presence of inventory

considerations: research overview, current practices and future directions. Management

Science, 49 (10), 1287-1309.

40

[8] Federgruen, A. and A. Heching. 1999. Combined pricing and inventory control under un-

certainty. Operations Research, 47 (3), 454-475.

[9] Feng, Y. and F.Y. Chen. 2002. Joint pricing and inventory control with setup costs and

demand uncertainty. Working paper, Chinese University of Hong Kong.

[10] Feng, Y. and F.Y. Chen. 2003. Pricing and inventory control for a periodic-review system:

optimality and optimization of (s, S, p) policy. Working paper, Chinese University of Hong

Kong.

[11] Harrison, J.M. 1985. Brownian Motion and Stochastic Flow Systems, Wiley, New York.

[12] Harrison, J.M. 1988. Brownian models of queueing networks with heterogeneous customer

populations. In: Stochastic Differential Systems, Stochastic Control Theory and Applica-

tions, IMA Volumes in Mathematics and its Applications, volume 10, W. Flemming and

P.L. Lions (eds.), 147-186, Springer-Verlag, New York.

[13] Harrison, J.M. 2003. A broader view of Brownian networks. Annals of Applied Probability,

13 (3), 1119-1150.

[14] Karlin, S. and H. Taylor 1975. A First Course in Stochastic Processes (2nd ed.), Academic

Press, New York.

[15] Li, L. 1988. A stochastic theory of the firm. Mathematics of Operations Research, 13 (3),

447-466.

[16] Luenberger, D.G. 1984. Linear and Nonlinear Programming (2nd ed.), Addison-Wesley.

[17] Mangasarian O.L. 1970. Convexity, pseudo-convexity and quasi-convexity of composite

functions. Cahiers du Centre d’Etudes de Recherche Operationnelle, 12, 114-122.

[18] Polatoglu, H. and I. Sahin. 2000. Optimal procurement policies under price-dependent

demand. International Journal of Production Economics, 65, 141-171.

[19] Porteus, E.L. 1985a. Investing in reduced setups in the EOQ model. Management Science,

31 (8), 998-1010.

[20] Porteus, E.L. 1985b. Undiscounted approximations of discounted regenerative models. Op-

erations Research Letters, 3, 293-300.

41

[21] Puterman, M.L. 1975. A diffusion process model for a storage system. In Logistics, M.A.

Geisler (ed.), North-Holland Publishing Company, 143-159.

[22] Rajan, A., Rakesh Steinberg and Richard Steinberg. 1992. Dynamic pricing and ordering

decisions by a monopolist. Management Science, 38 (2), 240-262.

[23] Ross, S.M. 1996. Stochastic Processes (2nd ed.) Wiley, New York.

[24] Thomas, L.J. 1974. Price and production decisions with random demand. Operations Re-

search, 22 (3), 513-518.

[25] Thowsen, G.T. 1975. A dynamic, nonstationary invneotry problem for a price/quantity

setting firm. Navel Research Logistics Quaterly, 22, 461-476.

[26] Topkis, D.M. 1978. Minimizing a submodular function on a lattice. Operations Research,

26 (2), 305-321.

[27] Whitin, T.M. 1955. Inventory control and price theory. Management Science, 2 (1), 61-68.

[28] Yano, C.A. and S.M. Gilbert. 2003. Coordinated pricing and production/procurement deci-

sions: a review. In: Managing Business Interfaces: Marketing, Engineering and Manufac-

turing Perspectives, A. Chakravarty and J. Eliashberg (eds.), Kluwer Academic Publishers

(forthcoming).

[29] Zabel E. 1972. Multiperiod monopoly under uncertainty. Journal of Economic Theory, 5,

524-536.

42

1

10 20 30 40p0

20

40

60

80

S

(a)

10 20 30 40p

200

100

0

100

200

300

400

V

σσσσ 1111

σσσσ 30303030

σσσσ 60606060

(b)

Figure 1. Optimal replenishment with a fixed price

2

20 40 60 80 100S25

30

35

p

σσσσ 1111σσσσ 30303030σσσσ 60606060

(a)

20 40 60 80 100S0

100

200

300

400

500V

σσσσ 1111σσσσ 30303030σσσσ 60606060

(b)

Figure 2. Optimal price with a fixed replenishment level

3

5 10 15 200

10

20

30

40

50

p

p1

p2

p3

p4

(a) The highest price is decreasing in σ

0.5 1 1.5 2 2.5 30

10

20

30

40

50

p

p1

p2

p3

p4

(b) The lowest price is decreasing in h

100 200 300 400

24

26

28

30

p

p1

p2

p3

p4

(c) The lowest price is decreasing in S

Figure 3. Optimal N prices and optimal single price

σ

h

S

0

10

20

30

40

50p, S

5

0

5

10

15

20V

Sequential S*

Joint p*

4 8 12 16σσσσ

(a)

F

Sequential p*
Joint
Joint S*

4

4 8 12 16σσσσ

(b)

igure 4. Sequential decision vs. joint decision

Sequential

5

0.24 0.242 0.244 0.246 0.248 0.250

2

4

6

8

0.24 0.242 0.244 0.246 0.248 0.2541

41.5

42

42.5

43

0.24 0.242 0.244 0.246 0.248 0.254.2

4.4

4.6

4.8

5

Figure 5. Impact of demand variability on optimal solutions

µ1*

S*

µ2*

σ

σ

σ

6

1 2 3 4 5 6 7 8 1066

66.1

66.2

66.3

66.4

S

(a)

1 2 3 4 5 6 7 8 10

27.5

28

28.5

29p

(b)

1 2 3 4 5 6 7 8 10437.4

437.5

437.6

V

(c)

Figure 6. The effect of N, the number of price changes

50

50

50

N

N

N

7

010

2030

4050

h0

10

20

30

40

50

0

200

400

010

2030

4050

h

(a)

010

2030

4050

h0

10

20

30

40

50

0

2.5

5

7.5

10

010

2030

4050

h

(b)

h

σσσσ

h

σσσσ

8

010

2030

4050

h0

10

20

30

40

50

0

2.5

5

7.5

10

010

2030

4050

h

(c)

0 10 20 30 40 50

h

0

10

20

30

40

50

00.000250.00050.000750.001

(d)

Figure 7. Multiple prices vs. a single price

h

σσσσ

σσσσ

h

1% 2% 3% 4% 5% 8%

20%

(%)

9

1020

3040h

0

5

10

15

1020

3040h

Figure 8. The bound on profi

h

Bound

50

10

20

30

40

50

50

t improvement

σσσσ

Documents

Optimal Pricing and Replenishment in a Single-Product